  1. Jul 30, 2020
    • Panic on fatal errors · de767cce
      ale authored
      This allows users of crawl-as-a-library to recover from unexpected
      errors as a last resort.
  2. Feb 17, 2020
  3. Dec 04, 2019
  4. Nov 13, 2019
  5. Oct 07, 2019
  6. Sep 26, 2019
  7. Jan 20, 2019
  8. Jan 19, 2019
    • Replace URLInfo with a simple URL presence check · cce28f44
      ale authored
      The whole URLInfo structure, while neat, is unused except for the
      purpose of verifying if we have already seen a specific URL.
      
      The presence check is also now limited to Enqueue().
  9. Jan 02, 2019
    • Add multi-file output · c5ec7eb8
      ale authored
      The output stage can now write to size-limited, rotating WARC files
      using a user-specified pattern, so that output files are always
      unique.
  10. Dec 28, 2018
  11. Dec 27, 2018
  12. Dec 06, 2018
  13. Sep 02, 2018
  14. Aug 31, 2018
  15. Aug 30, 2018
  16. Dec 19, 2017
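
The "Panic on fatal errors" commit (de767cce) says that panicking lets users of crawl-as-a-library recover from unexpected errors as a last resort. A minimal sketch of that pattern in Go follows. The names crawlOnce and safeCrawl are hypothetical stand-ins, not part of the crawl package, and the error message is made up for illustration; only the defer/recover mechanism itself is standard Go.

```go
package main

import (
	"errors"
	"fmt"
)

// crawlOnce is a hypothetical stand-in for a library call that
// panics when it hits a fatal error (not the real crawl API).
func crawlOnce(fail bool) {
	if fail {
		panic(errors.New("fatal: database unavailable"))
	}
}

// safeCrawl wraps the call and turns a panic into an ordinary error,
// so the embedding program can log it and keep running.
func safeCrawl(fail bool) (err error) {
	defer func() {
		// recover returns nil unless the goroutine is panicking.
		if r := recover(); r != nil {
			err = fmt.Errorf("crawl aborted: %v", r)
		}
	}()
	crawlOnce(fail)
	return nil
}

func main() {
	fmt.Println(safeCrawl(false))
	fmt.Println(safeCrawl(true))
}
```

Note that recover only catches panics raised in the same goroutine as the deferred function, so a wrapper like this would not help if the library panicked in a goroutine it started itself.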