- Sep 02, 2018
-
ale authored
-
- Aug 31, 2018
-
ale authored
-
ale authored
-
ale authored
Makes it possible to retry requests for temporary HTTP errors (429, 500, etc.).
-
ale authored
Handler errors are fatal, so that an error writing the WARC output will cause the crawl to abort.
-
ale authored
-
ale authored
Detect write errors (both to the database and to the WARC output) and abort with an error message. Also fix a bunch of harmless lint warnings.
-
ale authored
-
- Aug 30, 2018
- Dec 19, 2017
-
ale authored
Defaults that are more suitable for real-world site archiving.
-
ale authored
-
ale authored
-
ale authored
-
ale authored
-
ale authored
-
ale authored
-
ale authored
-
ale authored
-
ale authored
-
ale authored
This change allows more complex scope boundaries, including loosening the edges slightly to include the related resources of HTML pages, which makes for more complete archives if desired.
-
- Dec 18, 2017
- Jul 03, 2015
-
ale authored
-
- Jun 29, 2015
- Jun 28, 2015
- Dec 20, 2014
- Dec 19, 2014
-
ale authored
-