- Nov 13, 2022
- Jul 12, 2021
-
- Jul 11, 2021
-
-
renovate authored
-
- Jun 19, 2021
- Aug 26, 2020
- Aug 24, 2020
- Aug 23, 2020
- Aug 20, 2020
-
-
ale authored
-
- Jul 30, 2020
- Feb 17, 2020
- Dec 04, 2019
-
-
ale authored
-
- Nov 13, 2019
- Oct 07, 2019
- Sep 26, 2019
- Jan 20, 2019
-
-
ale authored
Introduce an interface to decouple the Enqueue functionality from the Crawler implementation.
-
- Jan 19, 2019
-
-
ale authored
The whole URLInfo structure, while neat, is unused except for the purpose of verifying if we have already seen a specific URL. The presence check is also now limited to Enqueue().
-
- Jan 02, 2019
-
-
ale authored
The output stage can now write to size-limited, rotating WARC files using a user-specified pattern, so that output files are always unique.
-
- Dec 28, 2018
-
-
ale authored
-