Commit c5ec7eb8 authored by ale

Add multi-file output

The output stage can now write to size-limited, rotating WARC files
using a user-specified pattern, so that output files are always
unique.
parent 3518feaf
......@@ -56,6 +56,15 @@ avoid calendars, admin panels of common CMS applications, and other
well-known pitfalls. This list is sourced from the
[ArchiveBot](https://github.com/ArchiveTeam/ArchiveBot) project.
If you're running a larger crawl, the tool can be told to rotate the
output WARC files when they reach a certain size (100MB by default,
controlled by the *--output-max-size* flag). To do so, make sure the
*--output* option contains the literal token `%s` somewhere, which
will be replaced by a unique identifier every time a new file is
created, e.g.:
$ crawl --output=out-%s.warc.gz http://example.com/
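For example, to cap each output file at roughly 500MB instead of the
default:
$ crawl --output=out-%s.warc.gz --output-max-size=500 http://example.com/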
## Limitations
Like most crawlers, this one has a number of limitations:
......
......@@ -34,7 +34,8 @@ var (
depth = flag.Int("depth", 100, "maximum link depth")
validSchemes = flag.String("schemes", "http,https", "comma-separated list of allowed protocols")
excludeRelated = flag.Bool("exclude-related", false, "include related resources (css, images, etc) only if their URL is in scope")
outputFile = flag.String("output", "crawl.warc.gz", "output WARC file")
outputFile = flag.String("output", "crawl.warc.gz", "output WARC file or pattern (patterns must include a \"%s\" literal token)")
warcFileSizeMB = flag.Int("output-max-size", 100, "maximum output WARC file size (in MB) when using patterns")
cpuprofile = flag.String("cpuprofile", "", "create cpu profile")
excludes []*regexp.Regexp
......@@ -63,7 +64,7 @@ type excludesFileFlag struct{}
func (f *excludesFileFlag) String() string { return "" }
func (f *excludesFileFlag) Set(s string) error {
ff, err := os.Open(s)
ff, err := os.Open(s) // #nosec
if err != nil {
return err
}
......@@ -246,6 +247,19 @@ func (b *byteCounter) Read(buf []byte) (int, error) {
return n, err
}
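// warcWriterFromFlags builds the output writer from the command-line
// flags: a multi-file writer if --output contains the "%s" pattern
// token, a plain single-file writer otherwise.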
func warcWriterFromFlags() (w *warc.Writer, err error) {
if strings.Contains(*outputFile, "%s") {
w, err = warc.NewMultiWriter(*outputFile, uint64(*warcFileSizeMB)*1024*1024)
} else {
var f *os.File
f, err = os.Create(*outputFile)
if err == nil {
w = warc.NewWriter(f)
}
}
return
}
func main() {
flag.Parse()
......@@ -271,11 +285,10 @@ func main() {
scope = crawl.AND(crawl.OR(scope, crawl.NewIncludeRelatedScope()), crawl.NewRegexpIgnoreScope(excludes))
}
outf, err := os.Create(*outputFile)
w, err := warcWriterFromFlags()
if err != nil {
log.Fatal(err)
}
w := warc.NewWriter(outf)
defer w.Close() // nolint
saver, err := newWarcSaveHandler(w)
......
package warc
import (
"bufio"
"encoding/binary"
"encoding/hex"
"fmt"
"io"
"os"
"sync/atomic"
"time"
)
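// newID returns a unique, lexically sortable identifier based on the
// current time in nanoseconds.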
func newID() string {
var b [8]byte
binary.BigEndian.PutUint64(b[:], uint64(time.Now().UnixNano()))
return hex.EncodeToString(b[:])
}
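// meteredWriter wraps an io.WriteCloser and atomically counts the
// bytes written through it.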
type meteredWriter struct {
io.WriteCloser
bytes uint64
}
func (m *meteredWriter) Write(b []byte) (int, error) {
n, err := m.WriteCloser.Write(b)
if n > 0 {
atomic.AddUint64(&m.bytes, uint64(n))
}
return n, err
}
func (m *meteredWriter) Bytes() uint64 {
return atomic.LoadUint64(&m.bytes)
}
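// bufferedWriter adds buffering to an io.WriteCloser, flushing the
// buffer before closing the underlying writer.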
type bufferedWriter struct {
*bufio.Writer
io.Closer
}
func newBufferedWriter(w io.WriteCloser) *bufferedWriter {
return &bufferedWriter{
Writer: bufio.NewWriter(w),
Closer: w,
}
}
func (w *bufferedWriter) Close() error {
if err := w.Writer.Flush(); err != nil {
return err
}
return w.Closer.Close()
}
func openFile(path string) (*meteredWriter, error) {
f, err := os.Create(path)
if err != nil {
return nil, err
}
return &meteredWriter{WriteCloser: newBufferedWriter(f)}, nil
}
// multiWriter writes records to a series of size-limited output
// files, rotating them at record boundaries. Unsafe for concurrent
// access.
type multiWriter struct {
pattern string
maxSize uint64
cur *meteredWriter
}
func newMultiWriter(pattern string, maxSize uint64) rawWriter {
if maxSize == 0 {
maxSize = 100 * 1024 * 1024
}
return &multiWriter{
pattern: pattern,
maxSize: maxSize,
}
}
func (w *multiWriter) newFilename() string {
return fmt.Sprintf(w.pattern, newID())
}
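// NewRecord opens a new output file if none is open yet, or if the
// current one has grown beyond maxSize.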
func (w *multiWriter) NewRecord() (err error) {
if w.cur == nil || w.cur.Bytes() > w.maxSize {
if w.cur != nil {
if err = w.cur.Close(); err != nil {
return
}
}
w.cur, err = openFile(w.newFilename())
}
return
}
func (w *multiWriter) Write(b []byte) (int, error) {
return w.cur.Write(b)
}
func (w *multiWriter) Close() error {
return w.cur.Close()
}
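// simpleWriter writes all records to a single buffered output file.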
type simpleWriter struct {
*bufferedWriter
}
func newSimpleWriter(w io.WriteCloser) rawWriter {
return &simpleWriter{newBufferedWriter(w)}
}
func (w *simpleWriter) NewRecord() error {
return nil
}
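// rawWriter is the byte-level output of a Writer. NewRecord is
// invoked before every record, giving implementations a safe point to
// rotate output files.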
type rawWriter interface {
io.WriteCloser
NewRecord() error
}
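Because rotation only happens inside NewRecord, a record is never
split across two files, and a file can overshoot maxSize by at most
one record. A minimal package-internal sketch of that contract
(exampleRotation is a hypothetical helper, not part of this commit;
the 1KB cap is for illustration only):
func exampleRotation() error {
	w := newMultiWriter("out-%s.warc.gz", 1024)
	payload := make([]byte, 400)
	for i := 0; i < 10; i++ {
		// Rotation can only happen here, between records, so every
		// file contains whole records.
		if err := w.NewRecord(); err != nil {
			return err
		}
		if _, err := w.Write(payload); err != nil {
			return err
		}
	}
	return w.Close()
}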
......@@ -3,9 +3,10 @@
package warc
import (
"bufio"
"errors"
"fmt"
"io"
"strings"
"time"
"compress/gzip"
......@@ -57,7 +58,7 @@ func (h Header) Encode(w io.Writer) error {
return err
}
}
_, err := fmt.Fprintf(w, "\r\n")
_, err := io.WriteString(w, "\r\n")
return err
}
......@@ -74,20 +75,22 @@ func NewHeader() Header {
// Writer can write records to a file in WARC format. It is safe
// for concurrent access, since writes are serialized internally.
type Writer struct {
writer io.WriteCloser
bufwriter *bufio.Writer
gzwriter *gzip.Writer
lockCh chan bool
writer rawWriter
lockCh chan struct{}
}
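// recordWriter wraps the gzip stream of a single record. Closing it
// writes the end-of-record marker, closes the stream, and releases
// the Writer lock.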
type recordWriter struct {
io.Writer
lockCh chan bool
io.WriteCloser
lockCh chan struct{}
}
func (rw *recordWriter) Close() error {
// Add the end-of-record marker.
_, err := fmt.Fprintf(rw, "\r\n\r\n")
_, err := io.WriteString(rw, "\r\n\r\n")
if err != nil {
return err
}
err = rw.WriteCloser.Close()
<-rw.lockCh
return err
}
......@@ -98,41 +101,57 @@ func (rw *recordWriter) Close() error {
// is satisfied. If this function returns an error, the state of the
// Writer is invalid and it should no longer be used.
func (w *Writer) NewRecord(hdr Header) (io.WriteCloser, error) {
w.lockCh <- true
if w.gzwriter != nil {
w.gzwriter.Close() // nolint
w.lockCh <- struct{}{}
if err := w.writer.NewRecord(); err != nil {
return nil, err
}
var err error
w.gzwriter, err = gzip.NewWriterLevel(w.bufwriter, gzip.BestCompression)
gzwriter, err := gzip.NewWriterLevel(w.writer, gzip.BestCompression)
if err != nil {
return nil, err
}
w.gzwriter.Header.Name = hdr.Get("WARC-Record-ID")
if err = hdr.Encode(w.gzwriter); err != nil {
gzwriter.Header.Name = hdr.Get("WARC-Record-ID")
if err = hdr.Encode(gzwriter); err != nil {
return nil, err
}
return &recordWriter{Writer: w.gzwriter, lockCh: w.lockCh}, nil
return &recordWriter{
WriteCloser: gzwriter,
lockCh: w.lockCh,
}, nil
}
// Close the WARC writer and flush all buffers. This will also call
// Close on the wrapped io.WriteCloser object.
func (w *Writer) Close() error {
if err := w.gzwriter.Close(); err != nil {
return err
}
if err := w.bufwriter.Flush(); err != nil {
return err
}
w.lockCh <- struct{}{} // acquire the lock; intentionally never released
defer close(w.lockCh) // pending NewRecord calls will panic (send on closed channel)
return w.writer.Close()
}
// NewWriter initializes a new Writer and returns it.
func NewWriter(w io.WriteCloser) *Writer {
return &Writer{
writer: w,
bufwriter: bufio.NewWriter(w),
writer: newSimpleWriter(w),
// Buffering is important here since we're using this
// channel as a semaphore.
lockCh: make(chan bool, 1),
lockCh: make(chan struct{}, 1),
}
}
// NewMultiWriter initializes a new Writer that writes its output to
// multiple files of limited size approximately equal to maxSize,
// rotating them when necessary. The input path should contain a
// literal '%s' token, which will be replaced with a (lexically
// sortable) unique token.
func NewMultiWriter(pattern string, maxSize uint64) (*Writer, error) {
if !strings.Contains(pattern, "%s") {
return nil, errors.New("input path is not a pattern")
}
return &Writer{
writer: newMultiWriter(pattern, maxSize),
// Buffering is important here since we're using this
// channel as a semaphore.
lockCh: make(chan struct{}, 1),
}, nil
}
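As a usage illustration (not part of this commit), a client of the
multi-file API might look like the sketch below; the import path is
assumed from this repository's layout, and the 10MB cap and payload
are placeholders:
package main

import (
	"log"

	"git.autistici.org/ale/crawl/warc"
)

func main() {
	// Rotate output files at roughly 10MB.
	w, err := warc.NewMultiWriter("out-%s.warc.gz", 10*1024*1024)
	if err != nil {
		log.Fatal(err)
	}
	defer w.Close() // nolint

	for i := 0; i < 3; i++ {
		rec, err := w.NewRecord(warc.NewHeader())
		if err != nil {
			log.Fatal(err)
		}
		if _, err := rec.Write([]byte("example record payload")); err != nil {
			log.Fatal(err)
		}
		if err := rec.Close(); err != nil {
			log.Fatal(err)
		}
	}
}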
package warc
import (
"fmt"
"io/ioutil"
"os"
"path/filepath"
"sync"
"testing"
)
var testData = []byte("this is some very interesting test data of non-zero size")
func writeRecords(w *Writer, n int) error {
for i := 0; i < n; i++ {
hdr := NewHeader()
rec, err := w.NewRecord(hdr)
if err != nil {
return fmt.Errorf("NewRecord: %v", err)
}
_, err = rec.Write(testData)
if err != nil {
return fmt.Errorf("record Write: %v", err)
}
if err := rec.Close(); err != nil {
return fmt.Errorf("record Close: %v", err)
}
}
return nil
}
func writeManyRecords(t testing.TB, w *Writer, n int) {
if err := writeRecords(w, n); err != nil {
t.Fatal(err)
}
}
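// writeManyRecordsConcurrently exercises the Writer from nproc
// goroutines at once, each writing n records.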
func writeManyRecordsConcurrently(t testing.TB, w *Writer, n, nproc int) {
startCh := make(chan struct{})
errCh := make(chan error, nproc+1)
var wg sync.WaitGroup
for i := 0; i < nproc; i++ {
wg.Add(1)
go func() {
defer wg.Done()
<-startCh
if err := writeRecords(w, n); err != nil {
errCh <- err
}
}()
}
go func() {
wg.Wait()
errCh <- nil
}()
close(startCh)
if err := <-errCh; err != nil {
t.Fatalf("a worker got an error: %v", err)
}
}
func TestWARC_WriteSingleFile(t *testing.T) {
dir, err := ioutil.TempDir("", "")
if err != nil {
t.Fatal(err)
}
defer os.RemoveAll(dir)
f, err := os.Create(filepath.Join(dir, "out.warc.gz"))
if err != nil {
t.Fatal(err)
}
w := NewWriter(f)
writeManyRecords(t, w, 1000)
if err := w.Close(); err != nil {
t.Fatalf("Close: %v", err)
}
}
func TestWARC_WriteMulti(t *testing.T) {
dir, err := ioutil.TempDir("", "")
if err != nil {
t.Fatal(err)
}
defer os.RemoveAll(dir)
var targetSize int64 = 10240
w, err := NewMultiWriter(filepath.Join(dir, "out.%s.warc.gz"), uint64(targetSize))
if err != nil {
t.Fatal(err)
}
writeManyRecords(t, w, 1000)
if err := w.Close(); err != nil {
t.Fatalf("Close: %v", err)
}
files, _ := ioutil.ReadDir(dir)
if len(files) < 2 {
t.Fatalf("MultiWriter didn't create enough files (%d)", len(files))
}
for _, f := range files[:len(files)-1] {
if f.Size() < targetSize {
t.Errorf("output file %s is too small (%d bytes)", f.Name(), f.Size())
}
}
}
func TestWARC_WriteMulti_Concurrent(t *testing.T) {
dir, err := ioutil.TempDir("", "")
if err != nil {
t.Fatal(err)
}
defer os.RemoveAll(dir)
var targetSize int64 = 100000
w, err := NewMultiWriter(filepath.Join(dir, "out.%s.warc.gz"), uint64(targetSize))
if err != nil {
t.Fatal(err)
}
writeManyRecordsConcurrently(t, w, 1000, 10)
if err := w.Close(); err != nil {
t.Fatalf("Close: %v", err)
}
files, _ := ioutil.ReadDir(dir)
if len(files) < 2 {
t.Fatalf("MultiWriter didn't create enough files (%d)", len(files))
}
for _, f := range files[:len(files)-1] {
if f.Size() < targetSize {
t.Errorf("output file %s is too small (%d bytes)", f.Name(), f.Size())
}
}
}