pkg/

directory
v0.1.0 Latest
Published: Jun 13, 2026 License: Apache-2.0

Directories

Path Synopsis
Package warc reads WARC files the way Common Crawl stores them: a stream of gzip members, one record per member.
Package wat reads WAT files, the Common Crawl archive of per-page metadata: the response status and content type, the HTML title and meta tags, and the outbound links.
Package wet reads WET files, the Common Crawl archive of extracted plain text.
