buckets

package

v0.0.0-...-c180764 Latest Latest Go to latest Published: Jan 31, 2023 License: Apache-2.0 Imports: 6 Imported by: 0

Details

Valid go.mod file

The Go module system was introduced in Go 1.11 and is the official dependency management solution for Go.
Redistributable license

Redistributable licenses place minimal restrictions on how software can be used, modified, and redistributed.
Tagged version

Modules with tagged versions give importers more predictable builds.
Stable version

When a project reaches major version v1 it is considered stable.
Learn more about best practices

Repository

github.com/gleanerio/gleaner

README ¶

Buckets

NOTE

code isn't wired in or working for any purpose yet. it's a placeholder for a plan

About

Code here needs to manage the buckets that a crawl goes into.

Buckets can be moved for archive reasons or simply purged.

The sitemap.xml + prov graph does not tell us much really. We don't know if a DO has been updated without a hash. We can not rely on the sitemap update date.

On each index we can "honor" the sitemap and not index a resource in prov (from s3select calls) or "ignore" the sitemap and do a file index.

We can "honor" for a time too. N days for example.

Config file section

update mode: honor One of honor, ignore, age

The process is easy

ignore