Documentation
Constants ¶
const (
	StrategyReadability = "readability"
	StrategySelector    = "selector"
)
Strategy constants for extraction methods.
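As an illustration of how these constants might be used, the sketch below validates a strategy name taken from configuration. The constant values are copied from the block above; `validateStrategy` is a hypothetical helper and not part of this package.

```go
package main

import "fmt"

// Values copied from the package's constant block.
const (
	StrategyReadability = "readability"
	StrategySelector    = "selector"
)

// validateStrategy is a hypothetical helper (an assumption for this sketch).
// It reports an error for any name that is not one of the two constants.
func validateStrategy(name string) error {
	switch name {
	case StrategyReadability, StrategySelector:
		return nil
	default:
		return fmt.Errorf("unknown extraction strategy %q", name)
	}
}

func main() {
	for _, name := range []string{StrategyReadability, StrategySelector, "regex"} {
		if err := validateStrategy(name); err != nil {
			fmt.Println("rejected:", err)
			continue
		}
		fmt.Println("accepted:", name)
	}
}
```

Comparing against the named constants rather than string literals keeps a misspelled strategy from slipping through unnoticed.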
Variables ¶
This section is empty.
Functions ¶
This section is empty.
Types ¶
type Extractor ¶
type Extractor interface {
	Extract(html io.Reader, pageURL string, logger *slog.Logger) (string, error)
}
Extractor pulls the main content from an HTML document.
type ReadabilityExtractor ¶
type ReadabilityExtractor struct{}
ReadabilityExtractor uses the go-readability library to extract article content.