Documentation
¶
Overview ¶
Package removals provides content-pattern-based removal for the defuddle extraction pipeline.
Index ¶
Constants ¶
This section is empty.
Variables ¶
This section is empty.
Functions ¶
func RemoveByContentPattern ¶
func RemoveByContentPattern(mainContent *goquery.Selection, _ *goquery.Document, debug bool, pageURL string)
RemoveByContentPattern detects and removes boilerplate, metadata, and navigational fragments from mainContent. It is a faithful port of the TypeScript removeByContentPattern function.
Types ¶
This section is empty.
Click to show internal directories.
Click to hide internal directories.