parser

package
v0.2.1 Latest Latest
Warning

This package is not in the latest version of its module.

Go to latest
Published: Sep 21, 2020 License: MIT Imports: 4 Imported by: 6

Documentation

Index

Constants

This section is empty.

Variables

This section is empty.

Functions

This section is empty.

Types

type DocumentParser

type DocumentParser struct {
}

DocumentParser is an HTML document parser.

func (p *DocumentParser) ExtractLinks(baseURL string, body io.Reader) ([]string, error)

ExtractLinks extracts relative links from an net/http response with a base url. It returns links which only contain the base url to avoid crawling external links.

type Parser

type Parser interface {
	ExtractLinks(string, io.Reader) ([]string, error)
}

Parser is a common interface to parse a document.

Jump to

Keyboard shortcuts

? : This menu
/ : Search site
f or F : Jump to
y or Y : Canonical URL