Documentation
¶
Index ¶
Constants ¶
This section is empty.
Variables ¶
This section is empty.
Functions ¶
This section is empty.
Types ¶
type HtmlExtractor ¶
type HtmlExtractor struct {
// contains filtered or unexported fields
}
func NewHtmlExtractor ¶
func NewHtmlExtractor(path string, totalOfPages int) *HtmlExtractor
NewHtmlExtractor creates a new HtmlExtractor. The path argument is the path to the PDF file. The totalOfPages argument is the total of pages of the PDF file.
func (*HtmlExtractor) Extract ¶
func (e *HtmlExtractor) Extract() (string, error)
Extract extracts the html from the PDF file. It returns an error if the html cannot be extracted.
func (*HtmlExtractor) ExtractPage ¶
func (e *HtmlExtractor) ExtractPage(pageNumber int) (string, error)
Extract extracts the text from the PDF file. It returns an error if the html cannot be extracted. The pageNumber argument is the page number to extract the html from.
Click to show internal directories.
Click to hide internal directories.