Documentation
¶
Index ¶
- Constants
- Variables
- type Config
- type DefaultLogger
- type Logger
- type PDFReader
- func (r *PDFReader) Author() string
- func (r *PDFReader) CreationDate() string
- func (r *PDFReader) Encrypted() bool
- func (r *PDFReader) Html() (string, error)
- func (r *PDFReader) Pages() ([]*Page, error)
- func (r *PDFReader) PagesNumber() int
- func (r *PDFReader) Text() (string, error)
- func (r *PDFReader) Title() string
- type Page
Constants ¶
const ( LogLevelError = "error" LogLevelDegub = "debug" )
LogLevelDegub is the debug log level.
Variables ¶
var ( ErrInvalidPath = errors.New("Invalid pdf path") ErrInvalidPagesNumber = errors.New("Invalid pages number. It must be greater than 0") )
Functions ¶
This section is empty.
Types ¶
type Config ¶
type Config struct { // Logger is the logger used by the PDFReader. // If nil, the DefaultLogger is used. Logger Logger // LogLevel is the log level used by the logger. // It can be "error" or "debug". LogLevel string }
Config is the configuration used by the PDFReader. It is used to configure the logger and the log level.
type DefaultLogger ¶
type DefaultLogger struct {
// contains filtered or unexported fields
}
DefaultLogger is the default logger.
func NewDefaultLogger ¶
func NewDefaultLogger(logLevel string) *DefaultLogger
NewDefaultLogger creates a new DefaultLogger.
func (*DefaultLogger) Debugf ¶
func (l *DefaultLogger) Debugf(format string, args ...interface{})
Debugf logs a debug message. It logs only if the level is "debug".
func (*DefaultLogger) Errorf ¶
func (l *DefaultLogger) Errorf(format string, args ...interface{})
Errorf logs an error message.
type Logger ¶
type Logger interface { // Debugf logs a debug message. Debugf(format string, args ...interface{}) // Errorf logs an error message. Errorf(format string, args ...interface{}) }
Logger is the interface that wraps the Debugf method.
type PDFReader ¶
type PDFReader struct {
// contains filtered or unexported fields
}
PDFReader represents a PDF file.
func New ¶
New creates a new PDFReader. It returns an error if the PDFReader cannot be created. The path argument is the path to the PDF file.
func NewPDFReader ¶
NewPDFReader creates a new PDFReader. It returns an error if the PDFReader cannot be created. The path argument is the path to the PDF file.
func (*PDFReader) Author ¶
Author returns the author of the PDF file or an empty string if the author is not defined.
func (*PDFReader) CreationDate ¶
CreationDate returns the creation date of the PDF file or an empty string if the creation date is not defined.
func (*PDFReader) Html ¶
Html returns the html of all pages from the PDF file. It returns an error if the html cannot be extracted.
func (*PDFReader) Pages ¶
Pages returns a slice of pointers to Page structs. Each Page struct contains the page number and the text and html extractors.
func (*PDFReader) PagesNumber ¶
PagesNumber returns the number of pages in the PDF file associated with the PDFReader.
type Page ¶
type Page struct { Number int TextExtractor core.Extractor HtmlExtractor core.Extractor // contains filtered or unexported fields }
Page represents a page from the PDF file.