Documentation
¶
Index ¶
- type PDFVersion
- type Reader
- func (r *Reader) CacheSize() int
- func (r *Reader) ClearCache()
- func (r *Reader) Close() error
- func (r *Reader) ExtractText(page *pages.Page) (string, error)
- func (r *Reader) ExtractTextFragments(page *pages.Page) ([]text.TextFragment, error)
- func (r *Reader) FileSize() int64
- func (r *Reader) GetCatalog() (core.Dict, error)
- func (r *Reader) GetInfo() (core.Dict, error)
- func (r *Reader) GetObject(objNum int) (core.Object, error)
- func (r *Reader) GetPage(index int) (*pages.Page, error)
- func (r *Reader) NumObjects() int
- func (r *Reader) ObjectStreamCacheSize() int
- func (r *Reader) PageCount() (int, error)
- func (r *Reader) Resolve(obj core.Object) (core.Object, error)
- func (r *Reader) ResolveDeep(obj core.Object) (core.Object, error)
- func (r *Reader) ResolveReference(ref core.IndirectRef) (core.Object, error)
- func (r *Reader) Trailer() core.Dict
- func (r *Reader) Version() PDFVersion
- func (r *Reader) XRefTable() *core.XRefTable
Constants ¶
This section is empty.
Variables ¶
This section is empty.
Functions ¶
This section is empty.
Types ¶
type PDFVersion ¶
PDFVersion represents a PDF version
func (PDFVersion) String ¶
func (v PDFVersion) String() string
String returns the version as a string (e.g., "1.7")
type Reader ¶
type Reader struct {
// contains filtered or unexported fields
}
Reader represents a PDF file reader
func (*Reader) ClearCache ¶
func (r *Reader) ClearCache()
ClearCache clears the object cache and object stream cache Useful for freeing memory when processing large PDFs
func (*Reader) ExtractText ¶
ExtractText extracts text from a page and returns it as a string This is a convenience method for simple text extraction
func (*Reader) ExtractTextFragments ¶
ExtractTextFragments extracts text fragments from a page This is a convenience method that handles content stream decoding and font registration
func (*Reader) GetCatalog ¶
GetCatalog returns the document catalog (root object)
func (*Reader) GetObject ¶
GetObject loads an object by its number Uses caching to avoid re-reading objects Supports both uncompressed objects and objects in object streams (PDF 1.5+)
func (*Reader) NumObjects ¶
NumObjects returns the total number of objects in the PDF
func (*Reader) ObjectStreamCacheSize ¶
ObjectStreamCacheSize returns the number of cached object streams
func (*Reader) Resolve ¶
Resolve resolves an object if it's an indirect reference, otherwise returns it as-is Implements pages.ObjectResolver interface
func (*Reader) ResolveDeep ¶
ResolveDeep recursively resolves all indirect references in an object Implements pages.ObjectResolver interface
func (*Reader) ResolveReference ¶
ResolveReference resolves an indirect reference