parser

package
v1.0.1 Latest Latest
Warning

This package is not in the latest version of its module.

Go to latest
Published: Oct 20, 2025 License: MIT Imports: 10 Imported by: 0

Documentation

Index

Constants

This section is empty.

Variables

This section is empty.

Functions

This section is empty.

Types

type DynamicParser

type DynamicParser struct {
	// contains filtered or unexported fields
}

DynamicParser handles dynamic JavaScript extraction using headless browser

func NewDynamicParser

func NewDynamicParser(timeout time.Duration, waitTime int, useHeadless bool) *DynamicParser

NewDynamicParser creates a new DynamicParser instance

func (*DynamicParser) ExtractJSFiles

func (dynamicParser *DynamicParser) ExtractJSFiles(pageURL string) ([]string, error)

ExtractJSFiles extracts JavaScript files using chromedp

func (*DynamicParser) GetPageInfo

func (dynamicParser *DynamicParser) GetPageInfo(pageURL string) (*PageInfo, error)

GetPageInfo extracts additional page information

type PageInfo

type PageInfo struct {
	URL   string
	Title string
	HTML  string
}

PageInfo contains information about a web page

type Parser

type Parser struct {
	// contains filtered or unexported fields
}

Parser extracts JavaScript files from HTML

func NewParser

func NewParser() *Parser

NewParser creates a new Parser

func (*Parser) ExtractInlineJS

func (p *Parser) ExtractInlineJS(html string) []string

ExtractInlineJS extracts inline JavaScript code blocks

func (*Parser) ExtractJSFiles

func (p *Parser) ExtractJSFiles(html, baseURL string) ([]string, error)

ExtractJSFiles extracts all unique JS file URLs from HTML

Jump to

Keyboard shortcuts

? : This menu
/ : Search site
f or F : Jump to
y or Y : Canonical URL