Documentation
¶
Index ¶
- func AddEdgesIfDoNotExist(currentNode string, neighborNodes []string) (neighborsAdded []string, err error)
- func CleanUrl(link string) string
- func FilterPage(e *colly.HTMLElement) (*colly.HTMLElement, error)
- func GetRandomNode() (string, error)
- func IsValidCrawlLink(link string) bool
- type Page
- type RArticleResp
- type RQuery
Constants ¶
This section is empty.
Variables ¶
This section is empty.
Functions ¶
func AddEdgesIfDoNotExist ¶
func AddEdgesIfDoNotExist( currentNode string, neighborNodes []string, ) ( neighborsAdded []string, err error, )
adds edge to DB, returns new neighbors added (to crawl on)
func FilterPage ¶
func FilterPage(e *colly.HTMLElement) (*colly.HTMLElement, error)
filters down full page body to elements we want to focus on
func GetRandomNode ¶
gets random article from metawiki API returns article in the form "/wiki/XXXXX"
func IsValidCrawlLink ¶
determines if is good link to crawl on
Types ¶
type RArticleResp ¶
type RArticleResp struct {
Query RQuery `json:"query"`
}
Click to show internal directories.
Click to hide internal directories.