Documentation
¶
Overview ¶
Package text provides text analysis utilities for content extraction.
Index ¶
Constants ¶
This section is empty.
Variables ¶
This section is empty.
Functions ¶
func CountWords ¶
CountWords counts words in plain text, handling CJK characters correctly. Each CJK character (Han, Hangul, Hiragana, Katakana) counts as one word. Latin and other scripts are counted by whitespace boundaries. Go's native rune iteration handles supplementary planes (CJK Extension B+) that the TypeScript charCodeAt approach cannot.
Types ¶
This section is empty.
Click to show internal directories.
Click to hide internal directories.