uni

package module
v0.0.0-...-18b4bb5 Latest Latest
Warning

This package is not in the latest version of its module.

Go to latest
Published: Mar 18, 2023 License: MIT Imports: 6 Imported by: 0

README

A Dictionary of Kagome v2

A dictionary package of kagome v2. This software includes a binary and/or source version of data from

  • unidic-mecab-2.1.2_src

which can be obtained from

Feature Fields

Features are information given to a word, such as follows:

公園に行ったex
公園	名詞,普通名詞,一般,*,*,*,コウエン,公園,公園,コーエン,公園,コーエン,漢,*,*,*,*
に	助詞,格助詞,*,*,*,*,ニ,に,に,ニ,に,ニ,和,*,*,*,*
行っ	動詞,非自立可能,*,*,五段-カ行,連用形-促音便,イク,行く,行っ,イッ,行く,イク,和,*,*,*,*
た	助動詞,*,*,*,助動詞-タ,終止形-一般,タ,た,た,タ,た,タ,和,*,*,*,*
No. feature name description
0 POS hierarchy 品詞大分類 The POS name and each level in its hierarchical structure.
1 POS hierarchy 1 品詞中分類
2 POS hierarchy 2 品詞小分類
3 POS hierarchy 3 品詞細分類
4 cType 活用型 Inflection type indicates a category that is an inflected form, e.g. 五段-カ行.
5 cForm 活用形 Inflected form, e.g. 連用形-促音便.
6 lForm 語彙素読み A possible reading for a lemma entry. Readings are given in katakana, e.g. コウエン, イク.
7 lemma 語彙素表記 A lemma, e.g. 公園, 行く.
8 orth 書字形 A conjugation form for an orthBase. e.g. 行く -> 行っ.
9 pron 発音形 A possible pronunciations for an orthToken. Pronunciations are given in katakana, .e.g. コーエン, イッ.
10 orthBase 書字形基本形 A form of dictionary headword, e.g. 行っ -> 行く.
11 pronBase 発音形基本形 A possible pronunciations for an entry headword. Pronunciations are given in katakana, .e.g. イク.
12 goshu 語種 A possible pronunciations for an entry. Pronunciations are given in katakana, .e.g. コーエン.
13 iType 語頭変化型
14 iForm 語頭変化形
15 fType 語末変化型
16 fForm 語末変化型

Licence

MIT

Documentation

Index

Constants

View Source
const (

	// POSHierarchy represents part-of-speech hierarchy
	// e.g. Columns 動詞,非自立可能,*,* are POSs which hierarchy depth is 4.
	POSHierarchy = 4
	// CType represents  活用型 (e.g. 五段-カ行).
	CType = 4
	// CForm represents 活用形 (e.g. 連用形-促音便).
	CForm = 5
	// LForm represents 読み (e.g. コウエン).
	LForm = 6
	// Lemma represents 語彙素 (e.g. 公園, 行く).
	Lemma = 7
	// Orth represents 書字形出現形.
	Orth = 8
	// Pron represents 発音形出現形.
	Pron = 9
	// OrthBase represents 書字形基本型.
	OrthBase = 10
	// PronBase represents 発音形基本型.
	PronBase = 11
	// Goshu represents 語種.
	Goshu = 12
	// IType represents 語頭変化型.
	IType = 13
	// IForm represents 語頭変化形.
	IForm = 14
	// FType represents 語末変化型.
	FType = 15
	// FForm represents 語末変化形.
	FForm = 16

	// InflectionalType represents 活用型 (e.g. 五段-カ行), an alias for CType.
	InflectionalType FeatureIndex = 4
	// InflectionalForm represents 活用形 (e.g. 連用形-促音便), an alias for CForm.
	InflectionalForm = 5
	// BaseForm represents 基本形 (e.g. 行く), an alias for Lemma.
	BaseForm = 7
	// Pronunciation represents 発音 (e.g. コーエン), an alias for Pron.
	Pronunciation = 9
)

Variables

This section is empty.

Functions

func Dict

func Dict() *dict.Dict

Dict returns a dictionary.

func DictShrink

func DictShrink() *dict.Dict

DictShrink returns a dictionary without content part. note. If an unshrinked dictionary already exists, this function returns it.

Types

type FeatureIndex

type FeatureIndex = int

Jump to

Keyboard shortcuts

? : This menu
/ : Search site
f or F : Jump to
y or Y : Canonical URL