speechrecog

v0.0.0-...-26fe002 Published: Jun 24, 2017 License: BSD-2-Clause

Repository

github.com/unixpickle/speechrecog

README

speechrecog

This is a set of tools for implementing speech recognition. This is my first time working with speech recognition, so I am not yet sure what will be needed. Nonetheless, here is what I have so far:

An MFCC package
A web app for recording and labeling speech samples
CTC recurrent neural net training
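
For readers new to CTC: the network emits one label per time step, including a special "blank" label, and the final sequence is recovered by merging consecutive repeats and then dropping blanks. The following is a minimal, standalone sketch of that collapsing step. It is not taken from this repository's ctc package; the name `collapseCTC` and the use of -1 as the blank label are assumptions made purely for illustration.

```go
package main

import "fmt"

// blank is a hypothetical label index standing in for the CTC blank symbol.
const blank = -1

// collapseCTC merges consecutive duplicate labels and then drops blanks,
// turning a per-frame label sequence into an output sequence.
func collapseCTC(frames []int) []int {
	var out []int
	prev := blank
	for _, label := range frames {
		// Emit a label only when it starts a new run and is not a blank.
		if label != prev && label != blank {
			out = append(out, label)
		}
		prev = label
	}
	return out
}

func main() {
	// Frames "0 0 _ 0 1 1 _" collapse to "0 0 1": the blank between the
	// two runs of 0 is what keeps them from merging into a single 0.
	fmt.Println(collapseCTC([]int{0, 0, blank, 0, 1, 1, blank}))
}
```

The blank label is what lets a CTC model output the same symbol twice in a row, which is why repeats are merged before blanks are removed rather than after.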

License

This is under a BSD 2-clause license. See LICENSE.

Directories

Path	Synopsis
ctc	Package ctc implements Connectionist Temporal Classification for training models (typically neural networks) to predict output sequences.
mfcc	Package mfcc can compute Mel-frequency cepstrum coefficients from raw sample data.
mfcc-graph	Command mfcc-graph produces HTML graphs of MFCC coefficients for an audio file.
recorder	Command recorder is a web app for recording speech clips.
speechdata	Package speechdata facilitates loading and saving databases of labeled speech samples.
