goSRT

command module

v1.2.0 Latest Latest Go to latest Published: Nov 22, 2025 License: MIT Imports: 18 Imported by: 0

Details

Valid go.mod file
Redistributable license
Tagged version
Stable version
Learn more about best practices

Repository

github.com/MeteorsLiu/goSRT

Links

Open Source Insights

README ¶

goTranscriber

GoTranscriber启发于pyTranscriber

由于pyTranscriber极其低效的音频转换效率和高额内存占用。

我使用Golang重塑了pyTranscriber

goTranscriber在处理一个两小时的视频的时候，仅需要100MB内存的占用，相比于pyTranscriber超过1GB内存占用，有了较大的提升。

同时，更换pyTranscriber中的使用计算音频RMS(Root Mean Sqrt)获取声强方式，goTranscriber默认采用16000Hz的WebRTC VAD算法进行检测声音区域，更加精准。

当然，经过一个星期的测试，goTranscriber拥有极佳的稳定性，并不会像pyTranscriber一样随便崩溃。

基本设计

视频 -> FFmpeg预处理音频 -> WebRTC VAD (保守模式)识别人声区域 -> 切片 -> 多线程上传Google Speech-To-Text API -> 翻译 -> 生成SRT文件

问题

WebRTC VAD 识别正确性很高，但是切片会原本是同一句话硬切成两份，这是因为Google Speech-To-Text API最多容许10秒（实测，官方说12秒）音频片段。目前补救手段是，通过分析停顿来优化切分，避免一句话被切成碎片。尽管听起来可靠，但遇到有BGM人声区域，还是会乱切。
BGM会一定程度上影响识别

后续开发

上述问题，其实很难解决。因为，即使把vad切片准确性优化到极致，依然无法避免10秒切片带来的上下文丢失

后续，这个项目就此终止，其替代品我推荐：faster-whisper

faster-whisper对whisper进行了优化，而且引入基于机器学习的Silero Vad进行人声区域切片避免上下文过长和引入一些无关片段。

我试着用了下，除了速度比goTranscriber慢很多，其效果是相当好的。

我想，goTranscriber其实已经可以就此停止了。如果追求速度，可以用goTranscriber，其速度是相当快的，一个2小时的视频大概需要30-45分钟就可以完成，而且对于使用者算力并无要求。

但如果你有一定算力，想要更好的质量，那我推荐faster-whisper。

安装教程

Linux(Debian为例)

apt install gcc g++ ffmpeg -y
wget https://go.dev/dl/go1.19.3.linux-amd64.tar.gz && rm -rf /usr/local/go && tar -C /usr/local -xzf go1.19.3.linux-amd64.tar.gz
git clone https://github.com/MeteorsLiu/goTranscriber.git
cd goTranscriber
/usr/local/go/bin/go build 
./goSRT -file xxx -lang xx

English

I am sorry that I don't provide binary file.

It's required to build it by yourself.

Currently, goTranscriber DON'T support Windows, for the Gcc compiler reason.

YOU NEED TO Build this project in Linux.

Linux(Debian/Ubuntu)

apt install gcc g++ ffmpeg -y
wget https://go.dev/dl/go1.19.3.linux-amd64.tar.gz && rm -rf /usr/local/go && tar -C /usr/local -xzf go1.19.3.linux-amd64.tar.gz
git clone https://github.com/MeteorsLiu/goTranscriber.git
cd goTranscriber
/usr/local/go/bin/go build 
./goSRT -file xxx -lang xx

Documentation ¶

There is no documentation for this package.

Source Files ¶

View all Source files

Directories ¶

Path	Synopsis
srt
transcribe
voice

?	: This menu
/	: Search site
f or F	: Jump to
y or Y	: Canonical URL