bench_spec

command

v1.26.2 Latest Latest Go to latest Published: Mar 27, 2026 License: Apache-2.0 Imports: 13 Imported by: 0

Details

Valid go.mod file
Redistributable license
Tagged version
Stable version
Learn more about best practices

Repository

github.com/zerfoo/zerfoo

Links

Open Source Insights

Documentation ¶

Overview ¶

Command bench_spec benchmarks speculative decoding speedup by comparing standalone target model decode against speculative decode (target + draft).

Usage:

bench_spec --model-target /path/to/27B.gguf --model-draft /path/to/1B.gguf [--tokens 200] [--prompts 10] [--backend cuda] [--warmup 2] [--draft-len 4]

Source Files ¶

View all Source files

main.go

?	: This menu
/	: Search site
f or F	: Jump to
y or Y	: Canonical URL