bench_spec

command
v1.26.0 Latest Latest
Warning

This package is not in the latest version of its module.

Go to latest
Published: Mar 27, 2026 License: Apache-2.0 Imports: 13 Imported by: 0

Documentation

Overview

Command bench_spec benchmarks speculative decoding speedup by comparing standalone target model decode against speculative decode (target + draft).

Usage:

bench_spec --model-target /path/to/27B.gguf --model-draft /path/to/1B.gguf [--tokens 200] [--prompts 10] [--backend cuda] [--warmup 2] [--draft-len 4]

Jump to

Keyboard shortcuts

? : This menu
/ : Search site
f or F : Jump to
y or Y : Canonical URL