bench_prefix

command
v1.38.1 Latest Latest
Warning

This package is not in the latest version of its module.

Go to latest
Published: Mar 30, 2026 License: Apache-2.0 Imports: 5 Imported by: 0

Documentation

Overview

Command bench_prefix simulates a multi-turn chat workload to measure prefix cache hit rate and TTFT reduction. Each simulated user shares a common system prompt and appends unique per-turn history tokens.

Jump to

Keyboard shortcuts

? : This menu
/ : Search site
f or F : Jump to
y or Y : Canonical URL