Directories
| Path | Synopsis |
|---|---|
| cmd | |
| cmd/bench-cli (command) | |
| internal | |
| internal/auth | Package auth provides authentication middleware and context helpers. |
| internal/benchdb | Package benchdb manages PostgreSQL connections and bench schema migrations. |
| pkg | |
| pkg/adapter | Package adapter defines the agent adapter contract and built-in adapters. |
| pkg/agent | Package agent provides pluggable LLM providers and a multi-turn tool-use agent loop. |
| pkg/artifact | Package artifact writes local run artifact bundles. |
| pkg/autopsy | Package autopsy classifies failed benchmark runs from existing artifacts. |
| pkg/bench | Package timeline classifies agent tool calls into decision phases. |
| pkg/config | Package config defines run configuration for bench-cli. |
| pkg/environment | Package environment manages disposable cluster lifecycles. |
| pkg/harness | Package harness orchestrates the benchmark run loop. |
| pkg/jobqueue | Package jobqueue provides River-based job scheduling for parallel bench runs. |
| pkg/orchestrator | Package orchestrator manages the full bench lifecycle: provision cluster → run scenarios in parallel → teardown. |
| pkg/report | Package report provides offline evidence writing for benchmark runs. |
| pkg/scenario | Package scenario defines the scenario model and loader. |
| pkg/signalaudit | Package signalaudit loads and analyzes signal-audit expectations for run artifacts. |
| pkg/store | Package store provides structured result storage with SQLite + JSONL backup. |
| pkg/tui | Package tui provides an interactive terminal UI for browsing and running scenarios. |
| pkg/verifier | Package verifier evaluates scenario outcome quality. |
| pkg/workspace | Package workspace provides isolated directories for parallel bench jobs. |
| | generate-catalog reads all scenario.yaml files and writes ui/src/data/catalog.ts. |