benchrunner

command

v0.4.1 Published: Jun 8, 2026 License: MIT Imports: 17 Imported by: 0

Repository

github.com/amemiya02/deepseekcode

Documentation

Overview

benchrunner is a black-box benchmark harness that compares coding agents by running them against a set of tasks and collecting structured traces.

Usage:

go run ./bench/cmd/benchrunner/ [flags]
go build ./bench/cmd/benchrunner/ && ./benchrunner [flags]

Flags:

--agent string      Filter to a single agent ID (e.g., "deepseekcode-current")
--task string       Filter to a single task ID (e.g., "ctx-long-readonly")
--dry-run           Show what would run without executing
--bench-dir string  Root bench directory (default "bench")
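
The flag set above could be wired up roughly as follows. This is a minimal sketch using the standard library flag package, which accepts both -name and --name forms; it is not taken from benchrunner's main.go, and the config struct and parseFlags helper are hypothetical names introduced only for illustration. The real command may use a different flag library or layout.

```go
package main

import (
	"flag"
	"fmt"
	"os"
)

// config mirrors the four documented flags. The struct and its field names
// are hypothetical; the real benchrunner may organize its options differently.
type config struct {
	Agent    string
	Task     string
	DryRun   bool
	BenchDir string
}

// parseFlags parses args (without the program name) into a config.
// Flag names and the "bench" default come from the Flags list above.
func parseFlags(args []string) (config, error) {
	var c config
	fs := flag.NewFlagSet("benchrunner", flag.ContinueOnError)
	fs.StringVar(&c.Agent, "agent", "", "Filter to a single agent ID")
	fs.StringVar(&c.Task, "task", "", "Filter to a single task ID")
	fs.BoolVar(&c.DryRun, "dry-run", false, "Show what would run without executing")
	fs.StringVar(&c.BenchDir, "bench-dir", "bench", "Root bench directory")
	if err := fs.Parse(args); err != nil {
		return config{}, err
	}
	return c, nil
}

func main() {
	c, err := parseFlags(os.Args[1:])
	if err != nil {
		// The flag package has already printed the error and usage text.
		os.Exit(2)
	}
	fmt.Printf("%+v\n", c)
}
```

Under these assumptions, an invocation such as --agent deepseekcode-current --dry-run would yield a config with Agent set, DryRun true, and BenchDir left at its default of "bench".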

Source Files

main.go
