Directories
| Path | Synopsis |
|---|---|
| | benchrunner is a black-box benchmark harness that compares coding agents by running them against a set of tasks and collecting structured traces. |