eval

command
v0.1.0 Latest Latest
Warning

This package is not in the latest version of its module.

Go to latest
Published: Jun 1, 2026 License: MIT Imports: 13 Imported by: 0

Documentation

Overview

Command eval runs mneme's prompt-evaluation harness against a live LLM and prints a per-metric table plus an aggregate score per prompt version. It is the number we trust when we change the extraction prompt.

Usage:

go run ./cmd/eval                 # uses .env / MNEME_* env, fake embedder
go run ./cmd/eval -model x -k 5   # override model and search depth

It is intentionally outside `go test ./...` (which must stay offline): a run needs network and an API key.

Jump to

Keyboard shortcuts

? : This menu
/ : Search site
f or F : Jump to
y or Y : Canonical URL