08-batch-inference

command
v1.11.1
Published: Mar 24, 2026 License: Apache-2.0 Imports: 6 Imported by: 0

Documentation

Overview

Recipe 08: Batch Inference

Run inference over many prompts concurrently using goroutines. This pattern is useful for processing datasets, running evaluations, or handling any other batch workload.

The program loads a single model and fans out generation across a configurable number of worker goroutines, collecting results in order.

Usage:

go run ./docs/cookbook/08-batch-inference/ --model path/to/model.gguf
