# zerfoo-edge
Minimal edge/embedded inference binary for Zerfoo. Supports CPU-only GGUF model inference with no training, distributed, serving, GPU, or AutoML dependencies.
## Build

```sh
go build -tags edge ./cmd/zerfoo-edge/
```

Cross-compile for ARM64 (e.g., Raspberry Pi 5):

```sh
GOOS=linux GOARCH=arm64 go build -tags edge ./cmd/zerfoo-edge/
```
## Usage

Interactive mode:

```sh
./zerfoo-edge google/gemma-3-1b
```

Single-shot mode:

```sh
./zerfoo-edge google/gemma-3-1b --prompt "What is 2+2?"
```

With generation parameters:

```sh
./zerfoo-edge google/gemma-3-1b \
  --temperature 0.7 \
  --max-tokens 512 \
  --system "You are a helpful assistant"
```
## Options

| Flag | Description |
|------|-------------|
| `--prompt <text>` | Single-shot prompt (exits after generating) |
| `--system <text>` | System prompt |
| `--temperature <float>` | Sampling temperature (default: 1.0) |
| `--top-k <int>` | Top-K sampling |
| `--top-p <float>` | Top-P nucleus sampling |
| `--repetition-penalty <float>` | Penalize repeated tokens |
| `--max-tokens <int>` | Maximum tokens to generate |
| `--cache-dir <dir>` | Override the model cache directory |
| `--version` | Print version and exit |
## What's excluded

The edge binary intentionally excludes:

- Training (`training/`)
- Distributed training (`distributed/`)
- HTTP/API server (`serve/`)
- GPU backends (CUDA, ROCm, OpenCL)
- AutoML and NAS
- Tabular model support
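Exclusions like these are typically enforced with Go build constraints, matching the `-tags edge` flag in the build command. A minimal sketch of the mechanism (the file layout and stub are assumptions for illustration, not the actual Zerfoo source):

```go
// This file carries a !edge constraint, so `go build -tags edge`
// drops it entirely and the linked binary never pulls in the heavy
// subsystems it would reference; an edge build would supply a stub
// variant of buildFlavor in a file tagged //go:build edge instead.
//go:build !edge

package main

import "fmt"

// buildFlavor reports which feature set was compiled into this binary.
func buildFlavor() string {
	return "full: training, distributed, serve available"
}

func main() {
	fmt.Println(buildFlavor())
}
```

A default `go build` (no tags) satisfies `!edge` and compiles this file; the edge build excludes it.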