zerfoo-edge

command
v1.31.0 Latest Latest
Warning

This package is not in the latest version of its module.

Go to latest
Published: Mar 28, 2026 License: Apache-2.0 Imports: 17 Imported by: 0

README

zerfoo-edge

Minimal edge/embedded inference binary for Zerfoo. Supports CPU-only GGUF model inference with no training, distributed, serving, GPU, or AutoML dependencies.

Build

go build -tags edge ./cmd/zerfoo-edge/

Cross-compile for ARM64 (e.g., Raspberry Pi 5):

GOOS=linux GOARCH=arm64 go build -tags edge ./cmd/zerfoo-edge/

Usage

Interactive mode:

./zerfoo-edge google/gemma-3-1b

Single-shot mode:

./zerfoo-edge google/gemma-3-1b --prompt "What is 2+2?"

With generation parameters:

./zerfoo-edge google/gemma-3-1b \
  --temperature 0.7 \
  --max-tokens 512 \
  --system "You are a helpful assistant"

Options

Flag Description
--prompt <text> Single-shot prompt (exits after generating)
--system <text> System prompt
--temperature <float> Sampling temperature (default: 1.0)
--top-k <int> Top-K sampling
--top-p <float> Top-P nucleus sampling
--repetition-penalty <float> Penalize repeated tokens
--max-tokens <int> Maximum tokens to generate
--cache-dir <dir> Override model cache directory
--version Print version and exit

What's excluded

The edge binary intentionally excludes:

  • Training (training/)
  • Distributed training (distributed/)
  • HTTP/API server (serve/)
  • GPU backends (CUDA, ROCm, OpenCL)
  • AutoML and NAS
  • Tabular model support

Documentation

Overview

Package main provides a minimal edge/embedded inference binary for Zerfoo.

Build: go build -tags edge ./cmd/zerfoo-edge/

The edge binary supports CPU-only inference with GGUF models. It excludes training, distributed, serve/API, GPU backends, AutoML, and NAS to produce a small, self-contained binary suitable for edge and embedded deployments.

Jump to

Keyboard shortcuts

? : This menu
/ : Search site
f or F : Jump to
y or Y : Canonical URL