fino

package module

v0.9.0 Latest Latest Go to latest Published: Jun 3, 2026 License: MIT Imports: 0 Imported by: 0

Details

Valid go.mod file
Redistributable license
Tagged version
Stable version
Learn more about best practices

Repository

github.com/nethinwei/fino

Links

Open Source Insights

README ¶

fino

A minimal, reliable ReAct agent SDK for Go — built from small composable primitives, not a framework.

English | 简体中文

fino gives you exactly one thing, done well: the ReAct feedback loop that makes an LLM agent work.

model response → tool call → tool execution → tool result → next model response → final answer

Everything else — orchestration, persistence, RAG, MCP, permissions, deployment — stays in your code, behind small interfaces you can implement in an afternoon. No graph engine. No hidden state. No vendor lock-in. The core depends on the Go standard library only.

Why fino?

Most agent frameworks grow until they own your application. fino does the opposite: it draws a deliberately narrow boundary and exposes every capability as an open primitive instead of a closed feature.

You want…	fino gives you a primitive	You stay in control of
A model	`model.Model` interface	Which LLM, proxy, or local runtime
A tool	`tool.Tool` + `tool.NewFunc`	Filesystem, bash, MCP, RAG, DB, any API
Authorization	`policy.Policy` interface	Confirmation, RBAC, audit, sandbox, allowlists
Personas	`agent.Mode`	plan / code / review / debug instructions & toolsets
Observability	`hooks.Hooks`	Logging, tracing, metrics, cost accounting
Multi-agent	handoff tool helper	LLM-driven or deterministic Go control flow
Memory & history	explicit message input	SQLite, Redis, files, your own session store
Execution	`runner.Run` / `runner.Stream`	HTTP handlers, CLI loops, queues, cron, workflows

A capability earns a place in the core only if it is part of the ReAct loop and cannot be implemented cleanly via a Tool, Policy, Hook, Mode, Model, or code around the Runner. Otherwise it belongs in your code — not ours.

Features

ReAct loop, done right — turn limits, tool authorization, lifecycle hooks, and clean termination.
Streaming as first-class semantic events — text deltas, live reasoning, tool calls, tool results, handoffs, a complete assistant snapshot per model turn (TurnMessage), and one run-terminal event — FinalMessage on completion or Suspended when a policy suspends for approval — all via iter.Seq2.
Modes — one agent, multiple personas (distinct instructions, tools, and model options).
Handoffs — model-driven transfer between agents, modeled as an ordinary tool.
Pluggable policy — authorize, deny, or gate every tool call before it runs.
Lifecycle hooks — observe and extend model calls and tool executions without forking the loop.
Bounded parallel tools — opt-in concurrent execution within a single tool-call batch, with deterministic ordering.
Resilient transport — streaming-safe connection timeouts and retry-with-backoff in the bundled providers.
Zero core dependencies — the standard library, nothing else.

Install

go get github.com/nethinwei/fino

Requires Go 1.23+ (for iter.Seq2).

Quickstart

A model, a tool, and an agent — copy-paste runnable:

package main

import (
	"context"
	"fmt"
	"os"

	"github.com/nethinwei/fino/agent"
	"github.com/nethinwei/fino/providers/deepseek"
	"github.com/nethinwei/fino/runner"
	"github.com/nethinwei/fino/tool"
)

type addInput struct {
	A int `json:"a" jsonschema:"description=first addend"`
	B int `json:"b" jsonschema:"description=second addend"`
}

func main() {
	m, _ := deepseek.New("deepseek-v4-flash", os.Getenv("DEEPSEEK_API_KEY"))

	add, _ := tool.NewFunc("add", "Add two integers",
		func(ctx context.Context, in addInput) (string, error) {
			return fmt.Sprintf("%d", in.A+in.B), nil
		})

	mode, _ := agent.NewMode("default", "Use the add tool for arithmetic.", agent.WithTools(add))
	a, _ := agent.New("assistant", agent.WithMode(mode), agent.WithDefaultMode("default"))

	r, _ := runner.New(m)
	result, _ := r.Run(context.Background(), a, runner.Text("What is 2 + 3? Use the add tool."))
	fmt.Println(result.Text())
}

DEEPSEEK_API_KEY=sk-... go run .

Errors are elided with _ for brevity. All constructors return errors and the runnable programs in examples/ handle them properly.

Core concepts

Seven single-purpose packages:

message/  roles, messages, content blocks (text / tool_use / tool_result / thinking)
tool/     Tool interface, function-tool helper, JSON Schema inference
model/    Model interface (Generate + Stream), stream event types
agent/    Agent, Mode (instructions + tools), handoff tool helper
policy/   Policy interface (pre-execution authorization), AllowAll default
hooks/    lifecycle hooks (BeforeModel / AfterModel / BeforeTool / AfterTool / OnError)
runner/   the ReAct loop executor — Run, Stream, Input, Result

The Runner holds only configuration; each run owns its own message list, so one Runner is safe to reuse across concurrent runs.

Streaming

Stream yields semantic events you can pipe straight into a terminal UI, WebSocket, or trace.

for ev, err := range r.Stream(ctx, a, runner.Text(prompt)) {
	if err != nil {
		log.Fatal(err) // terminal error; iteration stops
	}
	switch e := ev.(type) {
	case model.TextDelta:
		fmt.Print(e.Text) // token-by-token
	case model.ContentBlockDelta:
		// live reasoning ("thinking") fragments
	case model.ToolCall:
		fmt.Printf("\n→ %s(%s)\n", e.Call.Name, e.Call.Input)
	case model.ToolResult:
		fmt.Printf("← %s\n", e.Result.Text())
	case model.Handoff:
		fmt.Printf("⇄ handoff to %s\n", e.Target)
	case model.TurnMessage:
		// complete assistant snapshot for each model turn
	case model.FinalMessage:
		// run-terminal result on completion (emitted once, by the Runner)
	case model.Suspended:
		// a policy suspended the batch for human approval; rebuild a
		// SuspendedRun and resume after collecting approvals:
		//   sr := runner.SuspendedRunFrom(e)
		//   r.ResumeApproved(ctx, a, sr, approvals)
	}
}

All terminal errors are reported through the iterator's second return value and paired with a final model.StreamError event — one consistent error path. Use errors.Is / errors.As for ErrMaxTurns, ErrToolNotFound, ToolDeniedError, or context.Canceled.

Tools

A tool is any type implementing tool.Tool. The NewFunc helper turns a typed Go function into one and infers the JSON Schema from struct tags:

search, err := tool.NewFunc(
	"search", "Search the web",
	func(ctx context.Context, in SearchInput) (string, error) {
		return searchWeb(in.Query)
	},
	tool.WithMetadata("category", "network"),
)

Return string (auto-wrapped as a text block) or a tool.Result for structured/multi-block output. Need a hand-written schema? Pass tool.WithSchema(...).

Authorization with Policy

A Policy is consulted before every tool call. Implement confirmation, RBAC, sandboxing, or risk scoring — the core ships only AllowAll.

type confirmPolicy struct{}

func (confirmPolicy) Authorize(ctx context.Context, req policy.Request) (policy.Decision, error) {
	if req.Tool.Name == "delete_file" {
		return policy.Decision{Allow: false, Reason: "destructive op needs review"}, nil
	}
	return policy.Decision{Allow: true}, nil
}

r, _ := runner.New(m, runner.WithPolicy(confirmPolicy{}))

A denied call surfaces as a *runner.ToolDeniedError; a returned error means the policy system itself failed — the two are distinct by design.

Observability with Hooks

Hooks observe and extend the loop without changing it. All fields are nil-safe.

r, _ := runner.New(m, runner.WithHooks(&hooks.Hooks{
	BeforeModel: func(ctx context.Context, c hooks.ModelCall) context.Context {
		log.Printf("→ model (%s/%s), %d msgs", c.AgentName, c.ModeName, len(c.Messages))
		return ctx
	},
	AfterTool: func(ctx context.Context, r hooks.ToolResult) {
		log.Printf("← tool %s", r.Tool.Name)
	},
	OnError: func(ctx context.Context, err error) { log.Printf("error: %v", err) },
}))

Modes & multi-agent handoff

One agent can hold several modes (personas); a run can start in any of them, and the model can switch agents through a handoff tool.

plan, _ := agent.NewMode("plan", "Think and outline. Do not edit files.")
code, _ := agent.NewMode("code", "Implement the plan.", agent.WithTools(editFile))
a, _ := agent.New("assistant", agent.WithMode(plan), agent.WithMode(code), agent.WithDefaultMode("plan"))

result, _ := r.Run(ctx, a, runner.Text("Add a /health endpoint"), runner.WithMode("code"))

// Hand control to a specialist agent — modeled as a plain tool:
handoff, _ := agent.NewHandoffTool(reviewer)

Providers

providers/ ships seven adapters, all standard-library only, so the core stays dependency-free:

Generic: providers/openai (OpenAI-compatible), providers/anthropic (Anthropic-compatible)
Presets: providers/deepseek, providers/kimi, providers/glm, providers/qwen, providers/minimax

Presets wrap the generic adapters with the right base URL and vendor-specific parameters. The universal extension points are model.WithExtra and openai.WithExtraBody. Adapters include streaming-safe connection timeouts and retry-with-backoff:

m, _ := openai.New("gpt-4o",
	openai.WithAPIKey(os.Getenv("OPENAI_API_KEY")),
	openai.WithTimeout(30*time.Second), // bounds dial + TLS, never the stream
	openai.WithMaxRetries(2),           // exponential backoff on 429 / 5xx
)

Implementing your own provider is just satisfying model.Model (Generate + Stream).

Examples

Example	What it shows
`examples/hello`	Minimal end-to-end run with a hook trace log
`examples/multi_mode`	One agent switching between `plan` and `code` modes
`examples/streaming`	Consuming `Stream` events with visible token-by-token output
`examples/history_trim`	Wrapping `model.Model` to trim history — the composition pattern, one wrapper for all providers
`examples/cookbook`	Offline, deterministic recipes for the hard problems — HITL approval + resume, bounded parallel tools, RAG-as-a-tool — plus guidance for MCP-as-a-tool
finocode ↗	The flagship reference app, in its own repo: a Claude Code-style coding agent built only on fino — REPL, y/N tool authorization with write diffs, mode switching, handoff sub-agents, full hooks, and a real `go` toolchain in a temp workspace. A constructive proof of the sufficiency thesis.

Provider-backed examples run against DeepSeek by default (examples/cookbook uses an in-file scripted model and runs offline, no API key needed):

DEEPSEEK_API_KEY=sk-... go run ./examples/streaming
# optional: DEEPSEEK_MODEL=deepseek-v4-pro (stronger tier; both support thinking modes)

What fino is not

By design, fino does not include: graph/DAG orchestration · RAG pipelines (loaders, chunking, embeddings, retrieval) · built-in filesystem/bash/web/search/code tools · an MCP implementation · HTTP servers, CLIs, or workers · fixed permission semantics (e.g. AllowWrite) · hidden session stores or state machines.

Each of these is better expressed in your code, an example, or an add-on package — composed with fino, never bolted into it.

Sufficiency: hard problems without a framework

fino's thesis is that reliable execution infrastructure for complex tool-using agents does not require an application-owning framework; it requires a semantically sufficient runtime kernel, explicit effect boundaries, and composable policies. That claim is made checkable, not just asserted:

Precise semantics — the ReAct loop is specified as a state-transition system in docs/spec/loop-semantics.md, with invariants for ordered results, single tool messages, terminal errors, stream contracts, and safe-boundary continuation.
Verified, not just tested — current property tests cover the protocol trace of serial and parallel runs. Parallel claims are scoped to protocol-trace equivalence under tool-independence assumptions, not arbitrary external-state equivalence.
Constructive evidence — the x/ packages demonstrate replay, recovery, tracing, budgets, and eval as compositions over existing seams, while documenting where future effect-aware runtime contracts are still required.

Add-on	Problem	Seam it rides on
`x/replay`	Reproducibility & audit	records an execution tape over public seams — model responses, policy decisions, tool executions, suspends, approvals, resumes, and termination; replay drives the run without calling real providers, tools, or policies
`x/recover`	Crash recovery & durable continuation	safe-boundary continuation (`history + mode`) plus an opt-in pending-tool seam for blind/crash resume; HITL approval resume is `runner.ResumeApproved`, not `x/recover`
`x/trace`	Tracing & observability	deterministic `hooks.Hooks` firing
`x/budget`	Cost / token budgets	a `model.Model` decorator
`x/eval`	Reproducible regression testing	runs deterministic cases over the recorded tape; `RunWithOptions` can wire `ReplayPolicy` for policy-sensitive fixtures

The replay tape is reproducibility and audit evidence, not proof of business correctness; it provides no exactly-once side effects, durable workflow, or tamper resistance.

The core never changes to add a capability — only, if ever, to expose a missing seam. See the seam discipline in docs/design.md.

Status

The API follows a consistent shape — NewX(required, opts ...Option) (*X, error) — and is approaching stability, but may still change before a tagged v1. Pin a commit if you need reproducibility.

Effect-aware concurrency (WithMaxConcurrency gated by Effects.ParallelSafe) and the idempotency boundary (tool.ExecutionContext + WithRunID) have landed. See docs/roadmap.md for the remaining path toward the reference-proof case study.

Contributing

Issues and PRs are welcome. Please read CONTRIBUTING.md for the coding standards (Google Go Style, gofmt, TDD, no external core dependencies) and docs/design.md for the design boundaries before changing core packages.

License

Documentation ¶

Overview ¶

Package fino provides minimal primitives for building LLM agents.

The core SDK implements the ReAct feedback loop and leaves orchestration, persistence, permissions semantics, RAG, tools, provider clients, and deployment to users.

Source Files ¶

View all Source files

doc.go

Directories ¶

?	: This menu
/	: Search site
f or F	: Jump to
y or Y	: Canonical URL

Path	Synopsis
agent Package agent defines Agent, Mode, and HandoffTool for the fino Agent SDK.	Package agent defines Agent, Mode, and HandoffTool for the fino Agent SDK.
examples
coding_agent command
cookbook/hitl_resume command Command hitl_resume shows human-in-the-loop tool approval built only from fino primitives: a Policy suspends a sensitive tool (DecisionSuspend), the run halts with a suspended Result, a human approves or rejects each pending call, and runner.ResumeApproved continues the ReAct loop.	Command hitl_resume shows human-in-the-loop tool approval built only from fino primitives: a Policy suspends a sensitive tool (DecisionSuspend), the run halts with a suspended Result, a human approves or rejects each pending call, and runner.ResumeApproved continues the ReAct loop.
cookbook/parallel_tools command Command parallel_tools shows bounded concurrent tool execution within a single tool-call batch via runner.WithMaxConcurrency.	Command parallel_tools shows bounded concurrent tool execution within a single tool-call batch via runner.WithMaxConcurrency.
cookbook/rag_as_tool command Command rag_as_tool shows that retrieval-augmented generation needs no core support: a retriever is just a tool.Tool.	Command rag_as_tool shows that retrieval-augmented generation needs no core support: a retriever is just a tool.Tool.
hello command Command hello is a minimal end-to-end fino example: an agent with one tool, driven by an OpenAI-compatible model.	Command hello is a minimal end-to-end fino example: an agent with one tool, driven by an OpenAI-compatible model.
history_trim command Command history_trim demonstrates the canonical fino extension pattern: wrapping model.Model.	Command history_trim demonstrates the canonical fino extension pattern: wrapping model.Model.
multi_mode command Command multi_mode shows one agent with two modes — "plan" and "code" — and how to select a mode per run with runner.WithMode.	Command multi_mode shows one agent with two modes — "plan" and "code" — and how to select a mode per run with runner.WithMode.
streaming command Command streaming shows how to consume the Runner's streaming events: text deltas, tool calls, tool results, and the final message.	Command streaming shows how to consume the Runner's streaming events: text deltas, tool calls, tool results, and the final message.
hooks Package hooks defines lifecycle hooks for observing and extending model calls and tool executions in the fino Agent SDK.	Package hooks defines lifecycle hooks for observing and extending model calls and tool executions in the fino Agent SDK.
message Package message defines the data types exchanged between models, tools, and the Runner in the fino Agent SDK.	Package message defines the data types exchanged between models, tools, and the Runner in the fino Agent SDK.
model Package model defines the Model interface and stream event types for the fino Agent SDK.	Package model defines the Model interface and stream event types for the fino Agent SDK.
policy Package policy defines the authorization interface for tool execution in the fino Agent SDK.	Package policy defines the authorization interface for tool execution in the fino Agent SDK.
providers
anthropic Package anthropic implements a model.Model adapter for the Anthropic Messages API and any Anthropic-compatible endpoint (such as DeepSeek's https://api.deepseek.com/anthropic).	Package anthropic implements a model.Model adapter for the Anthropic Messages API and any Anthropic-compatible endpoint (such as DeepSeek's https://api.deepseek.com/anthropic).
deepseek Package deepseek provides preset constructors for DeepSeek, which exposes both an OpenAI-compatible and an Anthropic-compatible endpoint.	Package deepseek provides preset constructors for DeepSeek, which exposes both an OpenAI-compatible and an Anthropic-compatible endpoint.
glm Package glm provides a preset constructor and typed options for Zhipu's GLM models, which expose an OpenAI-compatible endpoint.	Package glm provides a preset constructor and typed options for Zhipu's GLM models, which expose an OpenAI-compatible endpoint.
internal/httpx Package httpx holds the HTTP transport tuning shared by the provider adapters: a streaming-safe client with connection-phase timeouts and a retry loop for transient failures.	Package httpx holds the HTTP transport tuning shared by the provider adapters: a streaming-safe client with connection-phase timeouts and a retry loop for transient failures.
internal/sse Package sse holds the Server-Sent Events streaming loop shared by the provider adapters.	Package sse holds the Server-Sent Events streaming loop shared by the provider adapters.
kimi Package kimi provides a preset constructor for Moonshot's Kimi models, which expose an OpenAI-compatible endpoint.	Package kimi provides a preset constructor for Moonshot's Kimi models, which expose an OpenAI-compatible endpoint.
minimax Package minimax provides a preset constructor and typed options for MiniMax models, which expose an OpenAI-compatible endpoint.	Package minimax provides a preset constructor and typed options for MiniMax models, which expose an OpenAI-compatible endpoint.
openai Package openai implements a model.Model adapter for the OpenAI Chat Completions API and any OpenAI-compatible endpoint (such as DeepSeek's https://api.deepseek.com).	Package openai implements a model.Model adapter for the OpenAI Chat Completions API and any OpenAI-compatible endpoint (such as DeepSeek's https://api.deepseek.com).
qwen Package qwen provides a preset constructor and typed options for Alibaba's Qwen models on DashScope, which expose an OpenAI-compatible endpoint.	Package qwen provides a preset constructor and typed options for Alibaba's Qwen models on DashScope, which expose an OpenAI-compatible endpoint.
runner Package runner executes the ReAct feedback loop for the fino Agent SDK.	Package runner executes the ReAct feedback loop for the fino Agent SDK.
tool Package tool defines the Tool interface, function-tool helpers, and JSON Schema inference for the fino Agent SDK.	Package tool defines the Tool interface, function-tool helpers, and JSON Schema inference for the fino Agent SDK.
x
agui Package agui adapts fino runs to the AG-UI protocol.	Package agui adapts fino runs to the AG-UI protocol.
budget Package budget provides a cost-bounded model decorator for fino.	Package budget provides a cost-bounded model decorator for fino.
eval Package eval provides reproducible regression testing for fino agents.	Package eval provides reproducible regression testing for fino agents.
recover Package recover provides durable continuation for fino agent runs.	Package recover provides durable continuation for fino agent runs.
replay Package replay provides record-and-replay and an execution tape for fino agent runs.	Package replay provides record-and-replay and an execution tape for fino agent runs.
trace Package trace adapts fino lifecycle hooks to a minimal tracer.	Package trace adapts fino lifecycle hooks to a minimal tracer.