montygo

package module

v0.2.0 Latest Latest Go to latest Published: Apr 11, 2026 License: MIT Imports: 8 Imported by: 4

Details

Valid go.mod file
Redistributable license
Tagged version
Stable version
Learn more about best practices

Repository

github.com/fugue-labs/monty-go

Links

Open Source Insights

README ¶

monty-go

Run LLM-generated Python safely from Go — no containers, no CGO, no subprocess.

A pure-Go wrapper around Pydantic's Monty Python interpreter, compiled to WebAssembly and loaded via wazero. Your Go agent writes Python code, monty-go executes it in a sandboxed WASM instance with sub-millisecond startup, and pauses whenever the code calls an external function so your Go code can handle it.

go get github.com/fugue-labs/monty-go

Why?

LLMs work faster, cheaper, and more reliably when they write code instead of making sequential tool calls. Instead of:

Agent → tool_call("search", {query: "weather london"}) → result
Agent → tool_call("search", {query: "weather tokyo"})  → result
Agent → tool_call("compare", {a: result1, b: result2}) → result

The LLM writes:

london = search(query="weather london")
tokyo = search(query="weather tokyo")
compare(a=london, b=tokyo)

One model call instead of three. The Python code calls your Go functions, Monty pauses at each call, your Go code executes it, and Monty resumes. No containers. No sandbox services. No exec(). Just a 2.9MB WASM binary embedded in your Go binary.

For motivation, see:

Programmatic Tool Calling from Anthropic
Code Execution with MCP from Anthropic
Code Mode from Cloudflare
Smol Agents from Hugging Face

Quick Start

package main

import (
    "context"
    "fmt"
    "log"

    montygo "github.com/fugue-labs/monty-go"
)

func main() {
    runner, err := montygo.New()
    if err != nil {
        log.Fatal(err)
    }
    defer runner.Close()

    result, err := runner.Execute(context.Background(),
        "x * 2 + y",
        map[string]any{"x": 10, "y": 5},
    )
    if err != nil {
        log.Fatal(err)
    }
    fmt.Println(result) // 25
}

External Functions (Pause/Resume)

The real power is external function calls. Monty pauses execution whenever Python code calls a function you've declared, your Go callback handles it, and Monty resumes with the return value:

result, err := runner.Execute(ctx,
    `
london = get_weather("London")
tokyo = get_weather("Tokyo")
f"{london['city']}: {london['temp']}°C, {tokyo['city']}: {tokyo['temp']}°C"
    `,
    nil,
    montygo.WithExternalFunc(func(ctx context.Context, call *montygo.FunctionCall) (any, error) {
        city, _ := call.Args["city"].(string)
        // Your real implementation here — HTTP call, database query, anything.
        return map[string]any{"city": city, "temp": 22}, nil
    }, montygo.Func("get_weather", "city")),
)
// result: "London: 22°C, Tokyo: 22°C"

Multiple functions work the same way — register them all and dispatch by name:

result, err := runner.Execute(ctx, code, nil,
    montygo.WithExternalFunc(func(ctx context.Context, call *montygo.FunctionCall) (any, error) {
        switch call.Name {
        case "search":
            return doSearch(call.Args)
        case "calculate":
            return doCalculate(call.Args)
        case "store":
            return doStore(call.Args)
        default:
            return nil, fmt.Errorf("unknown function: %s", call.Name)
        }
    },
        montygo.Func("search", "query"),
        montygo.Func("calculate", "expression"),
        montygo.Func("store", "key", "value"),
    ),
)

Resource Limits

Prevent runaway code with memory, time, allocation, and recursion limits:

result, err := runner.Execute(ctx, code, inputs,
    montygo.WithLimits(montygo.Limits{
        MaxDuration:       5 * time.Second,
        MaxMemoryBytes:    10 * 1024 * 1024, // 10 MB
        MaxAllocations:    100000,
        MaxRecursionDepth: 100,
    }),
)

Infinite loops, memory bombs, and deep recursion all terminate cleanly with a *MontyError. Go's context.Context deadlines are also respected — cancel the context and the WASM instance stops.

Print Capture

Capture Python print() output:

var output strings.Builder
_, err := runner.Execute(ctx, `print("step 1 done")`, nil,
    montygo.WithPrintFunc(func(s string) { output.WriteString(s) }),
)
fmt.Print(output.String()) // "step 1 done\n"

OS Calls

Python filesystem and environment access routes through your Go callback:

result, err := runner.Execute(ctx,
    `
from pathlib import Path
data = Path("/config/settings.json").read_text()
data
    `,
    nil,
    montygo.WithOsCallFunc(func(ctx context.Context, call *montygo.OsCall) (any, error) {
        switch call.Function {
        case "Path.read_text":
            path, _ := call.Args[0].(string)
            return readFromYourStorage(path)
        case "Path.exists":
            path, _ := call.Args[0].(string)
            return existsInYourStorage(path), nil
        default:
            return nil, fmt.Errorf("blocked: %s", call.Function)
        }
    }),
)

No filesystem access happens unless your callback allows it.

Gollem Integration

monty-go is designed to power code-mode in Gollem, the production agent framework for Go. Instead of sequential tool calls, the LLM writes Python that calls your tools as functions — Monty executes it safely, and Gollem orchestrates the whole thing.

Here's what this looks like with Gollem:

import (
    "github.com/fugue-labs/gollem"
    "github.com/fugue-labs/gollem/provider/anthropic"
    montygo "github.com/fugue-labs/monty-go"
)

// Your existing Gollem tools — search, calculate, store, whatever.
searchTool := gollem.FuncTool[SearchParams]("search", "Search the knowledge base", doSearch)
calcTool := gollem.FuncTool[CalcParams]("calculate", "Run calculations", doCalc)

// Create a code-mode tool that wraps your toolset with Monty.
// The LLM writes Python code, Monty executes it, external function calls
// route to your Go tools.
codeMode := NewCodeModeTool(runner, searchTool, calcTool)

agent := gollem.NewAgent[Analysis](anthropic.New(),
    gollem.WithTools[Analysis](codeMode),
    gollem.WithSystemPrompt[Analysis](`You have a code execution tool.
Write Python code to call the available functions. Available functions:
- search(query: str) -> dict: Search the knowledge base
- calculate(expression: str) -> float: Evaluate math expressions
Write code that calls these functions and returns the result.`),
)

result, _ := agent.Run(ctx, "Compare Q3 and Q4 revenue and calculate the growth rate")

With one model call, the LLM writes:

q3 = search(query="Q3 revenue")
q4 = search(query="Q4 revenue")
growth = calculate(expression=f"({q4['revenue']} - {q3['revenue']}) / {q3['revenue']} * 100")
{"q3": q3, "q4": q4, "growth_rate": growth}

Monty pauses three times (two searches, one calculation), your Go functions handle each one, and the final result flows back through Gollem's typed output pipeline. Three tool calls in one LLM round-trip.

Why Gollem + monty-go:

	Traditional tool calling	Code-mode with monty-go
LLM calls	One per tool use	One for all tools
Latency	N × model round-trip	1 × model round-trip + μs execution
Cost	N × input/output tokens	1 × input/output tokens
Logic	LLM reasons step by step	LLM writes the logic once
Control flow	None (sequential only)	Loops, conditionals, variables
Error handling	LLM must react to each failure	try/except in Python
Security	✅ (tools are Go functions)	✅ (WASM sandbox + your callbacks)

Gollem gives you compile-time type safety, structured output, guardrails, cost tracking, middleware, and multi-provider support. monty-go gives you secure embedded Python execution. Together, your agents do more work per model call.

github.com/fugue-labs/gollem — The production agent framework for Go.

How It Works

┌─────────────────────────────────────────────────────────┐
│  Your Go Application                                    │
│                                                         │
│  runner, _ := montygo.New()                             │
│  result, _ := runner.Execute(ctx, code, inputs, opts)   │
│       │                                                 │
│       ▼                                                 │
│  ┌──────────────────────────────────┐                   │
│  │  wazero (pure Go WASM runtime)  │                    │
│  │                                 │                    │
│  │  ┌───────────────────────────┐  │                    │
│  │  │  monty.wasm (2.9 MB)      │  │  ◄── go:embed      │
│  │  │  Monty Python Interpreter │  │                    │
│  │  │  compiled to wasm32-wasi  │  │                    │
│  │  └──────────┬────────────────┘  │                    │
│  │             │                   │                    │
│  │     pause on external call      │                    │
│  │             │                   │                    │
│  └─────────────┼───────────────────┘                    │
│                │                                        │
│                ▼                                        │
│  ExternalFunc callback ──► your Go code ──► resume      │
│  OsCallFunc callback   ──► your Go code ──► resume      │
│  PrintFunc callback    ──► your Go code                 │
└─────────────────────────────────────────────────────────┘

No CGO. wazero is a pure-Go WebAssembly runtime.
No subprocess. The WASM binary is embedded via go:embed and compiled once at startup.
Fresh instance per call. Each Execute() gets an isolated WASM instance. No state leaks between calls.
JSON at the boundary. All data crossing the Go↔WASM boundary is JSON. Go types map naturally: int→float64, string→string, bool→bool, nil→None, []any→list, map[string]any→dict.

API

// Create a reusable runner. Compiles the WASM module once.
runner, err := montygo.New()
defer runner.Close()

// Execute Python code with inputs and options.
result, err := runner.Execute(ctx, code, inputs, opts...)

// Options:
montygo.WithExternalFunc(fn,                     // register callable functions
    montygo.Func("search", "query", "limit"),    // with named parameters
    montygo.Func("calculate", "expression"),
)
montygo.WithOsCallFunc(fn)                       // handle filesystem/env access
montygo.WithLimits(montygo.Limits{...})          // resource limits
montygo.WithPrintFunc(fn)                        // capture print output

// FunctionCall provides named args (positional mapped by param name):
call.Args["query"].(string)    // access by parameter name
call.ArgsJSON()                // pre-serialized JSON string

Types

Python	Go (result)	Go (input)
`int`	`float64`	`int`, `float64`
`float`	`float64`	`float64`
`str`	`string`	`string`
`bool`	`bool`	`bool`
`None`	`nil`	`nil`
`list`, `tuple`	`[]any`	`[]any`
`dict`	`map[string]any`	`map[string]any`
`set`	`[]any`	—

Errors

Python exceptions become *montygo.MontyError:

result, err := runner.Execute(ctx, "1 / 0", nil)
var me *montygo.MontyError
if errors.As(err, &me) {
    fmt.Println(me.Message) // "Traceback... ZeroDivisionError: division by zero"
}

What Monty Can Do

Tracks upstream Monty v0.0.11.

Arithmetic, string operations, f-strings, slicing
Functions, lambdas, closures, generators
for/while loops, if/elif/else, break/continue
try/except/finally/else, raise, exception hierarchy
List/dict/set comprehensions, dict/set view operators
range, len, sum, min, max, sorted, reversed, enumerate, zip, map, filter, all, any, getattr
isinstance, type, int(), float(), str(), bool(), abs()
print() with sep and end kwargs
PEP 448 generalized unpacking (*args, **kwargs in calls, literals, etc.)
Nested and augmented subscript assignment (a[i][j] = v, a[i] += 1)
Tuple comparison (<, >, <=, >=)
Multi-module imports (import a, b, c)
Stdlib modules: math (all functions), re, datetime, json, and sys/typing/asyncio subsets
import os, from pathlib import Path (routed through OsCallFunc)
Dataclass instances flow through external function calls (args, returns, and method calls surface with method_call=true)
Resource limits: time, memory, allocations, recursion depth

What Monty Cannot Do

Class definitions (only dataclass instances via external I/O; upstream Monty flags class def as "coming soon")
match statements (coming soon upstream)
Context managers (with ...)
Rest of stdlib and all third-party libraries
float('inf') / float('nan') (JSON serialization limitation in this bridge)

Tests

97 end-to-end tests covering every testable scenario from Monty's core test suite:

make test

Covers: basic expressions, print variants, all exception types, data type round-tripping, external functions (args, kwargs, mixed, complex types, chaining, loops), input handling and scoping, resource limits (timeout, recursion, memory, allocations), OS calls, builtins, control flow, lambdas/closures, and execution isolation.

Building from Source

Requires Rust with wasm32-wasip1 target and Go 1.23+:

rustup target add wasm32-wasip1
make build  # compiles Rust → WASM, copies to monty.wasm
make test   # builds and runs Go tests

Acknowledgments

monty-go exists because of Monty, created by Samuel Colvin and the Pydantic team. Monty is a genuinely novel piece of engineering — a minimal, secure Python interpreter written from scratch in Rust, purpose-built for AI agents. The insight that LLMs should write code instead of making sequential tool calls, and that you need a safe interpreter (not a container) to execute it, is what makes code-mode possible.

Samuel and the Pydantic team have a track record of building foundational tools that the whole ecosystem builds on — Pydantic, Pydantic AI, Logfire, and now Monty. This project is a Go bridge to their work, and we're grateful they built it.

License

MIT

Documentation ¶

Overview ¶

Package montygo provides a Go wrapper around the Pydantic Monty Python interpreter compiled to WebAssembly. It uses wazero (pure Go, no CGO) to execute Python code in a sandboxed environment with pause/resume support for external function calls.

Index ¶

type ExecuteOption
type ExternalFunc
type FuncDef
- func Func(name string, params ...string) FuncDef
type FunctionCall
- func (fc *FunctionCall) ArgsJSON() string
type Limits
- func (l Limits) MarshalJSON() ([]byte, error)
type MontyError
- func (e *MontyError) Error() string
type OsCall
type OsCallFunc
type Runner
- func New() (*Runner, error)
- func (r *Runner) Close() error
- func (r *Runner) Execute(ctx context.Context, code string, inputs map[string]any, opts ...ExecuteOption) (any, error)

Constants ¶

This section is empty.

Variables ¶

This section is empty.

Functions ¶

This section is empty.

Types ¶

type ExecuteOption ¶

type ExecuteOption func(*executeConfig)

ExecuteOption configures a single Execute call.

func WithExternalFunc ¶

func WithExternalFunc(fn ExternalFunc, funcs ...FuncDef) ExecuteOption

WithExternalFunc sets the callback for external function calls. Each FuncDef declares a function name and its parameter names (for positional-to-keyword argument mapping).

func WithLimits ¶

func WithLimits(l Limits) ExecuteOption

WithLimits sets resource limits for the execution.

func WithOsCallFunc ¶

func WithOsCallFunc(fn OsCallFunc) ExecuteOption

WithOsCallFunc sets the callback for OS-level operations.

func WithPrintFunc ¶

func WithPrintFunc(fn func(string)) ExecuteOption

WithPrintFunc sets a callback for Python print() output.

type ExternalFunc ¶

type ExternalFunc func(ctx context.Context, call *FunctionCall) (any, error)

ExternalFunc is called when Python code calls an external function.

type FuncDef ¶

type FuncDef struct {
	Name   string   `json:"name"`
	Params []string `json:"params,omitempty"`
}

FuncDef defines an external Python function with its parameter names. Parameter names enable positional-to-keyword argument mapping in the WASM layer.

func Func ¶

func Func(name string, params ...string) FuncDef

Func creates a FuncDef with the given name and parameter names.

type FunctionCall ¶

type FunctionCall struct {
	Name   string
	Args   map[string]any
	CallID uint32
}

FunctionCall contains information about an external function call from Python. Args contains all arguments merged into a single map — positional args are mapped to parameter names (registered via FuncDef) and kwargs are merged in.

func (*FunctionCall) ArgsJSON ¶

func (fc *FunctionCall) ArgsJSON() string

ArgsJSON returns Args serialized as a JSON string, suitable for passing directly to tool handlers that accept JSON argument strings.

type Limits ¶

type Limits struct {
	MaxMemoryBytes    uint64        `json:"max_memory,omitempty"`
	MaxDuration       time.Duration `json:"-"`
	MaxAllocations    uint64        `json:"max_allocations,omitempty"`
	MaxRecursionDepth uint32        `json:"max_recursion_depth,omitempty"`
}

Limits configures resource limits for Python execution.

func (Limits) MarshalJSON ¶

func (l Limits) MarshalJSON() ([]byte, error)

MarshalJSON implements custom JSON marshaling for Limits.

type MontyError ¶

type MontyError struct {
	Message string
}

MontyError represents an error from the Monty Python interpreter.

func (*MontyError) Error ¶

func (e *MontyError) Error() string

type OsCall ¶

type OsCall struct {
	Function string
	Args     []any
	Kwargs   map[string]any
	CallID   uint32
}

OsCall contains information about an OS-level operation from Python.

type OsCallFunc ¶

type OsCallFunc func(ctx context.Context, call *OsCall) (any, error)

OsCallFunc is called when Python code performs an OS operation.

type Runner ¶

type Runner struct {
	// contains filtered or unexported fields
}

Runner is a compiled Monty WASM runtime ready to execute Python code. Create one Runner and reuse it across multiple Execute calls. Each Execute call gets its own isolated WASM instance.

func New ¶

func New() (*Runner, error)

New creates a new Monty WASM runner. The WASM module is compiled once and reused across Execute calls.

func (*Runner) Close ¶

func (r *Runner) Close() error

Close releases all WASM resources.

func (*Runner) Execute ¶

func (r *Runner) Execute(ctx context.Context, code string, inputs map[string]any, opts ...ExecuteOption) (any, error)

Execute runs Python code with the given inputs and returns the result. Each call creates an isolated WASM instance that is cleaned up when done.

Source Files ¶

View all Source files

?	: This menu
/	: Search site
f or F	: Jump to
y or Y	: Canonical URL