skillbox

module

v0.2.0 · Published: Mar 3, 2026 · License: MIT

Repository

github.com/devs-group/skillbox

Skillbox

The self-hosted execution runtime for AI agents.

Your agents need a sandbox. Don't build one.

Skillbox gives AI agents a single API to run sandboxed skill scripts (Python, Node.js, Bash) and receive structured JSON output + file artifacts. Self-hosted, open source, secure by default.

from skillbox import Client

client = Client("http://localhost:8080", "sk-your-key")

# Discover what skills are available
for skill in client.list_skills():
    print(f"{skill.name}: {skill.description}")

# Run a skill — structured in, structured out
result = client.run("data-analysis", input={"data": [1, 2, 3, 4, 5]})
print(result.output)  # {"row_count": 5, "mean": 3.0, ...}

Why Skillbox

Every AI agent that does useful work needs to execute code. But executing arbitrary code is dangerous. Most teams either skip sandboxing ("we'll fix it later") or build their own broken wrapper. Skillbox is the missing piece:

Problem	How Skillbox Solves It
"We need sandboxing but E2B/Modal are cloud-only"	Self-hosted. Your infrastructure, your data, your rules.
"Three teams built three sandbox wrappers"	One runtime. One API. One security review.
"Our agents don't know what tools are available"	Skill catalog — agents discover, inspect, and choose capabilities.
"We need GDPR/EU AI Act compliance"	Data never leaves your network. MIT license.
"Docker is insecure for running untrusted code"	OpenSandbox with 11 layers of hardening, enforced by the runtime, not configurable by callers.

Compared to Alternatives

	Skillbox	E2B	Modal	Daytona
Self-hosted	Yes (MIT)	Experimental	No	Limited
Skill catalog	Yes (SKILL.md)	No	No	No
Structured I/O	JSON in → JSON out	Raw stdout	Raw stdout	Raw stdout
Agent introspection	list + get_skill	No	No	No
LangChain-native	1:1 tool mapping	Manual	Manual	Manual
Network disabled	Always	Optional	No	No
Zero-dep SDK	Go + Python	Python	Python	REST only
File management	Upload, version, download	Limited	No	No
License	MIT	Apache-2.0	Proprietary	Apache-2.0

Features

Secure by default — OpenSandbox isolation with network disabled, all capabilities dropped, read-only rootfs, PID limits, non-root user, no-new-privileges, image allowlist. 11 layers. Not optional.
Skill catalog — Skills are versioned, discoverable, introspectable units with YAML metadata + markdown instructions. Agents understand what's available before executing.
Structured I/O — Skills read JSON input, write JSON output, and produce file artifacts. No stdout parsing.
LangChain-ready — Skills map 1:1 to LangChain tools. get_skill returns descriptions for tool selection.
Self-hosted — Docker Compose with OpenSandbox service (dev), Kubernetes (prod), Helm chart. Air-gapped? Works offline.
Multi-tenant — API keys scoped to tenants, skills and executions isolated.
Zero-dep SDKs — Go and Python clients use only the standard library. No dependency conflicts.
CLI — Push, lint, run, package, and manage skills from the terminal.
File artifacts — Skills write files, runtime tars them, presigned S3 URL returned.
File persistence — Files persist across sessions, support versioning, and can be edited after creation via the file management API.
12-factor config — All configuration via environment variables.

Quick Start

Prerequisites: Docker and Docker Compose. The compose stack includes the OpenSandbox service for sandbox execution.

# 1. Start the stack (includes OpenSandbox, MinIO, PostgreSQL)
git clone https://github.com/devs-group/skillbox.git && cd skillbox
docker compose -f deploy/docker/docker-compose.yml up -d

# 2. Create an API key
bash scripts/seed-apikey.sh
export SKILLBOX_API_KEY=sk-...  # from the script output

# 3. Install the CLI and push a skill
go install github.com/devs-group/skillbox/cmd/skillbox@latest
skillbox skill push examples/skills/data-analysis --server http://localhost:8080

# 4. Run it
skillbox run data-analysis --input '{"data": [{"name": "Alice", "age": 30}, {"name": "Bob", "age": 25}]}'

Or with curl:

curl -s http://localhost:8080/v1/executions \
  -H "Authorization: Bearer $SKILLBOX_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{"skill": "data-analysis", "input": {"data": [{"name": "Alice", "age": 30}]}}' | jq .
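The same request can be assembled with Python's standard library. The sketch below only builds the request object so that its headers and body can be inspected; `build_execution_request` is a hypothetical helper name rather than part of the Skillbox SDK, and the endpoint and payload shape are taken from the curl call above.

```python
import json
import urllib.request


def build_execution_request(base_url, api_key, skill, payload):
    """Build (but do not send) a POST /v1/executions request."""
    body = json.dumps({"skill": skill, "input": payload}).encode("utf-8")
    return urllib.request.Request(
        url=base_url.rstrip("/") + "/v1/executions",
        data=body,
        method="POST",
        headers={
            "Authorization": f"Bearer {api_key}",
            "Content-Type": "application/json",
        },
    )


req = build_execution_request(
    "http://localhost:8080",
    "sk-your-key",
    "data-analysis",
    {"data": [{"name": "Alice", "age": 30}]},
)
# To send it, pass `req` to urllib.request.urlopen and decode the JSON response.
```

Keeping construction separate from sending makes the request easy to check in a unit test without a running server.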

Security Model

Security is enforced by the runtime — not configurable away by callers:

Control	Implementation	Threat Mitigated
Network isolation	OpenSandbox NetworkPolicy (`defaultAction: deny`)	Data exfiltration, SSRF
Capability drop	OpenSandbox container security context	Privilege escalation
Read-only rootfs	OpenSandbox container config	Filesystem tampering
PID limit	OpenSandbox resource limits	Fork bombs
No-new-privileges	OpenSandbox security context	setuid/setgid escalation
Non-root user	`UID 65534:65534` (set by runner)	Container escape
Image allowlist	Validated by Skillbox before `CreateSandbox` call	Supply-chain attack
Timeout	Go context cancellation + sandbox TTL	Resource exhaustion
tmpfs	OpenSandbox mounts	Binary execution in writable areas
Env var blocking	Filtered by runner before passing to sandbox	Library injection
Sandbox lifecycle	OpenSandbox API (no Docker socket required)	Host escape

For genuinely untrusted code, gVisor or Kata Containers can be enabled as a Kubernetes RuntimeClass with zero changes to Skillbox.
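Two of the controls in the table, env var blocking and the image allowlist, are plain checks that run before a sandbox is created. The following is a minimal sketch of what such checks could look like, not the Skillbox runner's code: the blocked variable names are an assumption (this README does not list them), and `filter_env` and `is_image_allowed` are hypothetical names.

```python
# Assumed blocklist: variables commonly abused for library or code injection.
# The names Skillbox actually filters are not listed in this README.
BLOCKED_ENV_VARS = {"LD_PRELOAD", "LD_LIBRARY_PATH", "PYTHONPATH", "NODE_OPTIONS"}


def filter_env(requested):
    """Return a copy of `requested` without blocked variables (case-insensitive)."""
    return {
        name: value
        for name, value in requested.items()
        if name.upper() not in BLOCKED_ENV_VARS
    }


def is_image_allowed(image, allowlist):
    """Exact-match check of an image reference against the allowlist."""
    return image in allowlist


safe_env = filter_env({"LD_PRELOAD": "/tmp/evil.so", "LANG": "C.UTF-8"})
allowed = is_image_allowed("python:3.12-slim", ["python:3.12-slim"])
```

Exact matching is the conservative choice here: a prefix or wildcard match would let a caller slip in an unreviewed tag.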

Skill Format

A skill is a zip archive containing SKILL.md + scripts:

my-skill/
├── SKILL.md              # YAML frontmatter + instructions
├── scripts/
│   └── main.py           # Entrypoint
└── requirements.txt      # Optional: Python deps

---
name: data-analysis
version: "1.0.0"
description: Analyze CSV data and produce summary statistics
lang: python
timeout: 60s
resources:
  memory: 256Mi
  cpu: "0.5"
---

# Data Analysis Skill

Analyze data and produce summary statistics with charts.

The YAML frontmatter is machine-readable (for SDKs and API). The markdown body is LLM-readable (for agent tool selection). This dual format is what makes Skillbox skills work as LangChain tools out of the box.

See docs/SKILL-SPEC.md for the full specification.
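To illustrate the dual format, the sketch below splits a SKILL.md file into its frontmatter and its markdown body using only the standard library. It is deliberately naive: `split_skill_md` is a hypothetical helper that reads top-level `key: value` scalars and skips nested blocks such as `resources:`, so a real loader would use a proper YAML parser instead.

```python
def split_skill_md(text):
    """Split SKILL.md into (metadata, body).

    Only top-level `key: value` scalars are parsed; nested blocks such as
    `resources:` are skipped. A real loader would use a YAML library.
    """
    lines = text.splitlines()
    if not lines or lines[0].strip() != "---":
        raise ValueError("SKILL.md must start with a '---' frontmatter fence")
    try:
        end = lines.index("---", 1)
    except ValueError:
        raise ValueError("frontmatter is not closed with '---'") from None

    metadata = {}
    for line in lines[1:end]:
        if not line or line[0].isspace() or ":" not in line:
            continue  # skip blanks and indented (nested) entries
        key, _, value = line.partition(":")
        value = value.strip()
        if value:  # a key with no inline value opens a nested block
            metadata[key.strip()] = value.strip('"')
    body = "\n".join(lines[end + 1:]).strip()
    return metadata, body


SAMPLE = """---
name: data-analysis
version: "1.0.0"
description: Analyze CSV data and produce summary statistics
lang: python
timeout: 60s
resources:
  memory: 256Mi
  cpu: "0.5"
---

# Data Analysis Skill

Analyze data and produce summary statistics with charts.
"""

metadata, body = split_skill_md(SAMPLE)
```

The `metadata` dict is the part an SDK would consume, while `body` is the text that would be handed to an LLM as the tool description.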

SDKs

Go

Single file, zero dependencies beyond the Go standard library:

go get github.com/devs-group/skillbox/sdks/go

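import (
    "context"
    "encoding/json"
    "log"

    skillbox "github.com/devs-group/skillbox/sdks/go"
)

ctx := context.Background()
client := skillbox.New("http://localhost:8080", "sk-your-key",
    skillbox.WithTenant("my-team"),
)

result, err := client.Run(ctx, skillbox.RunRequest{
    Skill: "text-summary",
    Input: json.RawMessage(`{"text": "Long text here...", "max_sentences": 3}`),
})
if err != nil {
    log.Fatal(err) // check the error before touching result
}

// Download any file artifacts the skill produced.
if result.HasFiles() {
    if err := client.DownloadFiles(ctx, result, "./output"); err != nil {
        log.Fatal(err)
    }
}

// File management
files, err := client.ListFiles(ctx, skillbox.FileFilter{ExecutionID: "exec-abc-123"})
if err != nil {
    log.Fatal(err)
}
if len(files) > 0 { // guard the index: the execution may have no files
    err = client.DownloadFile(ctx, files[0].ID, "./output/report.pdf")
}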

Python

Single file, zero dependencies beyond the Python standard library:

from skillbox import Client

client = Client("http://localhost:8080", "sk-your-key", tenant_id="my-team")

result = client.run("text-summary", input={"text": "Long text here...", "max_sentences": 3})
print(result.output)  # {"summary": "...", "sentence_count": 2}

if result.has_files:
    client.download_files(result, "./output")

# File management
files = client.list_files(execution_id="exec-abc-123")
client.download_file(files[0].id, "./output/report.pdf")

LangChain Integration

Skillbox skills map directly to LangChain tools. Each skill becomes a callable tool that an agent can discover, inspect, and execute:

from langchain_anthropic import ChatAnthropic
from langgraph.prebuilt import create_react_agent

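# Build tools from all registered skills.
# Note: build_skillbox_toolkit is not imported in this snippet; it is assumed
# to be provided by the LangChain integration code.
tools = build_skillbox_toolkit("http://localhost:8080", "sk-your-key")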

# Agent sees tools like skillbox_data_analysis, reads their descriptions,
# picks the right one, calls it with structured input, gets structured output
agent = create_react_agent(ChatAnthropic(model="claude-sonnet-4-6"), tools)
result = agent.invoke({
    "messages": [{"role": "user", "content": "Analyze this data: name,age\nAlice,30\nBob,25"}]
})

See the full LangChain integration guide for SkillboxTool, SkillboxToolkit, and custom tool examples.
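The 1:1 mapping from skills to tools can be sketched without LangChain installed. The example below is an illustration under assumptions, not the integration's actual code: it turns each skill into a plain dict holding a name, a description, and a callable. `build_tool_specs`, `tool_name`, `FakeClient`, and `FakeResult` are hypothetical names. The `skillbox_` prefix with hyphens replaced by underscores follows the `skillbox_data_analysis` name mentioned above, and the client methods mirror the Python SDK calls shown earlier.

```python
from dataclasses import dataclass


@dataclass
class Skill:
    name: str
    description: str


def tool_name(skill_name):
    """'data-analysis' -> 'skillbox_data_analysis'."""
    return "skillbox_" + skill_name.replace("-", "_")


def build_tool_specs(client):
    """Return one tool spec per skill: a name, a description, and a callable."""
    specs = []
    for skill in client.list_skills():
        def call(payload, _name=skill.name):  # bind the name per iteration
            return client.run(_name, input=payload).output
        specs.append({
            "name": tool_name(skill.name),
            "description": skill.description,
            "call": call,
        })
    return specs


class FakeResult:
    def __init__(self, output):
        self.output = output


class FakeClient:
    """Stand-in for skillbox.Client so the sketch runs without a server."""

    def list_skills(self):
        return [Skill("data-analysis", "Analyze CSV data"),
                Skill("text-summary", "Summarize text")]

    def run(self, name, input):
        return FakeResult({"skill": name, "echo": input})


specs = build_tool_specs(FakeClient())
```

The default argument `_name=skill.name` matters: without it every closure would capture the loop variable and call the last skill in the list.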

Architecture

Agent → REST API → Skill Registry (MinIO) → OpenSandbox Runner → Sandbox (hardened) → Output + Files
              ↕                                      ↕
         PostgreSQL                         OpenSandbox API (lifecycle + ExecD)

Every execution: authenticate → load skill → validate image → create hardened sandbox via OpenSandbox → run → collect output + files → cleanup. Stateless API, horizontally scalable behind a load balancer.

See docs/ARCHITECTURE.md for the full deep-dive.
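The per-execution sequence described above can be sketched as a single function. This is a simplified illustration, not the server's implementation: `execute` and `RecordingSteps` are hypothetical, and each stage is a stub that only records that it was called. The point it shows is that cleanup runs whether or not the skill succeeds.

```python
def execute(request, steps):
    """Run the per-execution pipeline and always clean up a created sandbox."""
    tenant = steps.authenticate(request["api_key"])
    skill = steps.load_skill(tenant, request["skill"])
    steps.validate_image(skill["image"])  # reject before creating anything
    sandbox = steps.create_sandbox(skill)
    try:
        steps.run(sandbox, request["input"])
        return steps.collect(sandbox)
    finally:
        steps.cleanup(sandbox)  # runs on success and on failure


class RecordingSteps:
    """Fake stages that record the order in which they are called."""

    def __init__(self, fail_on_run=False):
        self.calls = []
        self.fail_on_run = fail_on_run

    def authenticate(self, api_key):
        self.calls.append("authenticate")
        return "tenant-1"

    def load_skill(self, tenant, name):
        self.calls.append("load_skill")
        return {"name": name, "image": "python:3.12-slim"}

    def validate_image(self, image):
        self.calls.append("validate_image")

    def create_sandbox(self, skill):
        self.calls.append("create_sandbox")
        return "sandbox-1"

    def run(self, sandbox, payload):
        self.calls.append("run")
        if self.fail_on_run:
            raise RuntimeError("skill crashed")

    def collect(self, sandbox):
        self.calls.append("collect")
        return {"output": {}, "files": []}

    def cleanup(self, sandbox):
        self.calls.append("cleanup")


steps = RecordingSteps()
request = {"api_key": "sk-your-key", "skill": "data-analysis", "input": {}}
result = execute(request, steps)
```

Because no state is kept between calls to `execute`, any number of API instances can run it side by side, which is consistent with the stateless design described above.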

CLI

skillbox run <skill> [--input '{}'] [--version latest]
skillbox skill push <dir|zip>
skillbox skill list
skillbox skill lint <dir>
skillbox skill package <dir>
skillbox exec logs <id>
skillbox health
skillbox version

Deployment

Docker Compose (Development)

docker compose -f deploy/docker/docker-compose.yml up

Kubernetes (Production)

kubectl apply -k deploy/k8s/overlays/prod

Helm

helm install skillbox deploy/helm/skillbox/

Kustomize overlays are provided for dev and prod environments and include namespace, RBAC, NetworkPolicy, and Pod Security Standards. OpenSandbox manages the container lifecycle directly, so no Docker socket proxy is required.

API

Method	Path	Description
POST	/v1/executions	Run a skill
GET	/v1/executions/:id	Get execution result
GET	/v1/executions/:id/logs	Get execution logs
POST	/v1/skills	Upload a skill zip
GET	/v1/skills	List skills (with descriptions)
GET	/v1/skills/:name/:version	Get skill metadata + instructions
DELETE	/v1/skills/:name/:version	Delete a skill
POST	/v1/files	Upload a file
GET	/v1/files	List files (with pagination)
GET	/v1/files/:id	Get file metadata
GET	/v1/files/:id/download	Download file content
PUT	/v1/files/:id	Update/version a file
DELETE	/v1/files/:id	Delete a file
GET	/v1/files/:id/versions	List file versions
GET	/health	Liveness probe
GET	/ready	Readiness probe

See docs/API.md for the full reference.
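Several routes in the table carry `:id`, `:name`, or `:version` path parameters. When calling them by hand, each value should be URL-quoted before it is substituted. The helper below is a small sketch of that step; `fill_path` is a hypothetical name and not part of either SDK.

```python
from urllib.parse import quote


def fill_path(template, **params):
    """Replace ':name' segments in a route template with URL-quoted values."""
    parts = []
    for segment in template.split("/"):
        if segment.startswith(":"):
            key = segment[1:]
            if key not in params:
                raise KeyError(f"missing path parameter: {key}")
            # safe="" also quotes '/', so a value cannot add path segments
            parts.append(quote(str(params[key]), safe=""))
        else:
            parts.append(segment)
    return "/".join(parts)


logs_path = fill_path("/v1/executions/:id/logs", id="exec-abc-123")
skill_path = fill_path("/v1/skills/:name/:version",
                       name="data-analysis", version="1.0.0")
```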

Examples

Example	Description
examples/skills/data-analysis/	CSV/JSON statistics with chart artifacts
examples/skills/text-summary/	Extractive text summarization
examples/skills/word-counter/	Word frequency counting
examples/curl/	Step-by-step curl + jq walkthrough
examples/python/	Python integration (stdlib only)
examples/agent-integration/	Full Go agent using the SDK
examples/write-your-first-skill/	Build your first skill (tutorial)

Run all examples at once:

docker compose -f examples/docker-compose.yml up

Configuration

All configuration via environment variables (12-factor):

Variable	Default	Description
`SKILLBOX_DB_DSN`	required	PostgreSQL connection string
`SKILLBOX_S3_ENDPOINT`	required	MinIO/S3 endpoint
`SKILLBOX_S3_ACCESS_KEY`	required	S3 access key
`SKILLBOX_S3_SECRET_KEY`	required	S3 secret key
`SKILLBOX_OPENSANDBOX_URL`	http://localhost:8080	OpenSandbox API URL
`SKILLBOX_OPENSANDBOX_API_KEY`	required	OpenSandbox API key
`SKILLBOX_SANDBOX_EXPIRATION`	5m	Sandbox TTL
`SKILLBOX_IMAGE_ALLOWLIST`	python:3.12-slim,...	Allowed Docker images
`SKILLBOX_DEFAULT_TIMEOUT`	120s	Default execution timeout
`SKILLBOX_API_PORT`	8080	HTTP port
`SKILLBOX_REDIS_URL`	(optional)	Redis URL for caching

Contributing

We welcome contributions! See CONTRIBUTING.md for development setup, coding guidelines, and how to add new skills.

License

MIT. See LICENSE.

Built and maintained by devs group · Kreuzlingen, Switzerland

Directories

Path	Synopsis
cmd/skillbox	command
cmd/skillbox-server	command
examples/agent-integration	command. Example: Using the Skillbox Go SDK in an AI agent.
internal/api	
internal/api/handlers	
internal/api/middleware	
internal/api/response	
internal/artifacts	
internal/config	
internal/registry	
internal/runner	
internal/sandbox	
internal/skill	
internal/store	
internal/version	
sdks/go	Package skillbox provides a Go client for the Skillbox API, an open-source secure skill execution runtime for AI agents.
