partialcache

command

v1.0.0 Latest Latest Go to latest Published: Mar 19, 2026 License: MIT Imports: 1 Imported by: 0

Details

Valid go.mod file
Redistributable license
Tagged version
Stable version
Learn more about best practices

Repository

github.com/stockyard-dev/stockyard

Links

Open Source Insights

README ¶

PartialCache

Cache reusable prompt prefixes.

PartialCache detects static prompt prefixes and uses native prefix caching where supported. Simulates for providers that don't support it.

Quickstart

export OPENAI_API_KEY=sk-...
npx @stockyard/partialcache

# Your app:   http://localhost:6230/v1/chat/completions
# Dashboard:  http://localhost:6230/ui

What You Get

Detect static prompt prefixes
Native prefix caching support
Simulation for unsupported providers
Per-prefix savings tracking
Auto-detect cacheable prefixes
Dashboard with prefix cache stats

Config

# partialcache.yaml
port: 6230
partialcache:
  enabled: true
  min_prefix_tokens: 100
  auto_detect: true

Docker

docker run -p 6230:6230 -e OPENAI_API_KEY=sk-... stockyard/partialcache

Part of Stockyard

PartialCache is part of Stockyard — an open-source LLM proxy and control plane. MIT licensed.

Documentation ¶

There is no documentation for this package.

Source Files ¶

View all Source files

main.go

?	: This menu
/	: Search site
f or F	: Jump to
y or Y	: Canonical URL