synthgen

command
v1.0.0
Published: Mar 19, 2026 License: MIT Imports: 1 Imported by: 0

README

SynthGen

Generate synthetic training data.

SynthGen generates synthetic training data through the Stockyard proxy, with quality control built in: templates, seed examples, deduplication, and EvalGate scoring.

Quickstart

export OPENAI_API_KEY=sk-...
npx @stockyard/synthgen

# Your app:   http://localhost:5990/v1/chat/completions
# Dashboard:  http://localhost:5990/ui
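
Once the proxy is running, an application can point its requests at the local endpoint shown above. The sketch below is a minimal Go client. It assumes the endpoint accepts an OpenAI-style chat completions body (a `model` field and a `messages` array), which the path suggests but this README does not state. The helper `newChatRequest` and the model name `gpt-4o-mini` are illustrative, not part of SynthGen.

```go
package main

import (
	"bytes"
	"encoding/json"
	"fmt"
	"io"
	"net/http"
)

// message and chatRequest mirror an OpenAI-style request body.
// Assumption: the proxy accepts this shape unchanged.
type message struct {
	Role    string `json:"role"`
	Content string `json:"content"`
}

type chatRequest struct {
	Model    string    `json:"model"`
	Messages []message `json:"messages"`
}

// newChatRequest (hypothetical helper) builds a POST aimed at the local proxy.
func newChatRequest(baseURL, model, prompt string) (*http.Request, error) {
	body, err := json.Marshal(chatRequest{
		Model:    model,
		Messages: []message{{Role: "user", Content: prompt}},
	})
	if err != nil {
		return nil, err
	}
	req, err := http.NewRequest(http.MethodPost, baseURL+"/v1/chat/completions", bytes.NewReader(body))
	if err != nil {
		return nil, err
	}
	req.Header.Set("Content-Type", "application/json")
	return req, nil
}

func main() {
	req, err := newChatRequest("http://localhost:5990", "gpt-4o-mini",
		"Generate a customer support conversation about billing")
	if err != nil {
		fmt.Println("build request:", err)
		return
	}
	resp, err := http.DefaultClient.Do(req)
	if err != nil {
		// Expected when the proxy from the Quickstart is not running.
		fmt.Println("proxy not reachable:", err)
		return
	}
	defer resp.Body.Close()
	out, err := io.ReadAll(resp.Body)
	if err != nil {
		fmt.Println("read response:", err)
		return
	}
	fmt.Println(resp.Status)
	fmt.Println(string(out))
}
```

Keeping the base URL as a parameter makes it easy to switch between the local proxy and another host without touching the request-building code.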

What You Get

  • Template-based generation
  • Seed example expansion
  • Quality scoring per sample
  • Deduplication
  • Batch generation via BatchQueue
  • Export in training formats
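
To make the quality scoring, deduplication, and export features above concrete, here is a rough Go sketch of that kind of post-processing. It is not SynthGen's implementation: the `sample` type, the whitespace and case normalization, the SHA-256 exact-match deduplication, and the JSONL output are all assumptions chosen for illustration.

```go
package main

import (
	"crypto/sha256"
	"encoding/json"
	"fmt"
	"strings"
)

// sample is a hypothetical record: generated text plus a quality score in [0, 1].
type sample struct {
	Text    string  `json:"text"`
	Quality float64 `json:"quality"`
}

// normalize lowercases and collapses whitespace so that trivially
// different copies of the same text hash to the same key.
func normalize(s string) string {
	return strings.Join(strings.Fields(strings.ToLower(s)), " ")
}

// filterAndDedupe keeps samples scoring at least minQuality and drops
// duplicates (after normalization), preserving first-seen order.
func filterAndDedupe(in []sample, minQuality float64) []sample {
	seen := make(map[[sha256.Size]byte]bool)
	var out []sample
	for _, s := range in {
		if s.Quality < minQuality {
			continue
		}
		key := sha256.Sum256([]byte(normalize(s.Text)))
		if seen[key] {
			continue
		}
		seen[key] = true
		out = append(out, s)
	}
	return out
}

// toJSONL renders one JSON object per line.
func toJSONL(samples []sample) (string, error) {
	var b strings.Builder
	for _, s := range samples {
		line, err := json.Marshal(s)
		if err != nil {
			return "", err
		}
		b.Write(line)
		b.WriteByte('\n')
	}
	return b.String(), nil
}

func main() {
	in := []sample{
		{Text: "Where is my refund?", Quality: 0.91},
		{Text: "where is  my refund?", Quality: 0.88}, // duplicate after normalization
		{Text: "asdf", Quality: 0.20},                 // below the threshold
		{Text: "My package arrived damaged.", Quality: 0.85},
	}
	jsonl, err := toJSONL(filterAndDedupe(in, 0.8))
	if err != nil {
		fmt.Println("export failed:", err)
		return
	}
	fmt.Print(jsonl)
}
```

Exact-match hashing only catches near-identical text. Catching paraphrases would need a fuzzier similarity measure, which is outside the scope of this sketch.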

Config

# synthgen.yaml
port: 5990
providers:
  openai:
    api_key: ${OPENAI_API_KEY}
synthgen:
  template: "Generate a customer support conversation about {{topic}}"
  topics: [billing, shipping, returns, product_info]
  samples_per_topic: 100
  min_quality: 0.8
  deduplicate: true
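
As a rough illustration of how the `template` and `topics` settings above might combine, the Go sketch below expands the `{{topic}}` placeholder once per topic and computes the number of samples requested. It assumes plain string substitution and that `samples_per_topic` applies to each topic independently. The `config` struct and `expandPrompts` function are hypothetical and only mirror the YAML keys.

```go
package main

import (
	"fmt"
	"strings"
)

// config mirrors the synthgen block of synthgen.yaml (hypothetical struct).
type config struct {
	Template        string
	Topics          []string
	SamplesPerTopic int
}

// expandPrompts substitutes each topic for the {{topic}} placeholder.
func expandPrompts(c config) []string {
	prompts := make([]string, 0, len(c.Topics))
	for _, t := range c.Topics {
		prompts = append(prompts, strings.ReplaceAll(c.Template, "{{topic}}", t))
	}
	return prompts
}

func main() {
	c := config{
		Template:        "Generate a customer support conversation about {{topic}}",
		Topics:          []string{"billing", "shipping", "returns", "product_info"},
		SamplesPerTopic: 100,
	}
	for _, p := range expandPrompts(c) {
		fmt.Println(p)
	}
	// Requested count before any quality filtering or deduplication.
	fmt.Println("samples requested:", len(c.Topics)*c.SamplesPerTopic)
}
```

With the values from the config above, this requests 4 × 100 = 400 samples. If `min_quality` and `deduplicate` discard samples after generation, the final dataset could be smaller.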

Docker

docker run -p 5990:5990 -e OPENAI_API_KEY=sk-... stockyard/synthgen

Part of Stockyard

SynthGen is part of Stockyard — an open-source LLM proxy and control plane. MIT licensed.

Documentation

Overview

SynthGen — Stockyard Phase 3 P3 product.
