Recipe 04: OpenAI-Compatible Server
Serve a GGUF model behind an OpenAI-compatible HTTP API. Clients that speak the OpenAI API (curl, the Python openai library, LangChain, etc.) can connect directly: just point them at http://localhost:8080.
Endpoints:
- POST /v1/chat/completions (chat)
- POST /v1/completions (text completion)
- POST /v1/embeddings (embeddings)
- GET /v1/models (model listing)
- GET /health (health check)
Usage:
go run ./docs/cookbook/04-openai-server/ --model path/to/model.gguf
curl http://localhost:8080/v1/chat/completions -H "Content-Type: application/json" -d '{"model":"default","messages":[{"role":"user","content":"Hello"}]}'