sdk

package module

v0.0.0-...-28caef1 Latest Latest Go to latest Published: May 21, 2026 License: MIT Imports: 7 Imported by: 0

Details

Valid go.mod file
Redistributable license
Tagged version
Stable version
Learn more about best practices

Repository

github.com/TheSlopMachine/llm-router-sdk

Links

Open Source Insights

README ¶

llm-router-sdk

Public API contract for llm-router adapters.

Overview

The llm-router-sdk defines the interface contract for building external provider adapters for llm-router. Adapters are developed as independent Go modules that implement the Adapter interface, enabling llm-router to route requests to any LLM backend.

Why external adapters?

Decoupled development: Adapter authors maintain their own repositories
Independent versioning: Update adapters without rebuilding llm-router
Community contributions: Anyone can publish adapters for new providers
Clean separation: Core router logic remains provider-agnostic

Installation

go get github.com/TheSlopMachine/llm-router-sdk@main

Requirements: Go 1.25 or later

For bleeding-edge installs, @main resolves the current tip of the main branch. Go will still pin the resolved result as a pseudo-version in your go.mod, so rerun go get ...@main when you want to move to newer commits.

Quick Start

Create a new Go module and implement the Adapter interface:

package myadapter

import (
    "context"
    "io"
    sdk "github.com/TheSlopMachine/llm-router-sdk"
)

func init() {
    sdk.Register(&Adapter{})
}

type Adapter struct{}

func (a *Adapter) TypeKey() string { return "myprovider" }
func (a *Adapter) AuthType() sdk.AuthType { return sdk.AuthTypeAPIKey }
func (a *Adapter) ValidateCredentials(data map[string]string) error { /* ... */ }
func (a *Adapter) Complete(ctx context.Context, cred *sdk.Credential, req *sdk.ChatCompletionRequest) (*sdk.ChatCompletionResponse, error) { /* ... */ }
func (a *Adapter) CompleteStream(ctx context.Context, cred *sdk.Credential, req *sdk.ChatCompletionRequest, w io.Writer) error { /* ... */ }
func (a *Adapter) NeedsRefresh(cred *sdk.Credential) bool { return false }
func (a *Adapter) RefreshCredential(ctx context.Context, cred *sdk.Credential) (*sdk.Credential, error) { return nil, sdk.ErrNoRefreshNeeded }
func (a *Adapter) GetModelInfos(ctx context.Context, cred *sdk.Credential, providerQualifier string) ([]sdk.ModelInfo, error) { /* ... */ }
func (a *Adapter) GetAuthFlow() sdk.AuthFlowHandler { return &MyProviderAuthFlow{} }
func (a *Adapter) GetDefaultProviders() []sdk.ProviderInfo { return []sdk.ProviderInfo{{Name: "My Provider"}} }

var _ sdk.Adapter = (*Adapter)(nil)

Then add your adapter to llm-router's adapters.conf:

github.com/yourorg/llm-router-adapter-myprovider main

Start from llm-router-adapter-template for the canonical scaffold, or see examples/minimal for the smallest possible local example.

Core Concepts

Adapter Interface

The Adapter interface defines how llm-router communicates with provider backends. Each adapter is stateless; per-provider state lives in Credential records.

type Adapter interface {
    TypeKey() string
    AuthType() AuthType
    ValidateCredentials(data map[string]string) error
    Complete(ctx context.Context, cred *Credential, req *ChatCompletionRequest) (*ChatCompletionResponse, error)
    CompleteStream(ctx context.Context, cred *Credential, req *ChatCompletionRequest, w io.Writer) error
    NeedsRefresh(cred *Credential) bool
    RefreshCredential(ctx context.Context, cred *Credential) (*Credential, error)
    GetModelInfos(ctx context.Context, cred *Credential, providerQualifier string) ([]ModelInfo, error)
    GetAuthFlow() AuthFlowHandler
    GetDefaultProviders() []ProviderInfo
}

ModelId Format

Models are identified using the format: provider/model-name[:version]

Examples:

openai/gpt-4o
anthropic/claude-3-opus:20240229
openai:azure/gpt-4o
ollama/llama3

The prefix (openai, anthropic, etc.) maps to the adapter's TypeKey().

Credential Structure

Credentials use a flexible map[string]string to support various authentication schemes:

type Credential struct {
    ID   string            `json:"id"`
    Data map[string]string `json:"data"`
}

Examples:

// API Key
{"api_key": "sk-..."}

// OAuth2
{"access_token": "...", "refresh_token": "..."}

// Basic Auth
{"username": "...", "password": "..."}

Error Handling

Return *ProviderError for errors that should trigger intelligent retry and credential rotation:

type ProviderError struct {
    StatusCode int
    Message    string
    Type       ErrorType
    RetryAfter *time.Time
}

Error Types:

ErrorTypeRateLimit - Temporary rate limit, rotate credential
ErrorTypeQuotaExceeded - Credential quota exhausted, deprioritize (MUST have RetryAfter)
ErrorTypeAuth - Auth failure, credential may be invalid
ErrorTypeUpstream - Upstream error, don't retry
ErrorTypeTimeout - Timeout, may retry

API Reference

Adapter Methods

TypeKey

TypeKey() string

Returns a unique identifier for this provider type. Used in model IDs (e.g., "openai" for openai/gpt-4).

Requirements:

Must be lowercase
Must be alphanumeric (no spaces or special characters)
Must be unique across all registered adapters

AuthType

AuthType() AuthType

Declares what kind of credentials this provider uses.

Available types:

AuthTypeAPIKey - Static API key (most common)
AuthTypeOAuth2 - OAuth2 access/refresh tokens
AuthTypeBasic - HTTP Basic Auth (username/password)

ValidateCredentials

ValidateCredentials(data map[string]string) error

Validates credential data before saving. Called when users manually add credentials or when auth flows complete.

Best practices:

Check for required fields
Validate field formats (e.g., API key prefix)
Optionally make a test API call to verify the credential works
Return descriptive errors that help users fix the issue

Complete

Complete(
    ctx context.Context,
    cred *Credential,
    req *ChatCompletionRequest,
) (*ChatCompletionResponse, error)

Handles non-streaming chat completion requests. Called when a client makes a POST to /v1/chat/completions without "stream": true.

Implementation steps:

Extract credentials from cred.Data (e.g., API key)
Transform req (llm-router format) to your provider's API format
Make HTTP request to provider's API
Parse response and transform to ChatCompletionResponse
Return the standardized response

Error handling:

Return *ProviderError for retryable errors (rate limits, quota)
Return regular errors for network failures, parsing errors
Check ctx.Done() for client cancellation

CompleteStream

CompleteStream(
    ctx context.Context,
    cred *Credential,
    req *ChatCompletionRequest,
    w io.Writer,
) error

Handles streaming chat completion requests. Called when a client makes a POST to /v1/chat/completions with "stream": true.

Implementation steps:

Extract credentials from cred.Data
Transform req to your provider's API format
Make streaming HTTP request to provider's API
Read response stream and transform each chunk to StreamChunk
Write each chunk as Server-Sent Event (SSE) to w

SSE format:

data: {"id":"...","object":"chat.completion.chunk",...}\n\n
data: [DONE]\n\n

Error handling:

Check ctx.Done() frequently to detect client disconnection
Return *ProviderError for retryable errors
Partial writes are acceptable (client may have disconnected)

NeedsRefresh

NeedsRefresh(cred *Credential) bool

Returns true if the credential needs to be refreshed. Called periodically by the maintenance service.

Return true if:

OAuth token is expired or will expire soon
Session token needs renewal
Any other condition that requires credential refresh

For static API keys that don't expire, always return false.

RefreshCredential

RefreshCredential(
    ctx context.Context,
    cred *Credential,
) (*Credential, error)

Obtains fresh credentials using the existing credential data. Called by the maintenance service when NeedsRefresh() returns true.

Implementation patterns:

For OAuth2:

Extract refresh_token from cred.Data
Make token refresh request to provider's OAuth endpoint
Parse response and create new Credential with updated tokens
Return the new credential (framework will save it)

For session-based auth:

Extract session credentials from cred.Data
Make session renewal request
Return updated credential with new session token

For static API keys:

Return sdk.ErrNoRefreshNeeded

GetModelInfos

GetModelInfos(
    ctx context.Context,
    cred *Credential,
    providerQualifier string,
) ([]ModelInfo, error)

Retrieves metadata for all available models from the provider. Called by the modelinfo service when cache is expired or missing.

Parameters:

ctx - Context for timeout/cancellation
cred - Credential to use for API authentication
providerQualifier - Provider qualifier (e.g., "", "azure")

Returns:

Array of ModelInfo with rate limits and capabilities
Error if fetching fails (network, auth, parsing, etc.)

Implementation notes:

Should fetch from provider API when possible
Can return hardcoded values if API doesn't provide metadata
Should respect ctx timeout/cancellation
Return empty array (not error) if provider has no models
Framework will cache results with TTL

GetAuthFlow

GetAuthFlow() AuthFlowHandler

Returns an optional handler for automated provider-specific authentication. Return nil if your provider doesn't support automated authentication (users will need to manually enter credentials via JSON).

Most adapters should return an AuthFlowHandler implementation so the dashboard can guide users through API key, OAuth2, or multi-step authentication. Return nil only for intentionally manual-only providers.

GetDefaultProviders

GetDefaultProviders() []ProviderInfo

Returns a list of default providers this adapter supports. These providers will be automatically created during bootstrap.

Return an empty slice if this adapter should not have default providers.

Types

ChatCompletionRequest

type ChatCompletionRequest struct {
    Model       ModelId       `json:"model"`
    Messages    []ChatMessage `json:"messages"`
    Stream      bool          `json:"stream,omitempty"`
    MaxTokens   int           `json:"max_tokens,omitempty"`
    Temperature float64       `json:"temperature,omitempty"`
    TopP        float64       `json:"top_p,omitempty"`
}

ChatCompletionResponse

type ChatCompletionResponse struct {
    ID      string                 `json:"id"`
    Object  string                 `json:"object"`
    Created int64                  `json:"created"`
    Model   string                 `json:"model"`
    Choices []ChatCompletionChoice `json:"choices"`
    Usage   ChatCompletionUsage    `json:"usage"`
}

type ChatCompletionChoice struct {
    Index        int         `json:"index"`
    Message      ChatMessage `json:"message"`
    FinishReason string      `json:"finish_reason"`
}

type ChatCompletionUsage struct {
    PromptTokens     int `json:"prompt_tokens"`
    CompletionTokens int `json:"completion_tokens"`
    TotalTokens      int `json:"total_tokens"`
}

StreamChunk

type StreamChunk struct {
    ID      string              `json:"id"`
    Object  string              `json:"object"`
    Created int64               `json:"created"`
    Model   string              `json:"model"`
    Choices []StreamChunkChoice `json:"choices"`
}

type StreamChunkChoice struct {
    Index        int         `json:"index"`
    Delta        ChatMessage `json:"delta"`
    FinishReason *string     `json:"finish_reason"`
}

ModelInfo

type ModelInfo struct {
    Name          string `json:"name"`
    DisplayName   string `json:"display_name"`
    RPM           int64  `json:"rpm"`           // Requests per minute (0 = unlimited/unknown)
    TPM           int64  `json:"tpm"`           // Tokens per minute (0 = unlimited/unknown)
    RPD           int64  `json:"rpd"`           // Requests per day (0 = unlimited/unknown)
    ContextWindow int64  `json:"context_window,omitempty"`
    MaxTokens     int64  `json:"max_tokens,omitempty"`
}

ProviderInfo

type ProviderInfo struct {
    Name      string // Display name (e.g., "OpenAI", "Azure OpenAI")
    Qualifier string // Optional qualifier (e.g., "azure"), empty for default
    BaseURL   string // API base URL
    IconURL   string // Icon URL (remote CDN or data URI)
}

Authentication Flow (Optional)

The AuthFlowHandler interface enables automated credential acquisition through a wizard-like UI embedded in the dashboard.

type AuthFlowHandler interface {
    InitiateFlow(ctx AuthFlowContext) (AuthFlowState, error)
    HandleStep(ctx AuthFlowContext, input map[string][]string) (AuthFlowState, error)
}

type AuthFlowContext struct {
    ProviderID string
    FlowID     string
    Store      AuthStore // For adapter to persist its own state between steps
}

type AuthFlowState struct {
    RenderHTML         string            // Render this HTML in the wizard
    ExternalURL        string            // Open this URL in new tab (OAuth)
    Credentials        map[string]string // Flow complete, save these credentials
    WaitingForCallback bool              // True if waiting for external OAuth callback
}

Exactly ONE of RenderHTML, ExternalURL, or Credentials should be set.

See llm-router-adapter-demo for comprehensive examples including:

Simple API key input
OAuth2 flow
Multi-step authentication (username → password → MFA)

Error Handling for Credential Rotation

The framework implements intelligent retry with credential rotation. Adapters should return *ProviderError for errors that should trigger credential rotation:

// Rate limit error (temporary, rotate to next credential)
if resp.StatusCode == 429 {
    retryAfter := time.Now().Add(60 * time.Second)
    return nil, &sdk.ProviderError{
        StatusCode: 429,
        Message:    "rate limit exceeded",
        Type:       sdk.ErrorTypeRateLimit,
        RetryAfter: &retryAfter,
    }
}

// Quota exceeded (credential exhausted, deprioritize)
if strings.Contains(body, "quota exceeded") {
    resetAt := time.Now().Add(24 * time.Hour)
    return nil, &sdk.ProviderError{
        StatusCode: 429,
        Message:    "quota exceeded",
        Type:       sdk.ErrorTypeQuotaExceeded,
        RetryAfter: &resetAt, // REQUIRED for ErrorTypeQuotaExceeded
    }
}

// Auth failure (credential may be invalid)
if resp.StatusCode == 401 {
    return nil, &sdk.ProviderError{
        StatusCode: 401,
        Message:    "authentication failed",
        Type:       sdk.ErrorTypeAuth,
    }
}

The framework will:

Try each available credential once (sorted by LRU)
Apply exponential backoff when all credentials exhausted (1s→2s→4s→8s→16s→32s→64s)
Mark quota-exceeded credentials with lower priority
Automatically recover credentials when RetryAfter time passes

Examples

Starter Template

Start with llm-router-adapter-template for the canonical standalone starter repository.

Minimal Local Example

See examples/minimal for a tiny local example that demonstrates the smallest compiling adapter shape.

Reference Implementation

See llm-router-adapter-demo for a comprehensive reference implementation demonstrating:

Complete adapter interface
Auth flow handler
Rate limiting simulation
Error handling patterns
Test models for various scenarios
Comprehensive inline documentation

Resources

Main Router: github.com/TheSlopMachine/llm-router
Adapter Template: github.com/TheSlopMachine/llm-router-adapter-template
Demo Adapter: github.com/TheSlopMachine/llm-router-adapter-demo
Examples: examples/

License

MIT

Documentation ¶

Overview ¶

Package sdk defines the Adapter interface and registry for llm-router provider adapters.

To add a new provider type:

Implement the Adapter interface in a new package.
Call sdk.Register(yourAdapter{}) from an init() function.
That's it — the router will use it automatically.

Package sdk defines error types for llm-router provider adapters.

Package sdk defines shared types for llm-router provider adapters.

Index ¶

Variables
func GetAllDefaultProviders() map[string][]ProviderInfo
func Register(a Adapter)
func Registered() []string
type Adapter
- func Lookup(typeKey string) (Adapter, error)
type AuthFlowContext
type AuthFlowHandler
type AuthFlowState
type AuthStore
type AuthType
type ChatCompletionChoice
type ChatCompletionRequest
type ChatCompletionResponse
type ChatCompletionUsage
type ChatMessage
- func (m ChatMessage) MarshalJSON() ([]byte, error)
- func (m ChatMessage) TextContent() string
- func (m *ChatMessage) UnmarshalJSON(data []byte) error
type ChatMessageContentPart
- func (p ChatMessageContentPart) MarshalJSON() ([]byte, error)
- func (p ChatMessageContentPart) TextContent() string
- func (p *ChatMessageContentPart) UnmarshalJSON(data []byte) error
type ChatTool
type ChatToolCall
type ChatToolFunction
type Credential
type ErrorType
type ModelId
- func (m ModelId) Parse() (providerID, model string, err error)
- func (m ModelId) ParseFull() (adapterType, qualifier, model string, err error)
- func (m ModelId) String() string
type ModelInfo
type ProviderError
- func (e *ProviderError) Error() string
- func (e *ProviderError) IsRetryable() bool
type ProviderInfo
type StreamChunk
type StreamChunkChoice

Constants ¶

This section is empty.

Variables ¶

View Source

var ErrNoRefreshNeeded = fmt.Errorf("no refresh needed for this credential type")

ErrNoRefreshNeeded is returned by RefreshCredential for credential types that do not expire (e.g. static API keys).

Functions ¶

func GetAllDefaultProviders ¶

func GetAllDefaultProviders() map[string][]ProviderInfo

GetAllDefaultProviders collects default providers from all registered adapters.

func Register ¶

func Register(a Adapter)

Register adds an Adapter to the global registry. Panics if an adapter with the same TypeKey is already registered. Call this from your adapter package's init() function.

func Registered ¶

func Registered() []string

Registered returns all registered provider type keys.

Types ¶

type Adapter ¶

type Adapter interface {
	// TypeKey returns the unique string identifier for this provider type.
	// This value is stored in Provider.Type and used for adapter lookup.
	// Examples: "openai", "anthropic", "ollama"
	TypeKey() string

	// AuthType declares what kind of credentials this provider uses.
	AuthType() AuthType

	// ValidateCredentials validates and normalises the raw credential data
	// submitted through the dashboard and returns a ready-to-store Credential.
	ValidateCredentials(data map[string]string) error

	// Complete sends a non-streaming chat completion request.
	Complete(ctx context.Context, cred *Credential, req *ChatCompletionRequest) (*ChatCompletionResponse, error)

	// CompleteStream sends a streaming chat completion request, writing
	// Server-Sent Events to w.
	CompleteStream(ctx context.Context, cred *Credential, req *ChatCompletionRequest, w io.Writer) error

	// NeedsRefresh returns true if the credential should be proactively
	// refreshed before it expires (e.g. within the next 5 minutes).
	NeedsRefresh(cred *Credential) bool

	// RefreshCredential obtains new auth data using the existing credential
	// (e.g. using a refresh_token for OAuth 2.0 flows).
	// For static API keys, this should return ErrNoRefreshNeeded.
	RefreshCredential(ctx context.Context, cred *Credential) (*Credential, error)

	// GetModelInfos retrieves metadata for all available models from the provider.
	// Called by the modelinfo service when cache is expired or missing.
	//
	// Parameters:
	//   - ctx: Context for timeout/cancellation
	//   - cred: Credential to use for API authentication
	//   - providerQualifier: Provider qualifier (e.g., "", "azure")
	//
	// Returns:
	//   - Array of ModelInfo with rate limits and capabilities
	//   - Error if fetching fails (network, auth, parsing, etc.)
	//
	// Implementation notes:
	//   - Should fetch from provider API when possible
	//   - Can return hardcoded values if API doesn't provide metadata
	//   - Should respect ctx timeout/cancellation
	//   - Return empty array (not error) if provider has no models
	//   - Framework will cache results with TTL
	GetModelInfos(ctx context.Context, cred *Credential, providerQualifier string) ([]ModelInfo, error)

	// GetAuthFlow returns an optional handler for automated provider-specific auth.
	GetAuthFlow() AuthFlowHandler

	// GetDefaultProviders returns a list of default providers this adapter supports.
	// These providers will be automatically created during bootstrap.
	// Return an empty slice if this adapter should not have default providers.
	GetDefaultProviders() []ProviderInfo
}

Adapter is implemented by every provider type (e.g. OpenAI, Anthropic, Ollama). Each Adapter is stateless; per-provider state lives in Credential records.

func Lookup ¶

func Lookup(typeKey string) (Adapter, error)

Lookup returns the Adapter for the given provider type key.

type AuthFlowContext ¶

type AuthFlowContext struct {
	ProviderID string
	FlowID     string
	Store      AuthStore // For adapter to persist its own state between steps
}

AuthFlowContext provides framework services to the adapter

type AuthFlowHandler ¶

type AuthFlowHandler interface {
	// InitiateFlow starts the auth flow and returns the initial state.
	InitiateFlow(ctx AuthFlowContext) (AuthFlowState, error)

	// HandleStep processes user input and returns the next state.
	// Called on every form submission, callback, or interaction.
	HandleStep(ctx AuthFlowContext, input map[string][]string) (AuthFlowState, error)
}

AuthFlowHandler defines the custom auth orchestration flow.

type AuthFlowState ¶

type AuthFlowState struct {
	RenderHTML  string            // Render this HTML in the wizard (can include error messages)
	ExternalURL string            // Open this URL in new tab (OAuth)
	Credentials map[string]string // Flow complete, save these credentials

	// Optional metadata:
	WaitingForCallback bool // True if waiting for external OAuth callback
}

AuthFlowState represents the current state of the auth flow

type AuthStore ¶

type AuthStore interface {
	Set(key, value string) error
	Get(key string) (string, error)
	Delete(key string) error
}

AuthStore provides isolated, ephemeral storage for auth flows.

type AuthType ¶

type AuthType string

AuthType describes the authentication mechanism a provider uses.

const (
	AuthTypeAPIKey AuthType = "api_key" // static API key
	AuthTypeOAuth2 AuthType = "oauth2"  // OAuth 2.0 (with refresh)
	AuthTypeBasic  AuthType = "basic"   // HTTP Basic Auth
)

type ChatCompletionChoice ¶

type ChatCompletionChoice struct {
	Index        int         `json:"index"`
	Message      ChatMessage `json:"message"`
	FinishReason string      `json:"finish_reason"`
}

type ChatCompletionRequest ¶

type ChatCompletionRequest struct {
	Model       ModelId       `json:"model" example:"openai/gpt-4o"`
	Messages    []ChatMessage `json:"messages"`
	Tools       []ChatTool    `json:"tools,omitempty"`
	ToolChoice  any           `json:"tool_choice,omitempty"`
	Stream      bool          `json:"stream,omitempty" example:"false"`
	MaxTokens   int           `json:"max_tokens,omitempty" example:"1000"`
	Temperature float64       `json:"temperature,omitempty" example:"0.7"`
	TopP        float64       `json:"top_p,omitempty" example:"1.0"`
}

ChatCompletionRequest is the incoming /v1/chat/completions body.

type ChatCompletionResponse ¶

type ChatCompletionResponse struct {
	ID      string                 `json:"id"`
	Object  string                 `json:"object"`
	Created int64                  `json:"created"`
	Model   string                 `json:"model"`
	Choices []ChatCompletionChoice `json:"choices"`
	Usage   ChatCompletionUsage    `json:"usage"`
}

ChatCompletionResponse mirrors the OpenAI response schema.

type ChatCompletionUsage ¶

type ChatCompletionUsage struct {
	PromptTokens     int `json:"prompt_tokens"`
	CompletionTokens int `json:"completion_tokens"`
	TotalTokens      int `json:"total_tokens"`
}

type ChatMessage ¶

type ChatMessage struct {
	Role         string                   `json:"role" example:"user" enums:"system,user,assistant,tool"`
	Content      string                   `json:"-" example:"Hello, how are you?"`
	ContentParts []ChatMessageContentPart `json:"-"`
	ToolCalls    []ChatToolCall           `json:"tool_calls,omitempty"`
	ToolCallID   string                   `json:"tool_call_id,omitempty"`
}

ChatMessage is a single turn in a conversation.

func (ChatMessage) MarshalJSON ¶

func (m ChatMessage) MarshalJSON() ([]byte, error)

func (ChatMessage) TextContent ¶

func (m ChatMessage) TextContent() string

func (*ChatMessage) UnmarshalJSON ¶

func (m *ChatMessage) UnmarshalJSON(data []byte) error

type ChatMessageContentPart ¶

type ChatMessageContentPart struct {
	Type        string                   `json:"type,omitempty"`
	Text        string                   `json:"text,omitempty"`
	ToolUseID   string                   `json:"tool_use_id,omitempty"`
	ID          string                   `json:"id,omitempty"`
	Name        string                   `json:"name,omitempty"`
	Input       json.RawMessage          `json:"input,omitempty"`
	Content     []ChatMessageContentPart `json:"-"`
	ContentText string                   `json:"-"`
}

func (ChatMessageContentPart) MarshalJSON ¶

func (p ChatMessageContentPart) MarshalJSON() ([]byte, error)

func (ChatMessageContentPart) TextContent ¶

func (p ChatMessageContentPart) TextContent() string

func (*ChatMessageContentPart) UnmarshalJSON ¶

func (p *ChatMessageContentPart) UnmarshalJSON(data []byte) error

type ChatTool ¶

type ChatTool struct {
	Type        string            `json:"type,omitempty"`
	Name        string            `json:"name,omitempty"`
	Description string            `json:"description,omitempty"`
	Parameters  map[string]any    `json:"parameters,omitempty"`
	InputSchema map[string]any    `json:"input_schema,omitempty"`
	Function    *ChatToolFunction `json:"function,omitempty"`
}

type ChatToolCall ¶

type ChatToolCall struct {
	ID       string           `json:"id,omitempty"`
	Type     string           `json:"type,omitempty"`
	Function ChatToolFunction `json:"function"`
}

type ChatToolFunction ¶

type ChatToolFunction struct {
	Name        string         `json:"name,omitempty"`
	Description string         `json:"description,omitempty"`
	Parameters  map[string]any `json:"parameters,omitempty"`
	Arguments   string         `json:"arguments,omitempty"`
}

type Credential ¶

type Credential struct {
	ID   string            `json:"id"`
	Data map[string]string `json:"data"` // e.g. {"api_key": "sk-…"} or {"access_token": "…", "refresh_token": "…"}
}

Credential holds provider-specific authentication data.

type ErrorType ¶

type ErrorType int

ErrorType classifies provider errors for retry logic.

const (
	ErrorTypeUnknown       ErrorType = iota
	ErrorTypeRateLimit               // Temporary rate limit, rotate credential
	ErrorTypeQuotaExceeded           // Credential quota exhausted, deprioritize (MUST have RetryAfter)
	ErrorTypeAuth                    // Auth failure, credential may be invalid
	ErrorTypeUpstream                // Upstream error, don't retry
	ErrorTypeTimeout                 // Timeout, may retry
)

type ModelId ¶

type ModelId string

ModelId uniquely identifies a model in the format: provider/model-name[:version] Examples: "openai/gpt-4o", "openai:azure/gpt-4o", "anthropic/claude-3-opus:20240229"

func (ModelId) Parse ¶

func (m ModelId) Parse() (providerID, model string, err error)

Parse splits a ModelId into providerID and model name. The providerID may be composite (e.g., "openai" or "openai:azure"). Examples:

"openai/gpt-4o" -> ("openai", "gpt-4o")
"openai:azure/gpt-4o" -> ("openai:azure", "gpt-4o")

func (ModelId) ParseFull ¶

func (m ModelId) ParseFull() (adapterType, qualifier, model string, err error)

ParseFull splits a ModelId into adapter type, qualifier, and model name. Examples:

"openai/gpt-4o" -> ("openai", "", "gpt-4o")
"openai:azure/gpt-4o" -> ("openai", "azure", "gpt-4o")

func (ModelId) String ¶

func (m ModelId) String() string

type ModelInfo ¶

type ModelInfo struct {
	Name          string `json:"name"`                     // Model name without provider prefix
	DisplayName   string `json:"display_name"`             // Human-readable name (optional)
	RPM           int64  `json:"rpm"`                      // Requests per minute (0 = unlimited/unknown)
	TPM           int64  `json:"tpm"`                      // Tokens per minute (0 = unlimited/unknown)
	RPD           int64  `json:"rpd"`                      // Requests per day (0 = unlimited/unknown)
	ContextWindow int64  `json:"context_window,omitempty"` // Max context tokens
	MaxTokens     int64  `json:"max_tokens,omitempty"`     // Max output tokens
}

ModelInfo contains metadata about a specific model.

type ProviderError ¶

type ProviderError struct {
	StatusCode int
	Message    string
	Type       ErrorType
	RetryAfter *time.Time // Required for ErrorTypeQuotaExceeded, optional for ErrorTypeRateLimit
}

ProviderError represents errors returned by provider adapters. Adapters should return this type to enable intelligent retry and credential rotation.

func (*ProviderError) Error ¶

func (e *ProviderError) Error() string

func (*ProviderError) IsRetryable ¶

func (e *ProviderError) IsRetryable() bool

IsRetryable returns true if this error should trigger credential rotation.

type ProviderInfo ¶

type ProviderInfo struct {
	Name      string // Display name (e.g., "OpenAI", "Azure OpenAI")
	Qualifier string // Optional qualifier (e.g., "azure"), empty for default
	BaseURL   string // API base URL
	IconURL   string // Icon URL (remote CDN or data URI)
}

ProviderInfo defines a default provider configuration returned by adapters.

type StreamChunk ¶

type StreamChunk struct {
	ID      string              `json:"id"`
	Object  string              `json:"object"`
	Created int64               `json:"created"`
	Model   string              `json:"model"`
	Choices []StreamChunkChoice `json:"choices"`
}

StreamChunk is a single SSE data payload for streaming responses.

type StreamChunkChoice ¶

type StreamChunkChoice struct {
	Index        int         `json:"index"`
	Delta        ChatMessage `json:"delta"`
	FinishReason *string     `json:"finish_reason"`
}

Source Files ¶

View all Source files

?	: This menu
/	: Search site
f or F	: Jump to
y or Y	: Canonical URL