api

package

v0.12.0 Latest Latest Go to latest Published: Jun 18, 2025 License: MIT Imports: 21 Imported by: 0

Documentation ¶

Index ¶

type ErrorResponse
type ResponseJSON
type Router
- func NewRouter(cfg config.Config, logger l.Logger, registry providers.ProviderRegistry, ...) Router
type RouterImpl

Constants ¶

This section is empty.

Variables ¶

This section is empty.

Functions ¶

This section is empty.

Types ¶

type ErrorResponse ¶

type ErrorResponse struct {
	Error string `json:"error"`
}

type ResponseJSON ¶

type ResponseJSON struct {
	Message string `json:"message"`
}

type Router ¶

type Router interface {
	ListModelsHandler(c *gin.Context)
	ChatCompletionsHandler(c *gin.Context)
	ListToolsHandler(c *gin.Context)
	ListAgentsHandler(c *gin.Context)
	GetAgentHandler(c *gin.Context)
	ProxyHandler(c *gin.Context)
	HealthcheckHandler(c *gin.Context)
	NotFoundHandler(c *gin.Context)
}

func NewRouter ¶

func NewRouter(
	cfg config.Config,
	logger l.Logger,
	registry providers.ProviderRegistry,
	client providers.Client,
	mcpClient mcp.MCPClientInterface,
	a2aClient a2a.A2AClientInterface,
) Router

type RouterImpl ¶

type RouterImpl struct {
	// contains filtered or unexported fields
}

func (*RouterImpl) ChatCompletionsHandler ¶ added in v0.2.21

func (router *RouterImpl) ChatCompletionsHandler(c *gin.Context)

ChatCompletionsHandler implements an OpenAI-compatible API endpoint that generates text completions in the standard OpenAI format.

Regular response format:

{
  "choices": [
    {
      "finish_reason": "stop",
      "message": {
        "content": "Hello, how can I help you today?",
        "role": "assistant"
      }
    }
  ],
  "created": 1742165657,
  "id": "chatcmpl-118",
  "model": "deepseek-r1:1.5b",
  "object": "chat.completion",
  "usage": {
    "completion_tokens": 139,
    "prompt_tokens": 10,
    "total_tokens": 149
  }
}

Streaming response format:

{
  "choices": [
    {
      "index": 0,
      "finish_reason": "stop",
      "delta": {
        "content": "Hello",
        "role": "assistant"
      }
    }
  ],
  "created": 1742165657,
  "id": "chatcmpl-118",
  "model": "deepseek-r1:1.5b",
  "object": "chat.completion.chunk",
  "usage": {
    "completion_tokens": 139,
    "prompt_tokens": 10,
    "total_tokens": 149
  }
}

It returns token completions as chat in the standard OpenAI format, allowing applications built for OpenAI's API to work seamlessly with the Inference Gateway's multi-provider architecture.

func (*RouterImpl) GetAgentHandler ¶ added in v0.12.0

func (router *RouterImpl) GetAgentHandler(c *gin.Context)

GetAgentHandler implements an endpoint that returns a specific A2A agent by ID

This endpoint provides detailed information about a single A2A agent identified by its unique ID. The ID is generated as a base64-encoded SHA256 hash of the agent's URL to ensure uniqueness and deterministic identification across restarts.

Request:

Method: GET
Path: /a2a/agents/{id}
Parameters:
id (path): The unique identifier of the agent (base64-encoded SHA256 hash of the agent URL)

Response (200 OK):

Content-Type: application/json
Body: A2AAgentCard object containing complete agent information

Response (404 Not Found):

When the specified agent ID does not exist

Response (403 Forbidden):

When A2A_EXPOSE is not enabled

Security:

Requires authentication via Bearer token
Only accessible when explicitly configured via A2A_EXPOSE=true
Does not expose internal errors to clients

func (*RouterImpl) HealthcheckHandler ¶

func (router *RouterImpl) HealthcheckHandler(c *gin.Context)

func (*RouterImpl) ListAgentsHandler ¶ added in v0.10.0

func (router *RouterImpl) ListAgentsHandler(c *gin.Context)

ListAgentsHandler implements an endpoint that returns complete A2A agent cards with all detailed information when A2A_EXPOSE environment variable is enabled.

This handler supports the Agent-to-Agent (A2A) protocol by exposing a list of connected agents along with their complete agent cards containing comprehensive metadata such as name, description, URL, capabilities, skills, security schemes, and supported input/output modes.

The endpoint follows the OpenAI-compatible API format pattern used throughout the gateway for consistency.

Request:

Method: GET
Path: /a2a/agents
Authentication: Required (Bearer token)
Query Parameters: None

Response format when A2A is exposed and agents are available:

{
  "object": "list",
  "data": [
    {
      "name": "Calculator Agent",
      "description": "An agent that can perform mathematical calculations",
      "url": "https://agent1.example.com",
      "version": "1.0.0",
      "capabilities": {
        "streaming": true,
        "pushNotifications": false,
        "stateTransitionHistory": false,
        "extensions": []
      },
      "skills": [
        {
          "id": "calculate",
          "name": "Mathematical Calculation",
          "description": "Perform basic and advanced mathematical operations",
          "tags": ["math", "calculation"],
          "examples": ["2 + 2", "sqrt(16)", "sin(pi/2)"]
        }
      ],
      "defaultInputModes": ["text/plain"],
      "defaultOutputModes": ["text/plain", "application/json"],
      "provider": {
        "organization": "Example Corp",
        "url": "https://example.com"
      },
      "security": [],
      "securitySchemes": {}
    }
  ]
}

Response when A2A is not exposed:

{
  "error": "A2A agents endpoint is not exposed. Set A2A_EXPOSE=true to enable."
}

Response when no agents are available:

{
  "object": "list",
  "data": []
}

Error Handling:

Returns 403 Forbidden if A2A_EXPOSE is not enabled
Returns empty list if A2A client is not initialized
Continues processing other agents if individual agent card retrieval fails
Logs errors for failed agent card retrievals but doesn't fail the entire request

The handler gracefully handles various states:

A2A client is nil (returns empty list)
A2A client is not initialized (returns empty list)
Individual agent card retrieval failures (skips failed agents, continues with others)
No agents configured (returns empty list)

Security:

Requires authentication via Bearer token
Only exposes agents when explicitly configured via A2A_EXPOSE=true
Does not expose internal errors to clients

Note: This endpoint now returns complete AgentCard objects instead of simplified A2AItem objects, providing clients with full agent metadata including capabilities, skills, security requirements, and supported modalities.

func (*RouterImpl) ListModelsHandler ¶ added in v0.1.6

func (router *RouterImpl) ListModelsHandler(c *gin.Context)

ListModelsHandler implements an OpenAI-compatible API endpoint that returns model information in the standard OpenAI format.

This handler supports the OpenAI GET /v1/models endpoint specification: https://platform.openai.com/docs/api-reference/models/list

Parameters:

provider (query): Optional. When specified, returns models from only that provider. If not specified, returns models from all configured providers.

Response format:

{
  "object": "list",
  "data": [
   {
      "id": "model-id",
      "object": "model",
      "created": 1686935002,
      "owned_by": "provider-name",
      "served_by": "provider-name"
   },
   ...
  ]
}

This endpoint allows applications built for OpenAI's API to work seamlessly with the Inference Gateway's multi-provider architecture.

func (*RouterImpl) ListToolsHandler ¶ added in v0.7.0

func (router *RouterImpl) ListToolsHandler(c *gin.Context)

ListToolsHandler implements an endpoint that returns available MCP tools when EXPOSE_MCP environment variable is enabled.

Response format when MCP is exposed:

{
  "object": "list",
  "data": [
    {
      "name": "read_file",
      "description": "Read the contents of a file",
      "server": "filesystem-server",
      "input_schema": {...}
    },
    ...
  ]
}

Response when MCP is not exposed:

{
  "error": "MCP tools endpoint is not exposed"
}

func (*RouterImpl) NotFoundHandler ¶

func (router *RouterImpl) NotFoundHandler(c *gin.Context)

func (*RouterImpl) ProxyHandler ¶

func (router *RouterImpl) ProxyHandler(c *gin.Context)

Source Files ¶

View all Source files

routes.go

Directories ¶

Path	Synopsis
middlewares

?	: This menu
/	: Search site
f or F	: Jump to
y or Y	: Canonical URL