inference-gateway

module
v0.1.4 Latest Latest
Warning

This package is not in the latest version of its module.

Go to latest
Published: Jan 17, 2025 License: MIT

README ΒΆ

Inference Gateway

CI Status Version License

The Inference Gateway is a proxy server designed to facilitate access to various language model APIs. It allows users to interact with different language models through a unified interface, simplifying the configuration and the process of sending requests and receiving responses from multiple LLMs, enabling an easy use of Mixture of Experts.

Key Features

  • πŸ“œ Open Source: Available under the MIT License.
  • πŸš€ Unified API Access: Proxy requests to multiple language model APIs, including Groq, OpenAI, Ollama etc.
  • βš™οΈ Environment Configuration: Easily configure API keys and URLs through environment variables.
  • 🐳 Docker Support: Use Docker and Docker Compose for easy setup and deployment.
  • ☸️ Kubernetes Support: Ready for deployment in Kubernetes environments.
  • πŸ“Š OpenTelemetry Tracing: Enable tracing for the server to monitor and analyze performance.
  • πŸ›‘οΈ Production Ready: Built with production in mind, with configurable timeouts and TLS support.
  • 🌿 Lightweight: Includes only essential libraries and runtime, resulting in smaller size binary of ~10.8MB.
  • πŸ“‰ Minimal Resource Consumption: Designed to consume minimal resources and have a lower footprint.
  • πŸ“š Documentation: Well documented with examples and guides.
  • πŸ§ͺ Tested: Extensively tested with unit tests and integration tests.
  • πŸ› οΈ Maintained: Actively maintained and developed.
  • πŸ“ˆ Scalable: Easily scalable and can be used in a distributed environment - with HPA in Kubernetes.
  • πŸ”’ Compliance and Data Privacy: This project does not collect data or analytics, ensuring compliance and data privacy.
  • 🏠 Self-Hosted: Can be self-hosted for complete control over the deployment environment.

Supported API's

Configuration

The Inference Gateway can be configured using environment variables. The following environment variables are supported.

Examples

License

This project is licensed under the MIT License.

Directories ΒΆ

Path Synopsis
api
cmd
gateway command
generate command
mocks
Package mocks is a generated GoMock package.
Package mocks is a generated GoMock package.
mocks
Package mocks is a generated GoMock package.
Package mocks is a generated GoMock package.
sdk module

Jump to

Keyboard shortcuts

? : This menu
/ : Search site
f or F : Jump to
y or Y : Canonical URL