github.com/edenreich/inference-gateway
Version: v0.1.4
Published: Jan 17, 2025
License: MIT
README
Inference Gateway
The Inference Gateway is a proxy server that provides access to multiple language model APIs through a single, unified interface. It simplifies configuration and the process of sending requests to, and receiving responses from, multiple LLMs, which makes it easier to apply a Mixture of Experts approach.
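To make the idea of a unified proxy concrete, here is a minimal sketch in Go using only the standard library. It is not the project's implementation: the `/<provider>/...` path scheme, the `upstreams` table, and the helper names `splitProvider` and `gateway` are all assumptions made for illustration.

```go
package main

import (
	"fmt"
	"net/http"
	"net/http/httputil"
	"net/url"
	"strings"
)

// upstreams maps a provider name to its base URL.
// ASSUMPTION: these names and URLs are illustrative, not the gateway's routes.
var upstreams = map[string]string{
	"openai": "https://api.openai.com",
	"groq":   "https://api.groq.com",
	"ollama": "http://localhost:11434",
}

// splitProvider splits "/<provider>/<rest>" into the provider name and the
// remaining path, which always starts with "/".
func splitProvider(path string) (provider, rest string, ok bool) {
	trimmed := strings.TrimPrefix(path, "/")
	provider, rest, _ = strings.Cut(trimmed, "/")
	if provider == "" {
		return "", "", false
	}
	return provider, "/" + rest, true
}

// gateway forwards each request to the upstream named by the first path segment.
func gateway(w http.ResponseWriter, r *http.Request) {
	provider, rest, ok := splitProvider(r.URL.Path)
	base, known := upstreams[provider]
	if !ok || !known {
		http.Error(w, "unknown provider", http.StatusNotFound)
		return
	}
	target, err := url.Parse(base)
	if err != nil {
		http.Error(w, "bad upstream URL", http.StatusInternalServerError)
		return
	}
	// Strip the provider prefix and present the upstream's own host name.
	r.URL.Path = rest
	r.URL.RawPath = ""
	r.Host = target.Host
	httputil.NewSingleHostReverseProxy(target).ServeHTTP(w, r)
}

func main() {
	provider, rest, _ := splitProvider("/openai/v1/models")
	fmt.Println(provider, rest) // prints "openai /v1/models"
	// To serve requests, one could run:
	//   http.ListenAndServe(":8080", http.HandlerFunc(gateway))
}
```

With a layout like this, a client talks to one host and selects the upstream through the path; authentication headers, timeouts, and TLS would still need to be handled separately.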
Key Features
Open Source: Available under the MIT License.
Unified API Access: Proxy requests to multiple language model APIs, including Groq, OpenAI, Ollama, etc.
Environment Configuration: Easily configure API keys and URLs through environment variables.
Docker Support: Use Docker and Docker Compose for easy setup and deployment.
Kubernetes Support: Ready for deployment in Kubernetes environments.
OpenTelemetry Tracing: Enable tracing for the server to monitor and analyze performance.
Production Ready: Built with production in mind, with configurable timeouts and TLS support.
Lightweight: Includes only essential libraries and runtime, resulting in a small binary of ~10.8 MB.
Minimal Resource Consumption: Designed to consume minimal resources and keep a low footprint.
Documentation: Well documented with examples and guides.
Tested: Extensively tested with unit tests and integration tests.
Maintained: Actively maintained and developed.
Scalable: Easily scalable and usable in a distributed environment, for example with HPA in Kubernetes.
Compliance and Data Privacy: This project does not collect data or analytics, ensuring compliance and data privacy.
Self-Hosted: Can be self-hosted for complete control over the deployment environment.
Supported APIs
Configuration
The Inference Gateway can be configured using environment variables. The following environment variables are supported.
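The supported variable names are not reproduced here, so the following sketch uses hypothetical ones (`EXAMPLE_PORT`, `EXAMPLE_READ_TIMEOUT`, `EXAMPLE_OPENAI_API_KEY`) purely to illustrate how a Go service might read its settings from the environment with defaults. The `Config` type and the `getenv` and `loadConfig` helpers are likewise assumptions, not the gateway's actual code.

```go
package main

import (
	"fmt"
	"os"
	"time"
)

// Config holds a few settings of the kind a gateway typically needs.
// ASSUMPTION: field names and variable names are hypothetical.
type Config struct {
	Port        string
	ReadTimeout time.Duration
	OpenAIKey   string
}

// getenv returns the value of key, or fallback when it is unset or empty.
func getenv(lookup func(string) (string, bool), key, fallback string) string {
	if v, ok := lookup(key); ok && v != "" {
		return v
	}
	return fallback
}

// loadConfig reads settings through lookup (normally os.LookupEnv), which
// keeps the function easy to test with a fake environment.
func loadConfig(lookup func(string) (string, bool)) (Config, error) {
	timeout, err := time.ParseDuration(getenv(lookup, "EXAMPLE_READ_TIMEOUT", "30s"))
	if err != nil {
		return Config{}, fmt.Errorf("EXAMPLE_READ_TIMEOUT: %w", err)
	}
	return Config{
		Port:        getenv(lookup, "EXAMPLE_PORT", "8080"),
		ReadTimeout: timeout,
		OpenAIKey:   getenv(lookup, "EXAMPLE_OPENAI_API_KEY", ""),
	}, nil
}

func main() {
	cfg, err := loadConfig(os.LookupEnv)
	if err != nil {
		fmt.Fprintln(os.Stderr, "config error:", err)
		os.Exit(1)
	}
	// Report whether a key is set rather than printing the secret itself.
	fmt.Printf("port=%s read_timeout=%s openai_key_set=%t\n",
		cfg.Port, cfg.ReadTimeout, cfg.OpenAIKey != "")
}
```

Passing the lookup function in, rather than calling `os.LookupEnv` directly, is a design choice that lets the parsing be exercised without touching the real process environment.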
Examples
License
This project is licensed under the MIT License.
Directories
cmd
mocks
Package mocks is a generated GoMock package.
mocks
Package mocks is a generated GoMock package.