README
ยถ
DevOps MCP - AI Agent Orchestration Platform
๐ The production-ready platform for orchestrating multiple AI agents in your DevOps workflows
Connect AI models โข Intelligent task routing โข Real-time collaboration โข Enterprise scale
๐ฏ Transform Your DevOps with AI Orchestration
DevOps teams struggle to integrate AI effectively - managing multiple models, coordinating agents, and optimizing costs. DevOps MCP solves this with intelligent orchestration that routes tasks to the right AI agent at the right time.
Why DevOps MCP?
- ๐ค Multi-Agent Orchestration: Register and coordinate multiple AI agents with different capabilities
- ๐ง Intelligent Task Routing: Automatically route tasks based on capability, performance, and cost
- โก Real-time Collaboration: WebSocket-based coordination with binary protocol optimization
- ๐ฐ Cost Optimization: Smart routing minimizes AI costs while maximizing performance
- ๐ข Enterprise Ready: Production AWS integration with circuit breakers and observability
๐ Key Features
AI Agent Orchestration
- Capability-Based Discovery: Agents advertise their strengths (code analysis, security, documentation)
- Dynamic Load Balancing: Routes tasks to least-loaded agents in real-time
- Collaboration Strategies: MapReduce, parallel execution, consensus building
- Workload Management: Track and optimize agent utilization
Intelligent Task Assignment
- Multiple Routing Algorithms:
- Performance-based: Route to fastest agents
- Cost-optimized: Minimize API costs
- Capability-match: Match task requirements to agent strengths
- Least-loaded: Balance work across agents
- Circuit Breakers: Automatic failover when agents fail
- Priority Queuing: Critical tasks get processed first
AWS Bedrock Integration
- Multiple Embedding Models:
- Amazon Titan (v1 & v2)
- Cohere (English & Multilingual)
- Claude 3 (experimental)
- Smart Provider Routing: Automatic failover between models
- Cost Tracking: Per-model usage and cost optimization
- Quality Scoring: Route based on embedding quality needs
Real-time Communication
- Binary WebSocket Protocol: Compressed messages for efficiency
- Mixed Message Support: Text and binary in same connection
- Connection Pooling: Efficient resource utilization
- Heartbeat Monitoring: Automatic reconnection handling
๐ Real-World Impact
๐ฏ Intelligent Code Review
Route security reviews to specialized models, style checks to faster models
- 70% faster PR reviews with better coverage
- Parallel analysis by multiple specialized agents
- Cost savings through intelligent routing
๐ Multi-Agent Documentation
Coordinate multiple AI agents to generate comprehensive docs
- Generate complete API docs in minutes, not hours
- Different agents handle different sections
- Consistency through orchestration
๐จ Smart Incident Response
Route alerts to specialized agents based on severity and type
- 50% reduction in MTTR with AI-powered triage
- Automatic escalation to appropriate agents
- Learning from resolution patterns
๐๏ธ Architecture
graph TD
A[AI Agents] -->|WebSocket| B[MCP Server]
B --> C[Task Router]
C --> D[Assignment Engine]
D --> E[Agent Registry]
B --> F[REST API]
F --> G[GitHub/Tools]
B --> H[Vector DB]
B --> I[S3 Storage]
B --> J[SQS Queue]
Core Components
- MCP Server: WebSocket server for real-time agent communication
- Task Router: Intelligent routing based on capabilities and load
- Agent Registry: Tracks online agents and their capabilities
- Assignment Engine: Sophisticated algorithms for task distribution
- Vector Database: pgvector for semantic search and embeddings
- Event Queue: SQS for asynchronous task processing
๐ Quick Start
Prerequisites
- Go 1.24+ (workspace support)
- Docker & Docker Compose
- AWS Account (for production features)
- PostgreSQL 14+ with pgvector
Option 1: Docker (Recommended)
# Clone repository
git clone https://github.com/S-Corkum/devops-mcp.git
cd devops-mcp
# Configure environment
cp .env.example .env
# Edit .env with your AWS credentials and settings
# Start all services
docker-compose up -d
# Verify health
curl http://localhost:8080/health
curl http://localhost:8081/health
Option 2: Local Development
# Clone and setup
git clone https://github.com/S-Corkum/devops-mcp.git
cd devops-mcp
# Install dependencies
make deps
# Start infrastructure (PostgreSQL, Redis)
make dev-setup
# Run database migrations
make migrate-up
# Start services
make dev
# In another terminal, register an AI agent
curl -X POST http://localhost:8080/api/v1/agents/register \
-H "Content-Type: application/json" \
-d '{
"id": "agent-001",
"name": "Code Analyzer",
"capabilities": ["code-review", "security-scan"],
"model_id": "amazon.titan-embed-text-v1"
}'
๐ฎ Usage Examples
Register an AI Agent
// WebSocket connection to MCP
ws, _ := websocket.Dial("ws://localhost:8080/ws", "", "http://localhost")
// Register agent
msg := AgentRegistration{
Type: "agent.register",
Payload: AgentInfo{
ID: "security-agent",
Name: "Security Scanner",
Capabilities: []string{"security", "vulnerability-scan"},
ModelID: "amazon.titan-embed-text-v2",
},
}
websocket.JSON.Send(ws, msg)
Submit a Task
# Submit task via REST API
curl -X POST http://localhost:8081/api/v1/tasks \
-H "Content-Type: application/json" \
-d '{
"type": "code-review",
"priority": "high",
"repository": "github.com/org/repo",
"capabilities_required": ["security", "code-review"]
}'
Monitor Agent Performance
# Get agent metrics
curl http://localhost:8081/api/v1/agents/agent-001/metrics
# Response shows:
# - Active tasks: 3
# - Completed tasks: 147
# - Average response time: 1.2s
# - Cost per task: $0.0003
๐ Performance Metrics
- Binary Protocol: Significant performance improvements with compression
- Concurrent Agents: Handle 1000+ simultaneous AI agents
- Task Routing: Sub-100ms routing decisions
- Availability: 99.9% uptime with circuit breakers
- Scalability: Horizontal scaling with distributed task processing
๐ ๏ธ Technology Stack
- Language: Go 1.24+ with workspace support
- Databases: PostgreSQL 14+ (pgvector), Redis 6.2+
- AI/ML: AWS Bedrock (Titan, Cohere, Claude)
- Queue: AWS SQS
- Storage: AWS S3
- Protocol: WebSocket with binary encoding
- Observability: OpenTelemetry, Prometheus, Grafana
๐ Documentation
Getting Started
Architecture
API Reference
Operations
๐ค Contributing
We welcome contributions! Please see our Contributing Guide for details.
Development Workflow
- Fork the repository
- Create a feature branch
- Make your changes with tests
- Run
make pre-commit
- Submit a pull request
๐ License
This project is licensed under the MIT License - see the LICENSE file for details.
๐ Acknowledgments
- AWS Bedrock team for AI/ML infrastructure
- pgvector for vector similarity search
- OpenTelemetry for observability standards
- The Go community for excellent tooling
Built with โค๏ธ for the DevOps community
Documentation
ยถ
There is no documentation for this package.