A high-performance AI gateway that provides a single OpenAI-compatible endpoint across 15+ AI providers with automatic failover, intelligent load balancing, semantic caching, and MCP tool integration — claiming sub-100µs overhead at 5k RPS.