Distributed AI compute pool with OpenAI-compatible endpoints, automatic load balancing, and failover support.