Sakana AI has launched Fugu, a multi-agent orchestration system presented through a single, OpenAI-compatible API endpoint. Rather than acting as one monolithic foundation model, Fugu coordinates a swappable pool of specialized models: it decomposes requests, assigns roles to different agents, verifies intermediate work, and synthesizes a final response. Sakana positions this “orchestration model” approach as a way to improve reliability and cost-performance by routing different subtasks to different models and reserving more expensive reasoning paths for harder parts of a workflow.

Sakana describes Fugu as grounded in two research papers, TRINITY and Conductor, which it says underpin learned coordination strategies for roles such as thinker, worker, and verifier. The company offers two variants: Fugu for lower-latency everyday use and Fugu Ultra for more complex, high-stakes, multi-step tasks such as research and security analysis, using a deeper agent pool.

Sakana reports benchmark performance on coding, reasoning, and agentic evaluations and says routing details are proprietary. Enterprise controls include options to exclude specific models or providers and a mechanism to manage training-data usage. Availability is currently limited in the EU/EEA, and pricing is offered via subscriptions and pay-as-you-go plans, with pay-as-you-go billing tied to the highest-tier underlying model involved for Fugu and fixed per-token pricing for Fugu Ultra.