Sakana Fugu API Pricing
Sakana AI describes Sakana Fugu as a multi-agent system delivered as one model API. Fugu dynamically orchestrates a pool of expert models for coding, reasoning, research, security analysis, and other complex multi-step work.
| Model | Input /M | Output /M | Cached | Context | Max output | Vision | Tools | Tier |
|---|---|---|---|---|---|---|---|---|
| Fugu Ultra FLAGSHIP | $5.00 | $30 | $0.5−90% | 272K | — | ✗ | ✓ | Active |
Sakana Fugu pricing notes.
AI//COST tracks Fugu Ultra because Sakana publishes a fixed token row for fugu-ultra-20260615. Regular Fugu remains variable-rate because its price depends on the active underlying model tier.
OpenAI-compatible API for Fugu and Fugu Ultra.
PRICINGCanonical source for fixed Fugu Ultra token prices and variable Fugu billing notes.
TECHNICAL REPORTResearch background for learned orchestration and multi-agent coordination.
Sakana Fugu is positioned as "Multi-Agent System as a Model": one OpenAI-compatible model API that dynamically coordinates a pool of expert models for complex, multi-step tasks. The public page emphasizes coding, reasoning, Kaggle competitions, paper reproduction, cybersecurity analysis, and literature or patent investigations.
AI//COST tracks Fugu Ultra as the token-priced row because Sakana publishes fixed pay-as-you-go prices for fugu-ultra-20260615. Regular Fugu remains variable-rate: Sakana says a single active agent is billed at that underlying model rate, and multiple active agents are billed as one rate based on the top-tier model involved.
OpenAI
GPT-5.6 Sol has the same headline $5/$30/$0.50 token row but without Sakana's published multi-agent orchestration positioning.
AGENT MODELAnthropic
Claude Fable 5 and Mythos 5 are key comparators for high-end agentic work and long-context coding/research tasks.
LONG CONTEXTGemini provides lower-cost long-context alternatives when a single-model workflow is enough.
OPEN-WEIGHT GIANTAlibaba (Qwen)
The Qwen (Tongyi Qianwen) model family from Alibaba Cloud's Tongyi Lab. First released in 2023, Qwen is the most-downloaded open-weight model family in the world — most tiers ship under Apache 2.0 on Hugging Face and ModelScope, while the proprietary Max tier is served through Alibaba Cloud's Model Studio.
CHINA CONSUMER LEADERByteDance (Doubao)
The Doubao model family from ByteDance — parent of TikTok and Douyin. Served through the Volcano Ark (火山方舟) platform on ByteDance's Volcano Engine cloud, Doubao powers China's most-used consumer AI app and undercuts most frontier labs on price.
EU FRONTIER LABMistral AI
The Paris lab behind Le Chat and the open-weight Mistral family. Founded in 2023 by Arthur Mensch with ex-DeepMind and Meta researchers, it ships frontier-grade models with Apache-2.0 open weights for most tiers and EU-native data residency — the leading European alternative to US labs.
Frequently asked.
Practical notes on fixed Fugu Ultra prices, variable Fugu billing, and availability limits.
Q · 01 What Fugu prices are tracked here? +
Q · 02 Why is regular Fugu not shown as a normal fixed price? +
Q · 03 Does Fugu Ultra have a long-context surcharge? +
$10/M input, $45/M output, and $1.00/M cached input when context exceeds 272K.