Cohere API Pricing
Cohere is an enterprise AI company behind Command, Aya, Embed, Rerank, and North. Cohere currently publishes explicit per-token prices for Aya Expanse; newer Command models are listed as live but are sold through custom or private-deployment pricing.
Command R7B LAUNCHED DEC 2024
Cohere's official public pricing card lists $0.0375 per million input tokens and $0.15 per million output tokens; no separate cached-input price is published.
Aya Expanse 32B LAUNCHED JUN 2026
The Cohere pricing FAQ lists Aya Expanse 8B and 32B on the API at $0.50 per million input tokens and $1.50 per million output tokens.
Command R LAUNCHED AUG 2024
Cohere's official public pricing card lists $0.15 per million input tokens and $0.60 per million output tokens; no separate cached-input price is published.
| Model | Input ($/M tokens) | Output ($/M tokens) | Cached | Context | Max output | Vision | Tools | Tier |
|---|---|---|---|---|---|---|---|---|
| Command R7B | $0.0375 | $0.15 | — | 128K | — | ✗ | ✓ | Active |
| Aya Expanse 32B (flagship) | $0.50 | $1.50 | — | 128K | — | ✗ | ✓ | Active |
| Command R | $0.15 | $0.60 | — | 128K | — | ✗ | ✓ | Active |
Pricing across the lineup.
Cohere pricing notes.
AI//COST first added Cohere after verifying the public pricing page on 2026-06-14.
Cohere offers Command, Aya, Embed, Rerank, North, and private deployment products. The current pricing page exposes public token prices for Aya Expanse, while newer Command models are presented through enterprise or dedicated-deployment pricing.
AI//COST tracks the publicly priced Aya Expanse API row and treats Command A-family pricing as custom until Cohere publishes token list prices.
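As a rough illustration of how the per-million-token prices listed above translate into the cost of a single workload, the sketch below multiplies token counts by the listed rates. The names `PRICES_PER_MILLION` and `estimate_cost` are hypothetical, chosen for this example; they are not part of any Cohere SDK or AI//COST tooling. The sketch assumes no cached-token discount, since none is published, and it ignores any custom or enterprise pricing.

```python
from decimal import Decimal

# List prices in USD per one million tokens, as (input, output),
# copied from the pricing cards above. Hypothetical lookup table for illustration.
PRICES_PER_MILLION = {
    "Command R7B": (Decimal("0.0375"), Decimal("0.15")),
    "Aya Expanse 32B": (Decimal("0.50"), Decimal("1.50")),
    "Command R": (Decimal("0.15"), Decimal("0.60")),
}

MILLION = Decimal(1_000_000)


def estimate_cost(model: str, input_tokens: int, output_tokens: int) -> Decimal:
    """Return the estimated USD cost of a workload at list prices.

    Assumes every token is billed at the full list rate (no caching discount).
    """
    input_price, output_price = PRICES_PER_MILLION[model]
    return (input_tokens * input_price + output_tokens * output_price) / MILLION


if __name__ == "__main__":
    # Example workload: 200,000 input tokens and 50,000 output tokens on Command R.
    print(estimate_cost("Command R", 200_000, 50_000))
```

For the example workload, the input side costs 200,000 × $0.15 / 1,000,000 = $0.03 and the output side costs 50,000 × $0.60 / 1,000,000 = $0.03, for a total of $0.06. `Decimal` is used instead of floats so that small per-token amounts do not pick up binary rounding error.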
OpenAI
Broad model catalog and API ecosystem; useful reference for general-purpose chat pricing.
SAFETY LAB: Anthropic
Claude is the major enterprise comparator for agentic and document-heavy workloads.
OPEN-WEIGHT: Alibaba (Qwen)
Qwen provides low-cost open and proprietary options for text and code workloads.
CHINA CONSUMER LEADER: ByteDance (Doubao)
The Doubao model family from ByteDance — parent of TikTok and Douyin. Served through the Volcano Ark (火山方舟) platform on ByteDance's Volcano Engine cloud, Doubao powers China's most-used consumer AI app and undercuts most frontier labs on price.
EU FRONTIER LAB: Mistral AI
The Paris lab behind Le Chat and the open-weight Mistral family. Founded in 2023 by Arthur Mensch with ex-DeepMind and Meta researchers, it ships frontier-grade models with Apache-2.0 open weights for most tiers and EU-native data residency — the leading European alternative to US labs.
GLM ARCHITECT: Zhipu (Z.ai / GLM)
The Beijing lab behind the GLM models, also operating internationally as Z.ai. Spun out of Tsinghua University in 2019, Zhipu ships strong coding/agentic models, multiple free tiers, and open weights — and was the first of China's "Six Tigers" to pursue an IPO. It also sits on the US Entity List.
Frequently asked.
Practical notes on Cohere pricing and what AI//COST includes.