Claude Opus 4.8 API Pricing
Claude Opus 4.8 is Anthropic's newest generally available Opus model: $5/M input, $25/M output, and $0.50/M cache-hit input. Fast mode is now $10/M input and $50/M output, three times cheaper than the prior Opus fast-mode tier. Pulled directly from platform.claude.com daily.
Run the numbers.
Live calculator pre-loaded with Claude Opus 4.8 rates. Tweak spend, output mix, or cache hit rate; share the URL to share the calculation.
Real-world presets.
Codebase-scale migration
Reading 100-page contracts
Support agent ticket triage
Research planning turn
Price history.
Paste text. See tokens. See cost.
This is a chars-per-token approximation, not a real tokenizer. Actual tokens vary by language, code density, and tool-call overhead — counts are typically ±10–20% off for English prose, more for code or non-Latin scripts. For exact billing, use the vendor's official tokenizer.
| Model | Input /M | Output /M | Effective blended | Context | Best for |
|---|---|---|---|---|---|
| Claude Opus 4.8 Current | $5.00 cache $0.50 | $25.00 | $3.21 agentic 92/8 | 1M | Newest Opus - hard agents and code |
| Claude Opus 4.7 | $5.00 cache $0.50 | $25.00 | $3.21 same list price | 1M | Previous Opus frontier tier |
| Claude Opus 4.6 | $5.00 cache $0.50 | $25.00 | $3.21 same list price | 1M | Older 1M Opus workloads |
| Claude Sonnet 4.6 | $3.00 cache $0.30 | $15.00 | $1.92 cheaper Anthropic sibling | 1M | Production agents and coding |
| Claude Haiku 4.5 | $1.00 cache $0.10 | $5.00 | $0.64 cheaper Anthropic sibling | 200K | Support and classification |
| GPT-5.5 | $5.00 cache $0.50 | $30.00 | $3.61 frontier competitor | 1M | OpenAI frontier agents |
| Gemini 3.1 Pro Preview | $2.00 cache $0.20 | $12.00 | $1.44 frontier preview competitor | 1M | Multimodal long-context analysis |
| DeepSeek V4 Pro | $0.43 cache $0.00 | $0.87 | $0.14 budget reasoning competitor | 1M | Low-cost reasoning workloads |
Frequently asked.
Practical Claude Opus 4.8 pricing questions, with regular usage separated from fast mode, cache writes, and batch discounts.
Q · 01 What is Claude Opus 4.8 priced at? +
$5/M input, $25/M output, and $0.50/M cache-hit input. Under AI//COST's 92/8 agentic blend with 82% cache hits, the effective planning figure is $3.21/M.Q · 02 Is Opus 4.8 more expensive than Opus 4.7? +
$5/M input and $25/M output. The new pricing change is fast mode, which is $10/M input and $50/M output for Opus 4.8, versus $30/M and $150/M for Opus 4.6 / 4.7 fast mode.Q · 03 How does prompt caching work for Opus 4.8? +
$0.50/M, which is 10% of the $5/M base input rate. Cache writes cost $6.25/M for 5-minute writes and $10/M for 1-hour writes. Those multipliers stack with data-residency modifiers.Q · 04 Does the full 1M context cost extra? +
1M token context window at standard pricing. A 900k-token request is billed at the same per-token rate as a 9k-token request.Q · 05 Is there a Batch API discount? +
$2.50/M input and $12.50/M output.Q · 06 What is fast mode pricing? +
$10/M input and $50/M output. Fast mode is not available with Batch API and is not available on Claude Platform on AWS.Q · 07 Does regional pricing differ? +
inference_geo: "us" applies a 1.1x multiplier to all token pricing categories. Global routing is the default and uses the standard prices shown here.Q · 08 How accurate is the tokenizer estimate? +
anthropic-bpe-estimate at 2.6 English characters per token. Anthropic says Opus 4.7 and later use a newer tokenizer that may use up to 35% more tokens for the same fixed text compared with previous models.