Last verified
NEWEST OPUS1M CONTEXTTEXT + VISIONFAST MODEPROMPT CACHING

Claude Opus 4.8 API Pricing

Claude Opus 4.8 is Anthropic's newest generally available Opus model: $5/M input, $25/M output, and $0.50/M cache-hit input. Fast mode is now $10/M input and $50/M output, three times cheaper than the prior Opus fast-mode tier. Pulled directly from platform.claude.com daily.

Input - per 1M tokens
$5.00/M
Stable Opus tier same as 4.7
Output - per 1M tokens
$25.00/M
Standard full 1M context same as 4.7
Cached input - 90% off
$0.50/M
Cache write $6.25 / $10 -90%
Effective - agentic blend
$3.21/M
92/8 split - 82% cache
§ 01 / TERMINAL

Run the numbers.

Live calculator pre-loaded with Claude Opus 4.8 rates. Tweak spend, output mix, or cache hit rate; share the URL to share the calculation.

$ /mo
Workload split
Prompt cache hit rate
Tokens you can process
Words equivalent (English)
Effective rate
§ 02 / SCENARIOS

Real-world presets.

§ 03 / TAPE

Price history.

Launched at the same $5/M input and $25/M output standard Opus price as 4.7.

Input · $5/M
Output · $25/M
Cached · $0.50/M
MAY 28 Launch at $5/M input and $25/M outputMAY 28 Fast mode introduced at $10/M input and $50/M outputMAY 29 Verified unchanged on Anthropic pricing docs
§ 04 / TOKENIZER

Paste text. See tokens. See cost.

Estimate · anthropic-bpe-estimate · ≈2.6 chars/token Auto-counts as you type

This is a chars-per-token approximation, not a real tokenizer. Actual tokens vary by language, code density, and tool-call overhead — counts are typically ±10–20% off for English prose, more for code or non-Latin scripts. For exact billing, use the vendor's official tokenizer.

Characters
Words
Tokens (estimated)
Cost as input · uncached
Cost as output · uncached
Cost as cached input
§ 05 / SHELF

Up against the shelf.

All models →
Model Input /M Output /M Effective blended Context Best for
Claude Opus 4.8 Current $5.00 cache $0.50 $25.00 $3.21 agentic 92/8 1M Newest Opus - hard agents and code
Claude Opus 4.7 $5.00 cache $0.50 $25.00 $3.21 same list price 1M Previous Opus frontier tier
Claude Opus 4.6 $5.00 cache $0.50 $25.00 $3.21 same list price 1M Older 1M Opus workloads
Claude Sonnet 4.6 $3.00 cache $0.30 $15.00 $1.92 cheaper Anthropic sibling 1M Production agents and coding
Claude Haiku 4.5 $1.00 cache $0.10 $5.00 $0.64 cheaper Anthropic sibling 200K Support and classification
GPT-5.5 $5.00 cache $0.50 $30.00 $3.61 frontier competitor 1M OpenAI frontier agents
Gemini 3.1 Pro Preview $2.00 cache $0.20 $12.00 $1.44 frontier preview competitor 1M Multimodal long-context analysis
DeepSeek V4 Pro $0.43 cache $0.00 $0.87 $0.14 budget reasoning competitor 1M Low-cost reasoning workloads

Frequently asked.

Practical Claude Opus 4.8 pricing questions, with regular usage separated from fast mode, cache writes, and batch discounts.

Q · 01 What is Claude Opus 4.8 priced at? +
Anthropic's pricing page lists Claude Opus 4.8 at $5/M input, $25/M output, and $0.50/M cache-hit input. Under AI//COST's 92/8 agentic blend with 82% cache hits, the effective planning figure is $3.21/M.
Q · 02 Is Opus 4.8 more expensive than Opus 4.7? +
No. Anthropic says regular Opus 4.8 usage is unchanged from Opus 4.7: $5/M input and $25/M output. The new pricing change is fast mode, which is $10/M input and $50/M output for Opus 4.8, versus $30/M and $150/M for Opus 4.6 / 4.7 fast mode.
Q · 03 How does prompt caching work for Opus 4.8? +
Cache hits are billed at $0.50/M, which is 10% of the $5/M base input rate. Cache writes cost $6.25/M for 5-minute writes and $10/M for 1-hour writes. Those multipliers stack with data-residency modifiers.
Q · 04 Does the full 1M context cost extra? +
No. Anthropic's pricing docs say Claude Opus 4.8 includes the full 1M token context window at standard pricing. A 900k-token request is billed at the same per-token rate as a 9k-token request.
Q · 05 Is there a Batch API discount? +
Yes. Anthropic's Batch API gives a 50% discount on input and output tokens. For Claude Opus 4.8, the published batch row is $2.50/M input and $12.50/M output.
Q · 06 What is fast mode pricing? +
Fast mode is a research-preview option that provides faster output. For Claude Opus 4.8, Anthropic lists fast mode at $10/M input and $50/M output. Fast mode is not available with Batch API and is not available on Claude Platform on AWS.
Q · 07 Does regional pricing differ? +
Yes. For Claude Opus 4.6, Sonnet 4.6, and later models, Anthropic says inference_geo: "us" applies a 1.1x multiplier to all token pricing categories. Global routing is the default and uses the standard prices shown here.
Q · 08 How accurate is the tokenizer estimate? +
The browser widget uses an anthropic-bpe-estimate at 2.6 English characters per token. Anthropic says Opus 4.7 and later use a newer tokenizer that may use up to 35% more tokens for the same fixed text compared with previous models.