Last verified 2026-07-11

NEWEST OPUS1M CONTEXTTEXT + VISIONFAST MODEPROMPT CACHING

Claude Opus 4.8 API Pricing

Q: What is Claude Opus 4.8 priced at?

Anthropic's pricing page lists Claude Opus 4.8 at $5/M input, $25/M output, and $0.50/M cache-hit input. Under AI//COST's 92/8 agentic blend with 82% cache hits, the effective planning figure is $3.21/M.

Q: How does prompt caching work for Opus 4.8?

Cache hits are billed at $0.50/M, which is 10% of the $5/M base input rate. Cache writes cost $6.25/M for 5-minute writes and $10/M for 1-hour writes. Those multipliers stack with data-residency modifiers.

Q: How accurate is the tokenizer estimate?

The browser widget uses an anthropic-bpe-estimate at 2.6 English characters per token. Anthropic says Opus 4.7 and later use a newer tokenizer that may use up to 35% more tokens for the same fixed text compared with previous models.

Claude Opus 4.8 is Anthropic's newest generally available Opus model: $5/M input, $25/M output, and $0.50/M cache-hit input. Fast mode is now $10/M input and $50/M output, three times cheaper than the prior Opus fast-mode tier. Pulled directly from platform.claude.com daily.

Input - per 1M tokens

$5.00/M

Stable Opus tier same as 4.7

Output - per 1M tokens

$25.00/M

Standard full 1M context same as 4.7

Cached input - 90% off

$0.50/M

Cache write $6.25 / $10 -90%

Effective - agentic blend

$3.21/M

92/8 split - 82% cache

§ 01 / TERMINAL

Run the numbers.

Live calculator pre-loaded with Claude Opus 4.8 rates. Tweak spend, output mix, or cache hit rate; share the URL to share the calculation.

Spend

$ /mo

Workload split

Prompt cache hit rate

Tokens you can process

—

Words equivalent (English)

—

Effective rate

—

Open full calculator (all models · share URL · CSV) →

§ 02 / SCENARIOS

Real-world presets.

CODING AGENT

Codebase-scale migration

$0.407/task

120k cache-heavy in / 10k out~245 units/$100

LONG DOC ANALYSIS

Reading 100-page contracts

$0.436/doc

180k cache-heavy in / 8k out~229 units/$100

RAG SUPPORT

Support agent ticket triage

$0.035/ticket

8k cache-heavy in / 1k out~2,816 units/$100

ASSISTANT

Research planning turn

$0.074/turn

18k cache-heavy in / 2k out~1,358 units/$100

§ 03 / TOKENIZER

Paste text. See tokens. See cost.

Your text · live count

Estimate · anthropic-bpe-estimate · ≈2.6 chars/token Auto-counts as you type

This is a chars-per-token approximation, not a real tokenizer. Actual tokens vary by language, code density, and tool-call overhead — counts are typically ±10–20% off for English prose, more for code or non-Latin scripts. For exact billing, use the vendor's official tokenizer.

Characters 495

Words 76

Tokens (estimated) 190 tokens

Cost as input · uncached $0.000950 USD

Cost as output · uncached $0.004750 USD

Cost as cached input $0.000095 USD

§ 04 / SHELF

Up against the shelf.

All models →

Model	Input /M	Output /M	Effective blended	Context	Best for
Claude Opus 4.8 Current	$5.00 cache $0.50	$25.00	$3.21 agentic 92/8	1M	Newest Opus - hard agents and code
Claude Opus 4.7	$5.00 cache $0.50	$25.00	$3.21 same list price	1M	Previous Opus frontier tier
Claude Opus 4.6	$5.00 cache $0.50	$25.00	$3.21 same list price	1M	Older 1M Opus workloads
Claude Sonnet 4.6	$3.00 cache $0.30	$15.00	$1.92 cheaper Anthropic sibling	1M	Production agents and coding
Claude Haiku 4.5	$1.00 cache $0.10	$5.00	$0.64 cheaper Anthropic sibling	200K	Support and classification
GPT-5.5	$5.00 cache $0.50	$30.00	$3.61 frontier competitor	1M	OpenAI frontier agents
Gemini 3.1 Pro Preview	$2.00 cache $0.20	$12.00	$1.44 frontier preview competitor	1M	Multimodal long-context analysis
DeepSeek V4 Pro	$0.43 cache $0.00	$0.87	$0.14 budget reasoning competitor	1M	Low-cost reasoning workloads

Frequently asked.

Practical Claude Opus 4.8 pricing questions, with regular usage separated from fast mode, cache writes, and batch discounts.

Q · 01 What is Claude Opus 4.8 priced at? +

Anthropic's pricing page lists Claude Opus 4.8 at $5/M input, $25/M output, and $0.50/M cache-hit input. Under AI//COST's 92/8 agentic blend with 82% cache hits, the effective planning figure is $3.21/M.

Q · 02 Is Opus 4.8 more expensive than Opus 4.7? +

No. Anthropic says regular Opus 4.8 usage is unchanged from Opus 4.7: $5/M input and $25/M output. The new pricing change is fast mode, which is $10/M input and $50/M output for Opus 4.8, versus $30/M and $150/M for Opus 4.6 / 4.7 fast mode.

Q · 03 How does prompt caching work for Opus 4.8? +

Cache hits are billed at $0.50/M, which is 10% of the $5/M base input rate. Cache writes cost $6.25/M for 5-minute writes and $10/M for 1-hour writes. Those multipliers stack with data-residency modifiers.

Q · 04 Does the full 1M context cost extra? +

No. Anthropic's pricing docs say Claude Opus 4.8 includes the full 1M token context window at standard pricing. A 900k-token request is billed at the same per-token rate as a 9k-token request.

Q · 05 Is there a Batch API discount? +

Yes. Anthropic's Batch API gives a 50% discount on input and output tokens. For Claude Opus 4.8, the published batch row is $2.50/M input and $12.50/M output.

Q · 06 What is fast mode pricing? +

Fast mode is a research-preview option that provides faster output. For Claude Opus 4.8, Anthropic lists fast mode at $10/M input and $50/M output. Fast mode is not available with Batch API and is not available on Claude Platform on AWS.

Q · 07 Does regional pricing differ? +

Yes. For Claude Opus 4.6, Sonnet 4.6, and later models, Anthropic says inference_geo: "us" applies a 1.1x multiplier to all token pricing categories. Global routing is the default and uses the standard prices shown here.

Q · 08 How accurate is the tokenizer estimate? +

The browser widget uses an anthropic-bpe-estimate at 2.6 English characters per token. Anthropic says Opus 4.7 and later use a newer tokenizer that may use up to 35% more tokens for the same fixed text compared with previous models.

Reviewed by Yaroslav Vikhariev Founder - AI//COST - Pricing pulled daily from platform.claude.com - Last verified July 11, 2026

Methodology Report a correction More by Y.V.