Last verified
FRONTIER OPUS1M CONTEXTTEXT + VISIONPROMPT CACHINGBATCH -50%

Claude Opus 4.6 API Pricing

Live API pricing for Claude Opus 4.6: $5/M input, $25/M output, and $0.5/M cache-hit input. Opus 4.6 is an active frontier Opus generation with 1M context at standard pricing. Pulled directly from platform.claude.com daily.

Input - per 1M tokens
$5.00/M
Stable Anthropic table flat
Output - per 1M tokens
$25.00/M
Stable since launch flat
Cached input - 90% off
$0.50/M
Cache 5min or 1h -90%
Effective - agentic blend
$3.21/M
92/8 split - 82% cache
§ 01 / TERMINAL

Run the numbers.

Live calculator pre-loaded with Claude Opus 4.6 rates. Tweak spend, output mix, or cache hit rate; share the URL to share the calculation.

$ /mo
Workload split
Prompt cache hit rate
Tokens you can process
Words equivalent (English)
Effective rate
§ 02 / SCENARIOS

Real-world presets.

§ 03 / TAPE

Price history.

Held at $5/M input and $25/M output since launch.

Input · $5/M
Output · $25/M
Cached · $0.50/M
FEB 05 Launch at $5/M - $25/MMAY 18 Verified unchanged
§ 04 / TOKENIZER

Paste text. See tokens. See cost.

Estimate · anthropic-bpe-estimate · ≈2.6 chars/token Auto-counts as you type

This is a chars-per-token approximation, not a real tokenizer. Actual tokens vary by language, code density, and tool-call overhead — counts are typically ±10–20% off for English prose, more for code or non-Latin scripts. For exact billing, use the vendor's official tokenizer.

Characters
Words
Tokens (estimated)
Cost as input · uncached
Cost as output · uncached
Cost as cached input
§ 05 / SHELF

Up against the shelf.

All models →
Model Input /M Output /M Effective blended Context Best for
Claude Opus 4.7 $5.00 cache $0.50 $25.00 $3.21 same blend 1M Frontier reasoning and hard code
Claude Opus 4.6 Current $5.00 cache $0.50 $25.00 $3.21 agentic 92/8 1M Frontier reasoning and hard code
Claude Opus 4.5 $5.00 cache $0.50 $25.00 $3.21 same blend 200K Frontier reasoning and hard code
Claude Opus 4.1 $15.00 cache $1.50 $75.00 $9.62 pricier 200K Legacy Opus workloads
Claude Sonnet 4.6 $3.00 cache $0.30 $15.00 $1.92 cheaper 1M Production agents and coding
Claude Sonnet 4.5 $3.00 cache $0.30 $15.00 $1.92 cheaper 200K Production agents and coding
Claude Haiku 4.5 $1.00 cache $0.10 $5.00 $0.64 cheaper 200K Support and classification
GPT-5.4 $2.50 cache $0.25 $15.00 $1.80 cheaper 1.05M Tool use and app agents
Gemini 2.5 Pro $1.25 cache $0.13 $10.00 $1.10 cheaper 2M Long-context document analysis

Frequently asked.

Practical pricing questions, with vendor list prices separated from workload assumptions.

Q · 01 What is Claude Opus 4.6 priced at? +
Anthropic's pricing page lists $5/M input, $25/M output, and $0.5/M cache-hit input for Claude Opus 4.6. Under the 92/8 agentic blend with 82% cache hits, that works out to $3.205/M.
Q · 02 How does prompt caching change the price? +
Cache hits are listed at $0.5/M, which is 10% of the $5/M base input rate. Anthropic also charges cache writes at $6.25/M for 5-minute writes and $10/M for 1-hour writes. The calculator models repeated prompt sections as cache hits.
Q · 03 Is there a Batch API discount? +
Yes. Anthropic's Batch API gives a 50% discount on input and output tokens. For Claude Opus 4.6, the batch table maps to $2.5/M input and $12.5/M output.
Q · 04 Does long context cost extra? +
No. Anthropic says Claude Opus 4.6 includes the full 1M token context window at standard pricing. Prompt caching and batch discounts still apply according to their normal rules.
Q · 05 Does regional pricing differ? +
For first-party Claude API US-only inference, Anthropic applies a 1.1x multiplier to all token categories for this generation. Bedrock and Vertex AI publish their own regional pricing policies.
Q · 06 Are volume discounts published? +
No fixed public volume-discount ladder is published. Enterprise and committed-spend contracts can have private terms, but standard API users should assume the list prices shown here unless they have a separate agreement.
Q · 07 How accurate is the tokenizer estimate? +
The live counter uses a Claude-family English estimate of 4.875 characters per token. Actual billing uses Anthropic's server-side tokenizer; code, tables, and non-English text can differ materially.