FRONTIER OPUS1M CONTEXTTEXT + VISIONPROMPT CACHINGBATCH -50%
Claude Opus 4.6 API Pricing
Live API pricing for Claude Opus 4.6: $5/M input, $25/M output, and $0.5/M cache-hit input. Opus 4.6 is an active frontier Opus generation with 1M context at standard pricing. Pulled directly from platform.claude.com daily.
Input - per 1M tokens
$5.00/M
Stable Anthropic table flat
Output - per 1M tokens
$25.00/M
Stable since launch flat
Cached input - 90% off
$0.50/M
Cache 5min or 1h -90%
Effective - agentic blend
$3.21/M
92/8 split - 82% cache
§ 01 / TERMINAL
Run the numbers.
Live calculator pre-loaded with Claude Opus 4.6 rates. Tweak spend, output mix, or cache hit rate; share the URL to share the calculation.
$ /mo
Workload split
Prompt cache hit rate
Tokens you can process
—
Words equivalent (English)
—
Effective rate
—
§ 02 / SCENARIOS
Real-world presets.
CODING AGENT
Repo-wide bug fix
$0.281/task
LONG DOC ANALYSIS
Reading 100-page contracts
$0.429/doc
RAG SUPPORT
Support agent ticket triage
$0.018/ticket
ASSISTANT
Research planning turn
$0.053/turn
§ 03 / TAPE
Price history.
Input · $5/M
Output · $25/M
Cached · $0.50/M
FEB 05 Launch at $5/M - $25/MMAY 18 Verified unchanged
§ 04 / TOKENIZER
Paste text. See tokens. See cost.
Estimate · anthropic-bpe-estimate · ≈2.6 chars/token Auto-counts as you type
This is a chars-per-token approximation, not a real tokenizer. Actual tokens vary by language, code density, and tool-call overhead — counts are typically ±10–20% off for English prose, more for code or non-Latin scripts. For exact billing, use the vendor's official tokenizer.
Characters —
Words —
Tokens (estimated) —
Cost as input · uncached —
Cost as output · uncached —
Cost as cached input —
| Model | Input /M | Output /M | Effective blended | Context | Best for |
|---|---|---|---|---|---|
| Claude Opus 4.7 | $5.00 cache $0.50 | $25.00 | $3.21 same blend | 1M | Frontier reasoning and hard code |
| Claude Opus 4.6 Current | $5.00 cache $0.50 | $25.00 | $3.21 agentic 92/8 | 1M | Frontier reasoning and hard code |
| Claude Opus 4.5 | $5.00 cache $0.50 | $25.00 | $3.21 same blend | 200K | Frontier reasoning and hard code |
| Claude Opus 4.1 | $15.00 cache $1.50 | $75.00 | $9.62 pricier | 200K | Legacy Opus workloads |
| Claude Sonnet 4.6 | $3.00 cache $0.30 | $15.00 | $1.92 cheaper | 1M | Production agents and coding |
| Claude Sonnet 4.5 | $3.00 cache $0.30 | $15.00 | $1.92 cheaper | 200K | Production agents and coding |
| Claude Haiku 4.5 | $1.00 cache $0.10 | $5.00 | $0.64 cheaper | 200K | Support and classification |
| GPT-5.4 | $2.50 cache $0.25 | $15.00 | $1.80 cheaper | 1.05M | Tool use and app agents |
| Gemini 2.5 Pro | $1.25 cache $0.13 | $10.00 | $1.10 cheaper | 2M | Long-context document analysis |
Frequently asked.
Practical pricing questions, with vendor list prices separated from workload assumptions.
Q · 01 What is Claude Opus 4.6 priced at? +
Anthropic's pricing page lists
$5/M input, $25/M output, and $0.5/M cache-hit input for Claude Opus 4.6. Under the 92/8 agentic blend with 82% cache hits, that works out to $3.205/M.Q · 02 How does prompt caching change the price? +
Cache hits are listed at
$0.5/M, which is 10% of the $5/M base input rate. Anthropic also charges cache writes at $6.25/M for 5-minute writes and $10/M for 1-hour writes. The calculator models repeated prompt sections as cache hits.Q · 03 Is there a Batch API discount? +
Yes. Anthropic's Batch API gives a 50% discount on input and output tokens. For Claude Opus 4.6, the batch table maps to
$2.5/M input and $12.5/M output.Q · 04 Does long context cost extra? +
No. Anthropic says Claude Opus 4.6 includes the full
1M token context window at standard pricing. Prompt caching and batch discounts still apply according to their normal rules.Q · 05 Does regional pricing differ? +
For first-party Claude API US-only inference, Anthropic applies a
1.1x multiplier to all token categories for this generation. Bedrock and Vertex AI publish their own regional pricing policies.Q · 06 Are volume discounts published? +
No fixed public volume-discount ladder is published. Enterprise and committed-spend contracts can have private terms, but standard API users should assume the list prices shown here unless they have a separate agreement.
Q · 07 How accurate is the tokenizer estimate? +
The live counter uses a Claude-family English estimate of
4.875 characters per token. Actual billing uses Anthropic's server-side tokenizer; code, tables, and non-English text can differ materially.