Question 1

Is Claude Opus 4 still recommended for new API usage?

Accepted Answer

Anthropic lists this model as deprecated, with retirement scheduled for June 15, 2026. The current published price is still $15/M input and $75/M output, but Anthropic recommends migrating to claude-opus-4-7.

Question 2

What should I migrate Claude Opus 4 workloads to?

Accepted Answer

Use claude-opus-4-7 for new production work. Anthropic's deprecation table names that replacement, while this archive keeps the old list price for invoice checks and migration comparisons.

Question 3

How does prompt caching change the price?

Accepted Answer

Cache hits are listed at $1.5/M, which is 10% of the $15/M base input rate. Anthropic also charges cache writes at $18.75/M for 5-minute writes and $30/M for 1-hour writes. The calculator models repeated prompt sections as cache hits.

Question 4

Is there a Batch API discount?

Accepted Answer

Yes. Anthropic's Batch API gives a 50% discount on input and output tokens. For Claude Opus 4, the batch table maps to $7.5/M input and $37.5/M output.

Question 5

Does long context cost extra?

Accepted Answer

This page uses the model's documented 200K context label. Longer-context Opus 4.6, Opus 4.7, and Sonnet 4.6 use standard pricing across the full 1M window. Prompt caching and batch discounts still apply according to their normal rules.

Question 6

Does regional pricing differ?

Accepted Answer

Anthropic says earlier models do not support the inference_geo parameter and use standard first-party API pricing. Bedrock and Vertex AI publish their own regional pricing policies.

Question 7

Are volume discounts published?

Accepted Answer

No fixed public volume-discount ladder is published. Enterprise and committed-spend contracts can have private terms, but standard API users should assume the list prices shown here unless they have a separate agreement.

Question 8

How accurate is the tokenizer estimate?

Accepted Answer

The live counter uses a Claude-family English estimate of 4.875 characters per token. Actual billing uses Anthropic's server-side tokenizer; code, tables, and non-English text can differ materially.

Model	Input /M	Output /M	Effective blended	Context	Best for
Claude Opus 4.7	$5.00 cache $0.50	$25.00	$3.21 cheaper	1M	Frontier reasoning and hard code
Claude Opus 4.6	$5.00 cache $0.50	$25.00	$3.21 cheaper	1M	Frontier reasoning and hard code
Claude Opus 4.5	$5.00 cache $0.50	$25.00	$3.21 cheaper	200K	Frontier reasoning and hard code
Claude Opus 4.1	$15.00 cache $1.50	$75.00	$9.62 same blend	200K	Legacy Opus workloads
Claude Sonnet 4.6	$3.00 cache $0.30	$15.00	$1.92 cheaper	1M	Production agents and coding
Claude Sonnet 4.5	$3.00 cache $0.30	$15.00	$1.92 cheaper	200K	Production agents and coding
Claude Haiku 4.5	$1.00 cache $0.10	$5.00	$0.64 cheaper	200K	Support and classification
GPT-5.4	$2.50 cache $0.25	$15.00	$1.80 cheaper	1.05M	Tool use and app agents
Gemini 2.5 Pro	$1.25 cache $0.13	$10.00	$1.10 cheaper	2M	Long-context document analysis

Claude Opus 4 API Pricing

Run the numbers.

Real-world presets.

Repo-wide bug fix

Reading 100-page contracts

Support agent ticket triage

Research planning turn

Paste text. See tokens. See cost.

Up against the shelf.

Frequently asked.