Last verified 2026-07-11

DEPRECATED200K CONTEXTTEXT + VISIONPROMPT CACHINGBATCH -50%

Claude Opus 4.1 API Pricing

Q: What is Claude Opus 4.1 priced at?

Anthropic's pricing page lists $15/M input, $75/M output, and $1.5/M cache-hit input for Claude Opus 4.1. Under the 92/8 agentic blend with 82% cache hits, that works out to $9.616/M.

Q: How does prompt caching change the price?

Cache hits are listed at $1.5/M, which is 10% of the $15/M base input rate. Anthropic also charges cache writes at $18.75/M for 5-minute writes and $30/M for 1-hour writes. The calculator models repeated prompt sections as cache hits.

Q: Does long context cost extra?

This page uses the model's documented 200K context label. Longer-context Opus 4.6, Opus 4.7, and Sonnet 4.6 use standard pricing across the full 1M window. Prompt caching and batch discounts still apply according to their normal rules.

Q: Does regional pricing differ?

Anthropic says earlier models do not support the inference_geo parameter and use standard first-party API pricing. Bedrock and Vertex AI publish their own regional pricing policies.

Q: How accurate is the tokenizer estimate?

The live counter uses a Claude-family English estimate of 4.875 characters per token. Actual billing uses Anthropic's server-side tokenizer; code, tables, and non-English text can differ materially.

Deprecated: Anthropic now lists Claude Opus 4.1 as deprecated (verified 2026-07-11) - still callable at the older $15/M input / $75/M output / $1.5/M cache-hit band, but superseded by the $5/$25 Opus 4.8 generation. Retained for price-evolution comparison. Pulled from platform.claude.com.

Input - per 1M tokens

$15.00/M

Stable Anthropic table flat

Output - per 1M tokens

$75.00/M

Stable since launch flat

Cached input - 90% off

$1.50/M

Cache 5min or 1h -90%

Effective - agentic blend

$9.62/M

92/8 split - 82% cache

§ 01 / TERMINAL

Run the numbers.

Live calculator pre-loaded with Claude Opus 4.1 rates. Tweak spend, output mix, or cache hit rate; share the URL to share the calculation.

Spend

$ /mo

Workload split

Prompt cache hit rate

Tokens you can process

—

Words equivalent (English)

—

Effective rate

—

Open full calculator (all models · share URL · CSV) →

§ 02 / SCENARIOS

Real-world presets.

CODING AGENT

Repo-wide bug fix

$0.843/task

81k in - 7k out~118 units/$100

LONG DOC ANALYSIS

Reading 100-page contracts

$1.29/doc

175k in - 8k out~77 units/$100

RAG SUPPORT

Support agent ticket triage

$0.053/ticket

4k in - 1k out~1,886 units/$100

ASSISTANT

Research planning turn

$0.160/turn

12k in - 2k out~625 units/$100

§ 03 / TOKENIZER

Paste text. See tokens. See cost.

Your text · live count

Estimate · anthropic-bpe-estimate · ≈3.5 chars/token Auto-counts as you type

This is a chars-per-token approximation, not a real tokenizer. Actual tokens vary by language, code density, and tool-call overhead — counts are typically ±10–20% off for English prose, more for code or non-Latin scripts. For exact billing, use the vendor's official tokenizer.

Characters 354

Words 58

Tokens (estimated) 101 tokens

Cost as input · uncached $0.001515 USD

Cost as output · uncached $0.007575 USD

Cost as cached input $0.000151 USD

§ 04 / SHELF

Up against the shelf.

All models →

Model	Input /M	Output /M	Effective blended	Context	Best for
Claude Opus 4.7	$5.00 cache $0.50	$25.00	$3.21 cheaper	1M	Frontier reasoning and hard code
Claude Opus 4.6	$5.00 cache $0.50	$25.00	$3.21 cheaper	1M	Frontier reasoning and hard code
Claude Opus 4.5	$5.00 cache $0.50	$25.00	$3.21 cheaper	200K	Frontier reasoning and hard code
Claude Opus 4.1 Current	$15.00 cache $1.50	$75.00	$9.62 agentic 92/8	200K	Legacy Opus workloads
Claude Sonnet 4.6	$3.00 cache $0.30	$15.00	$1.92 cheaper	1M	Production agents and coding
Claude Sonnet 4.5	$3.00 cache $0.30	$15.00	$1.92 cheaper	200K	Production agents and coding
Claude Haiku 4.5	$1.00 cache $0.10	$5.00	$0.64 cheaper	200K	Support and classification
GPT-5.4	$2.50 cache $0.25	$15.00	$1.80 cheaper	1.05M	Tool use and app agents
Gemini 2.5 Pro	$1.25 cache $0.13	$10.00	$1.10 cheaper	2M	Long-context document analysis

Frequently asked.

Practical pricing questions, with vendor list prices separated from workload assumptions.

Q · 01 What is Claude Opus 4.1 priced at? +

Anthropic's pricing page lists $15/M input, $75/M output, and $1.5/M cache-hit input for Claude Opus 4.1. Under the 92/8 agentic blend with 82% cache hits, that works out to $9.616/M.

Q · 02 How does prompt caching change the price? +

Cache hits are listed at $1.5/M, which is 10% of the $15/M base input rate. Anthropic also charges cache writes at $18.75/M for 5-minute writes and $30/M for 1-hour writes. The calculator models repeated prompt sections as cache hits.

Q · 03 Is there a Batch API discount? +

Yes. Anthropic's Batch API gives a 50% discount on input and output tokens. For Claude Opus 4.1, the batch table maps to $7.5/M input and $37.5/M output.

Q · 04 Does long context cost extra? +

This page uses the model's documented 200K context label. Longer-context Opus 4.6, Opus 4.7, and Sonnet 4.6 use standard pricing across the full 1M window. Prompt caching and batch discounts still apply according to their normal rules.

Q · 05 Does regional pricing differ? +

Anthropic says earlier models do not support the inference_geo parameter and use standard first-party API pricing. Bedrock and Vertex AI publish their own regional pricing policies.

Q · 06 Are volume discounts published? +

No fixed public volume-discount ladder is published. Enterprise and committed-spend contracts can have private terms, but standard API users should assume the list prices shown here unless they have a separate agreement.

Q · 07 How accurate is the tokenizer estimate? +

The live counter uses a Claude-family English estimate of 4.875 characters per token. Actual billing uses Anthropic's server-side tokenizer; code, tables, and non-English text can differ materially.

Reviewed by Yaroslav Vikhariev Founder - AI//COST - Pricing pulled daily from platform.claude.com - Last verified July 11, 2026

Methodology Report a correction More by Y.V.