Last verified 2026-07-11

SONNET LEGACY200K CONTEXTTEXT + VISIONPROMPT CACHINGBATCH -50%

Claude Sonnet 4.5 API Pricing

Q: What is Claude Sonnet 4.5 priced at?

Anthropic's pricing page lists $3/M input, $15/M output, and $0.3/M cache-hit input for Claude Sonnet 4.5. Under the 92/8 agentic blend with 82% cache hits, that works out to $1.923/M.

Q: How does prompt caching change the price?

Cache hits are listed at $0.3/M, which is 10% of the $3/M base input rate. Anthropic also charges cache writes at $3.75/M for 5-minute writes and $6/M for 1-hour writes. The calculator models repeated prompt sections as cache hits.

Q: Does long context cost extra?

This page uses the model's documented 200K context label. Longer-context Opus 4.6, Opus 4.7, and Sonnet 4.6 use standard pricing across the full 1M window. Prompt caching and batch discounts still apply according to their normal rules.

Q: How accurate is the tokenizer estimate?

The live counter uses a Claude-family English estimate of 4.875 characters per token. Actual billing uses Anthropic's server-side tokenizer; code, tables, and non-English text can differ materially.

Live API pricing for Claude Sonnet 4.5: $3/M input, $15/M output, and $0.3/M cache-hit input. Sonnet 4.5 remains active at the standard Sonnet price band and is useful for stable migrations that have not moved to Sonnet 4.6 yet. Pulled directly from platform.claude.com daily.

Input - per 1M tokens

$3.00/M

Stable Anthropic table flat

Output - per 1M tokens

$15.00/M

Stable since launch flat

Cached input - 90% off

$0.30/M

Cache 5min or 1h -90%

Effective - agentic blend

$1.92/M

92/8 split - 82% cache

§ 01 / TERMINAL

Run the numbers.

Live calculator pre-loaded with Claude Sonnet 4.5 rates. Tweak spend, output mix, or cache hit rate; share the URL to share the calculation.

Spend

$ /mo

Workload split

Prompt cache hit rate

Tokens you can process

—

Words equivalent (English)

—

Effective rate

—

Open full calculator (all models · share URL · CSV) →

§ 02 / SCENARIOS

Real-world presets.

CODING AGENT

Repo-wide bug fix

$0.169/task

81k in - 7k out~591 units/$100

LONG DOC ANALYSIS

Reading 100-page contracts

$0.258/doc

175k in - 8k out~387 units/$100

RAG SUPPORT

Support agent ticket triage

$0.011/ticket

4k in - 1k out~9,090 units/$100

ASSISTANT

Research planning turn

$0.032/turn

12k in - 2k out~3,125 units/$100

§ 03 / TOKENIZER

Paste text. See tokens. See cost.

Your text · live count

Estimate · anthropic-bpe-estimate · ≈3.5 chars/token Auto-counts as you type

This is a chars-per-token approximation, not a real tokenizer. Actual tokens vary by language, code density, and tool-call overhead — counts are typically ±10–20% off for English prose, more for code or non-Latin scripts. For exact billing, use the vendor's official tokenizer.

Characters 341

Words 54

Tokens (estimated) 97 tokens

Cost as input · uncached $0.000291 USD

Cost as output · uncached $0.001455 USD

Cost as cached input $0.000029 USD

§ 04 / SHELF

Up against the shelf.

All models →

Model	Input /M	Output /M	Effective blended	Context	Best for
Claude Opus 4.7	$5.00 cache $0.50	$25.00	$3.21 pricier	1M	Frontier reasoning and hard code
Claude Opus 4.6	$5.00 cache $0.50	$25.00	$3.21 pricier	1M	Frontier reasoning and hard code
Claude Opus 4.5	$5.00 cache $0.50	$25.00	$3.21 pricier	200K	Frontier reasoning and hard code
Claude Opus 4.1	$15.00 cache $1.50	$75.00	$9.62 pricier	200K	Legacy Opus workloads
Claude Sonnet 4.6	$3.00 cache $0.30	$15.00	$1.92 same blend	1M	Production agents and coding
Claude Sonnet 4.5 Current	$3.00 cache $0.30	$15.00	$1.92 agentic 92/8	200K	Production agents and coding
Claude Haiku 4.5	$1.00 cache $0.10	$5.00	$0.64 cheaper	200K	Support and classification
GPT-5.4	$2.50 cache $0.25	$15.00	$1.80 cheaper	1.05M	Tool use and app agents
Gemini 2.5 Pro	$1.25 cache $0.13	$10.00	$1.10 cheaper	2M	Long-context document analysis

Frequently asked.

Practical pricing questions, with vendor list prices separated from workload assumptions.

Q · 01 What is Claude Sonnet 4.5 priced at? +

Anthropic's pricing page lists $3/M input, $15/M output, and $0.3/M cache-hit input for Claude Sonnet 4.5. Under the 92/8 agentic blend with 82% cache hits, that works out to $1.923/M.

Q · 02 How does prompt caching change the price? +

Cache hits are listed at $0.3/M, which is 10% of the $3/M base input rate. Anthropic also charges cache writes at $3.75/M for 5-minute writes and $6/M for 1-hour writes. The calculator models repeated prompt sections as cache hits.

Q · 03 Is there a Batch API discount? +

Yes. Anthropic's Batch API gives a 50% discount on input and output tokens. For Claude Sonnet 4.5, the batch table maps to $1.5/M input and $7.5/M output.

Q · 04 Does long context cost extra? +

This page uses the model's documented 200K context label. Longer-context Opus 4.6, Opus 4.7, and Sonnet 4.6 use standard pricing across the full 1M window. Prompt caching and batch discounts still apply according to their normal rules.

Q · 05 Does regional pricing differ? +

For first-party Claude API US-only inference, Anthropic applies a 1.1x multiplier to all token categories for this generation. Bedrock and Vertex AI publish their own regional pricing policies.

Q · 06 Are volume discounts published? +

No fixed public volume-discount ladder is published. Enterprise and committed-spend contracts can have private terms, but standard API users should assume the list prices shown here unless they have a separate agreement.

Q · 07 How accurate is the tokenizer estimate? +

The live counter uses a Claude-family English estimate of 4.875 characters per token. Actual billing uses Anthropic's server-side tokenizer; code, tables, and non-English text can differ materially.

Reviewed by Yaroslav Vikhariev Founder - AI//COST - Pricing pulled daily from platform.claude.com - Last verified July 11, 2026

Methodology Report a correction More by Y.V.