Last verified
SONNET LEGACY200K CONTEXTTEXT + VISIONPROMPT CACHINGBATCH -50%

Claude Sonnet 4.5 API Pricing

Live API pricing for Claude Sonnet 4.5: $3/M input, $15/M output, and $0.3/M cache-hit input. Sonnet 4.5 remains active at the standard Sonnet price band and is useful for stable migrations that have not moved to Sonnet 4.6 yet. Pulled directly from platform.claude.com daily.

Input - per 1M tokens
$3.00/M
Stable Anthropic table flat
Output - per 1M tokens
$15.00/M
Stable since launch flat
Cached input - 90% off
$0.30/M
Cache 5min or 1h -90%
Effective - agentic blend
$1.92/M
92/8 split - 82% cache
§ 01 / TERMINAL

Run the numbers.

Live calculator pre-loaded with Claude Sonnet 4.5 rates. Tweak spend, output mix, or cache hit rate; share the URL to share the calculation.

$ /mo
Workload split
Prompt cache hit rate
Tokens you can process
Words equivalent (English)
Effective rate
§ 02 / SCENARIOS

Real-world presets.

§ 03 / TAPE

Price history.

Held at $3/M input and $15/M output since launch.

Input · $3/M
Output · $15/M
Cached · $0.30/M
SEP 29 Launch at $3/M - $15/MMAY 18 Verified unchanged
§ 04 / TOKENIZER

Paste text. See tokens. See cost.

Estimate · anthropic-bpe-estimate · ≈3.5 chars/token Auto-counts as you type

This is a chars-per-token approximation, not a real tokenizer. Actual tokens vary by language, code density, and tool-call overhead — counts are typically ±10–20% off for English prose, more for code or non-Latin scripts. For exact billing, use the vendor's official tokenizer.

Characters
Words
Tokens (estimated)
Cost as input · uncached
Cost as output · uncached
Cost as cached input
§ 05 / SHELF

Up against the shelf.

All models →
Model Input /M Output /M Effective blended Context Best for
Claude Opus 4.7 $5.00 cache $0.50 $25.00 $3.21 pricier 1M Frontier reasoning and hard code
Claude Opus 4.6 $5.00 cache $0.50 $25.00 $3.21 pricier 1M Frontier reasoning and hard code
Claude Opus 4.5 $5.00 cache $0.50 $25.00 $3.21 pricier 200K Frontier reasoning and hard code
Claude Opus 4.1 $15.00 cache $1.50 $75.00 $9.62 pricier 200K Legacy Opus workloads
Claude Sonnet 4.6 $3.00 cache $0.30 $15.00 $1.92 same blend 1M Production agents and coding
Claude Sonnet 4.5 Current $3.00 cache $0.30 $15.00 $1.92 agentic 92/8 200K Production agents and coding
Claude Haiku 4.5 $1.00 cache $0.10 $5.00 $0.64 cheaper 200K Support and classification
GPT-5.4 $2.50 cache $0.25 $15.00 $1.80 cheaper 1.05M Tool use and app agents
Gemini 2.5 Pro $1.25 cache $0.13 $10.00 $1.10 cheaper 2M Long-context document analysis

Frequently asked.

Practical pricing questions, with vendor list prices separated from workload assumptions.

Q · 01 What is Claude Sonnet 4.5 priced at? +
Anthropic's pricing page lists $3/M input, $15/M output, and $0.3/M cache-hit input for Claude Sonnet 4.5. Under the 92/8 agentic blend with 82% cache hits, that works out to $1.923/M.
Q · 02 How does prompt caching change the price? +
Cache hits are listed at $0.3/M, which is 10% of the $3/M base input rate. Anthropic also charges cache writes at $3.75/M for 5-minute writes and $6/M for 1-hour writes. The calculator models repeated prompt sections as cache hits.
Q · 03 Is there a Batch API discount? +
Yes. Anthropic's Batch API gives a 50% discount on input and output tokens. For Claude Sonnet 4.5, the batch table maps to $1.5/M input and $7.5/M output.
Q · 04 Does long context cost extra? +
This page uses the model's documented 200K context label. Longer-context Opus 4.6, Opus 4.7, and Sonnet 4.6 use standard pricing across the full 1M window. Prompt caching and batch discounts still apply according to their normal rules.
Q · 05 Does regional pricing differ? +
For first-party Claude API US-only inference, Anthropic applies a 1.1x multiplier to all token categories for this generation. Bedrock and Vertex AI publish their own regional pricing policies.
Q · 06 Are volume discounts published? +
No fixed public volume-discount ladder is published. Enterprise and committed-spend contracts can have private terms, but standard API users should assume the list prices shown here unless they have a separate agreement.
Q · 07 How accurate is the tokenizer estimate? +
The live counter uses a Claude-family English estimate of 4.875 characters per token. Actual billing uses Anthropic's server-side tokenizer; code, tables, and non-English text can differ materially.