ARCHIVE PRICE · DEPRECATED MODEL · 200K · TEXT + VISION · PROMPT CACHING

Claude Sonnet 4 API Pricing

Claude Sonnet 4 is deprecated and scheduled to retire on June 15, 2026. Its archived rates are $3/M input, $15/M output, and $0.30/M cache-hit input. Use this page for archived Anthropic API pricing and migration checks. New production traffic should move to Claude Sonnet 4.6.

Archived input, per 1M tokens: $3.00/M (source: Anthropic; deprecated)
Archived output, per 1M tokens: $15.00/M (use for invoice checks; deprecated)
Cached input, 90% off: $0.30/M (cache reads)
Effective, agentic blend: $1.92/M (92/8 input/output split, 82% cache hit rate)
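
The page does not spell out how the blended figure is derived. The sketch below is one formula that reproduces $1.92/M, assuming the 92/8 split means 92% input tokens and 8% output tokens and that 82% of input tokens are cache reads. The function name and parameters are illustrative, and cache-write surcharges are ignored.

```python
def effective_rate(input_per_m, output_per_m, cached_per_m,
                   input_share=0.92, cache_hit_rate=0.82):
    """Blended USD cost per 1M tokens for a mixed workload (assumed formula)."""
    # Input tokens bill at the cached rate on a hit and the full rate on a miss.
    blended_input = (cache_hit_rate * cached_per_m
                     + (1 - cache_hit_rate) * input_per_m)
    output_share = 1 - input_share
    return input_share * blended_input + output_share * output_per_m


# Archived Claude Sonnet 4 rates from this page.
print(f"${effective_rate(3.00, 15.00, 0.30):.2f}/M")  # prints $1.92/M
```

The same defaults also reproduce the $0.64 and $3.21 blends listed for Claude Haiku 4.5 and Claude Opus 4.7 in the shelf table, which suggests the comparison rows use one shared workload mix.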
§ 01 / TERMINAL

Run the numbers.

Archive calculator pre-loaded with Claude Sonnet 4 rates. Tweak spend, output mix, or cache hit rate to compare historical bills with current replacements.

Inputs: monthly spend ($/mo), workload split, and prompt cache hit rate.
Outputs: tokens you can process, words equivalent (English), and effective rate.
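
As a rough illustration of the "tokens you can process" output, a monthly budget can be divided by a blended per-million rate. This is a minimal sketch and not the calculator's actual code: the function name is hypothetical, and the words-equivalent figure is left out because the page does not state a words-per-token ratio.

```python
def tokens_for_budget(budget_usd, effective_per_m):
    """Approximate token count a budget buys at a blended USD-per-1M-token rate."""
    if effective_per_m <= 0:
        raise ValueError("effective_per_m must be positive")
    return round(budget_usd / effective_per_m * 1_000_000)


# $1,000/mo at the archived $1.92/M agentic blend.
monthly_tokens = tokens_for_budget(1_000, 1.92)
print(f"{monthly_tokens / 1e6:.0f}M tokens per month")  # prints 521M tokens per month
```

Running the same budget through a replacement model's blended rate gives a quick like-for-like comparison when checking a migration.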
§ 03 / TAPE

Price history.

Claude Sonnet 4 is deprecated; archived at $3/$15 per M.

Input · $3/M
Output · $15/M
Cached · $0.30/M
MAY 14 · Launch at $3/M input and $15/M output
MAY 18 · Deprecated; retirement scheduled for June 15, 2026
§ 04 / TOKENIZER

Paste text. See tokens. See cost.

Estimate · anthropic-bpe-estimate · ≈3.5 chars/token

This is a chars-per-token approximation, not a real tokenizer. Actual tokens vary by language, code density, and tool-call overhead — counts are typically ±10–20% off for English prose, more for code or non-Latin scripts. For exact billing, use the vendor's official tokenizer.

Reported values: characters, words, estimated tokens, cost as uncached input, cost as output, and cost as cached input.
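
The chars-per-token estimate can be sketched as follows. It uses the ≈3.5 chars/token ratio and the archived Claude Sonnet 4 rates from this page. Rounding up to a whole token and the function names are assumptions made for illustration. Like the widget, this is not a real tokenizer.

```python
import math

CHARS_PER_TOKEN = 3.5  # the page's approximation, not a real tokenizer

# Archived Claude Sonnet 4 rates in USD per 1M tokens.
RATES_PER_M = {"input": 3.00, "output": 15.00, "cached_input": 0.30}


def estimate_tokens(text):
    """Estimate tokens from character count, rounded up (rounding is an assumption)."""
    return math.ceil(len(text) / CHARS_PER_TOKEN)


def estimate_costs(text):
    """Estimated USD cost of the text billed as input, output, or cached input."""
    tokens = estimate_tokens(text)
    return {kind: tokens * rate / 1_000_000 for kind, rate in RATES_PER_M.items()}


sample = "x" * 7_000  # 7,000 characters, about 2,000 estimated tokens
print(estimate_tokens(sample))
print(estimate_costs(sample))
```

Given the ±10–20% error noted above, estimates like these suit rough budgeting rather than invoice reconciliation.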
§ 05 / SHELF

Up against the shelf.

| Model | Input /M | Cached input /M | Output /M | Effective blended | Context | Best for |
| --- | --- | --- | --- | --- | --- | --- |
| Claude Sonnet 4 | $3.00 | $0.30 | $15.00 | $1.92 (current archive) | 200K | Production Claude agents |
| Claude Sonnet 4.6 | $3.00 | $0.30 | $15.00 | $1.92 (same blend) | 1M | Production Claude agents |
| Claude Sonnet 4.5 | $3.00 | $0.30 | $15.00 | $1.92 (same blend) | 200K | Production Claude agents |
| Claude Haiku 4.5 | $1.00 | $0.10 | $5.00 | $0.64 (cheaper) | 200K | Low-latency Claude workloads |
| Claude Opus 4.7 | $5.00 | $0.50 | $25.00 | $3.21 (pricier) | 1M | Frontier reasoning and hard code |
| GPT-5.4 | $2.50 | $0.25 | $15.00 | $1.80 (same blend) | 1.05M | OpenAI production apps |
| Gemini 2.5 Pro | $1.25 | $0.13 | $10.00 | $1.10 (cheaper) | 2M | Gemini long-context work |
| DeepSeek V4 Pro | $0.43 | $0.00 | $0.87 | $0.14 (cheaper) | 1M | Low-cost reasoning and coding |

Frequently asked.

Short answers for teams checking old Claude Sonnet 4 costs or migrating archive workloads.

Q · 01 Is Claude Sonnet 4 still available?
Claude Sonnet 4 is deprecated and scheduled to retire on June 15, 2026. New production work should migrate before that date.
Q · 02 What replaced Claude Sonnet 4?
Anthropic lists claude-sonnet-4-6 as the recommended replacement on its model deprecations page.
Q · 03 How much did Claude Sonnet 4 cost?
Claude Sonnet 4 is archived at $3/M input and $15/M output.
Q · 04 Does this archive include prompt caching?
Yes. Cache-hit input is archived at $0.30/M, matching Anthropic's 10% cache-read multiplier on this price tier.
Q · 05 Can I use this for new production routing?
Use the archive for cost history, not fresh routing decisions. Compare the current Claude rows in the shelf before starting new workloads.
Q · 06 Why keep a page for a retired model?
Retired-model pages help teams reconcile historical spend, understand migration economics, and answer old search demand without pretending the model is current.