ARCHIVE PRICE · RETIRED FEB 2026 · 200K CONTEXT · EXTENDED THINKING · PROMPT CACHING

Claude Sonnet 3.7 API Pricing

Archive profile for Anthropic's first hybrid reasoning Sonnet. The last published launch price was $3/M input and $15/M output, with cache reads modeled at $0.30/M. Anthropic now lists the API model as retired, so use this page for historical invoices and migration comparisons, not new deployments.

Archived input · per 1M tokens
$3.00/M
Launched Feb 24, 2025 · retired
Archived output · per 1M tokens
$15.00/M
Includes thinking tokens · retired
Cached input · historical
$0.30/M
5-min or 1-h cache · reads 90% below base input
Effective · agentic blend
$1.92/M
92% input / 8% output · 82% cache hits
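The blended figure can be reproduced with a short sketch. The formula is an assumption about how the card is derived, not a published method: input tokens are priced at the cache-read rate on a hit and the full rate on a miss, cache-write surcharges are ignored, and the function and parameter names are illustrative.

```python
def effective_rate(input_per_m, cached_per_m, output_per_m,
                   input_share=0.92, cache_hit_rate=0.82):
    """Blended USD per 1M tokens (assumed formula; ignores cache-write costs)."""
    # Input tokens bill at the cache-read rate on a hit, the full rate on a miss.
    blended_input = (cache_hit_rate * cached_per_m
                     + (1 - cache_hit_rate) * input_per_m)
    output_share = 1 - input_share
    return input_share * blended_input + output_share * output_per_m


# Claude Sonnet 3.7 archived rates: $3.00 input, $0.30 cached, $15.00 output.
sonnet_37 = effective_rate(3.00, 0.30, 15.00)
print(f"${sonnet_37:.2f}/M")  # prints $1.92/M
```

Under these assumptions the 92/8 split and 82% hit rate give about $1.92 per million tokens, matching the card. A workload with more output or fewer cache hits would land higher.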
§ 01 / TERMINAL

Run the numbers.

Archive calculator pre-loaded with Claude Sonnet 3.7's last published API rates. Use it to reconcile older spend or compare migrations; share the URL to share the calculation.

Inputs: monthly budget ($/mo) · workload split · prompt cache hit rate
Outputs: tokens you can process · words equivalent (English) · effective rate
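A rough sketch of the budget-to-tokens step, assuming the calculator simply divides the monthly budget by the blended rate. The function name and the $500 budget are hypothetical, and the words-equivalent output is left out because the page does not state its words-per-token ratio.

```python
def tokens_for_budget(monthly_budget_usd, effective_rate_per_m):
    """Tokens a monthly budget covers at a blended USD-per-1M-token rate."""
    if effective_rate_per_m <= 0:
        raise ValueError("effective rate must be positive")
    # Rates are quoted per 1M tokens, so scale the quotient back up.
    return int(monthly_budget_usd / effective_rate_per_m * 1_000_000)


# Hypothetical $500/month budget at the archived $1.92/M blended rate.
print(f"{tokens_for_budget(500, 1.92):,} tokens")
```

At the archived blended rate, $500 covers roughly 260 million tokens per month.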
§ 02 / SCENARIOS

Real-world presets.

§ 03 / TAPE

Price history.

Retired on February 19, 2026, after launching at $3/M input and $15/M output.

Input · $3/M
Output · $15/M
Cached · $0.30/M
FEB 24 · Launch at $3/M · $15/M
OCT 28 · Deprecated; Anthropic recommended migrating to Claude Sonnet 4.6
FEB 19 · Retired from the Anthropic-operated Claude API
MAY 17 · Archived on AI//COST with last published launch pricing
§ 04 / TOKENIZER

Paste text. See tokens. See cost.

Estimate · anthropic-bpe-estimate · ≈3.5 chars/token · auto-counts as you type

This is a chars-per-token approximation, not a real tokenizer. Actual token counts vary by language, code density, and tool-call overhead; estimates are typically off by 10–20% for English prose and by more for code or non-Latin scripts. For exact billing, use the vendor's official tokenizer.
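The estimate can be sketched as characters divided by 3.5, then priced at the archived rates. Rounding up to a whole token is an assumption, since the page does not say how it rounds, and the helper names are illustrative.

```python
import math

CHARS_PER_TOKEN = 3.5  # the page's stated approximation, not a real tokenizer


def estimate_tokens(text):
    """Approximate token count; rounding up is an assumption."""
    return math.ceil(len(text) / CHARS_PER_TOKEN)


def cost_usd(tokens, rate_per_m):
    """Cost of `tokens` at a USD-per-1M-token rate."""
    return tokens * rate_per_m / 1_000_000


sample = "Paste text. See tokens. See cost."
tokens = estimate_tokens(sample)
print("tokens (estimated):", tokens)
print("as uncached input :", cost_usd(tokens, 3.00))
print("as output         :", cost_usd(tokens, 15.00))
print("as cached input   :", cost_usd(tokens, 0.30))
```

Treat the result as a planning number only; the error band described above still applies.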

Characters
Words
Tokens (estimated)
Cost as input · uncached
Cost as output · uncached
Cost as cached input
§ 05 / SHELF

Up against the shelf.

| Model | Input /M | Output /M | Effective blended | Context | Best for |
| --- | --- | --- | --- | --- | --- |
| Claude Sonnet 3.7 (current page) | $3.00 (cache $0.30) | $15.00 | $1.92 · archive, retired | 200K | Historical invoices · migration comparisons |
| Claude Sonnet 4.6 | $3.00 (cache $0.30) | $15.00 | $1.92 · official replacement | 1M | Production agents · coding |
| Claude Sonnet 4.5 | $3.00 (cache $0.30) | $15.00 | $1.92 · same list price | 200K | Stable Sonnet migrations |
| Claude Opus 4.7 | $5.00 (cache $0.50) | $25.00 | $3.21 · pricier | 1M | Frontier reasoning · hard code |
| Claude Haiku 4.5 | $1.00 (cache $0.10) | $5.00 | $0.64 · cheaper | 200K | Support · classification |
| GPT-5.4 | $2.50 (cache $0.25) | $15.00 | $1.80 · near parity | 270K | Affordable coding · tool use |
| Gemini 2.5 Pro | $1.25 (cache $0.13) | $10.00 | $1.10 · cheaper | 2M | Long-context document analysis |
| DeepSeek V4 Flash | $0.14 (cache $0.00) | $0.28 | $0.05 · deep budget | 1M | Bulk RAG · low-cost generation |

Frequently asked.

Archive pricing questions, with historical token math separated from current model availability.

Q · 01 Is Claude Sonnet 3.7 still available on the Anthropic API?
No. Anthropic's model deprecations page lists claude-3-7-sonnet-20250219 as retired on February 19, 2026. The recommended replacement is claude-sonnet-4-6, which keeps the same $3/M input and $15/M output headline price with a larger 1M context window.
Q · 02 Why does this archive page still show a price?
Anthropic's launch announcement priced Claude Sonnet 3.7 at $3/M input and $15/M output, with thinking tokens billed as output. We keep that historical price so older invoices, benchmark cost notes, and migration comparisons still have a stable reference. It is not a recommendation to route new production traffic to a retired model.
Q · 03 How was the cached input price calculated?
Anthropic's prompt caching rules price cache reads at 0.1x the base input price. With a $3/M base input rate, that gives $0.30/M for cache hits. Historical cache writes follow the documented write multipliers: 1.25x for 5-minute writes ($3.75/M) and 2x for 1-hour writes ($6/M).
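The cache arithmetic can be sketched as a multiplier table. The 1.25x and 2x write multipliers are inferred from the $3.75/M and $6/M figures above divided by the $3/M base, and the dictionary keys are illustrative names.

```python
BASE_INPUT_PER_M = 3.00  # archived Claude Sonnet 3.7 input rate, USD per 1M tokens

# Multipliers applied to the base input price; key names are illustrative.
CACHE_MULTIPLIERS = {
    "read": 0.10,      # cache hit
    "write_5m": 1.25,  # 5-minute cache write
    "write_1h": 2.00,  # 1-hour cache write
}


def cache_rates(base_input_per_m):
    """Derive per-1M-token cache rates from a base input price."""
    return {kind: round(base_input_per_m * mult, 4)
            for kind, mult in CACHE_MULTIPLIERS.items()}


print(cache_rates(BASE_INPUT_PER_M))
# prints {'read': 0.3, 'write_5m': 3.75, 'write_1h': 6.0}
```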
Q · 04 Did extended thinking cost extra?
No separate surcharge was published. Anthropic said standard and extended thinking modes had the same model price, with thinking tokens included in output billing. In practice, extended thinking could still raise total cost because it generated more billable output tokens at $15/M.
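A small sketch of why the bill could still grow: thinking tokens are added to answer tokens and billed at the same output rate. The token counts below are hypothetical, chosen only to show the effect.

```python
OUTPUT_PER_M = 15.00  # archived output rate, USD per 1M tokens


def output_cost_usd(answer_tokens, thinking_tokens=0, output_per_m=OUTPUT_PER_M):
    """Thinking tokens bill as ordinary output tokens; there is no separate rate."""
    return (answer_tokens + thinking_tokens) * output_per_m / 1_000_000


# Hypothetical request: 1,000 answer tokens, with and without 8,000 thinking tokens.
without = output_cost_usd(1_000)
with_thinking = output_cost_usd(1_000, thinking_tokens=8_000)
print(f"${without:.3f} vs ${with_thinking:.3f}")  # prints $0.015 vs $0.135
```

The per-token price is identical in both cases; only the number of billable output tokens changes.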
Q · 05 Was there a Batch API discount?
While the model was active, the standard Anthropic Batch API discount was 50% for asynchronous workloads. Applying that rule to Claude Sonnet 3.7's archived list price gives $1.50/M input and $7.50/M output. After retirement, new first-party batch jobs should target the replacement model instead.
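The batch figures follow from halving the list price. A minimal sketch, with an illustrative function name:

```python
BATCH_DISCOUNT = 0.50  # standard Batch API discount while the model was active


def batch_rate(list_rate_per_m, discount=BATCH_DISCOUNT):
    """Apply the batch discount to a USD-per-1M-token list rate."""
    return list_rate_per_m * (1 - discount)


print(batch_rate(3.00), batch_rate(15.00))  # prints 1.5 7.5
```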
Q · 06 What should I migrate to?
Use Claude Sonnet 4.6 for the closest migration path. It has the same $3/M input, $15/M output, and $0.30/M cache-hit rates, but Anthropic lists it as active and gives it a 1M context window.
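One way to handle the swap in application code is a small remapping table, sketched here with the two model IDs named in this FAQ. The helper is hypothetical and not part of any SDK. Because list prices match, the ID is the only pricing-related change, though prompts and outputs should still be re-tested on the replacement model.

```python
# Retired model ID -> recommended replacement, using the IDs cited in this FAQ.
REPLACEMENTS = {
    "claude-3-7-sonnet-20250219": "claude-sonnet-4-6",
}


def resolve_model(model_id):
    """Return the replacement for a retired ID, or the ID unchanged."""
    return REPLACEMENTS.get(model_id, model_id)


print(resolve_model("claude-3-7-sonnet-20250219"))  # prints claude-sonnet-4-6
```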
Q · 07 Does regional pricing apply to this archived model?
No first-party US-only inference multiplier is modeled here. Anthropic's current data-residency pricing applies to Claude Opus 4.6, Claude Sonnet 4.6, and later models; earlier models do not support the inference_geo parameter. Bedrock and Vertex AI had their own platform-specific policies while 3.7 was available.
Q · 08 How accurate is the tokenizer estimate?
The live counter uses an estimate of roughly 3.5 English characters per token for Claude-class BPE tokenization. It is useful for planning historical spend, but actual invoices were based on Anthropic's server-side token counts. Code, tables, and non-English text can deviate from the English estimate.