Question 1

Is Claude Sonnet 3.7 still available on the Anthropic API?

Accepted Answer

No. Anthropic's model deprecations page lists claude-3-7-sonnet-20250219 as retired on February 19, 2026. The recommended replacement is claude-sonnet-4-6, which keeps the same $3/M input and $15/M output headline price with a larger 1M context window.

Question 2

Why does this archive page still show a price?

Accepted Answer

Anthropic's launch announcement priced Claude Sonnet 3.7 at $3/M input and $15/M output, with thinking tokens billed at the output rate. We keep that historical price so older invoices, benchmark cost notes, and migration comparisons still have a stable reference. It is not a recommendation to route new production traffic to a retired model.

Question 3

How was the cached input price calculated?

Accepted Answer

Anthropic's prompt caching rules price cache reads at 0.1x the base input price. With a $3/M base input rate, that gives $0.30/M for cache hits. Historical cache writes follow the same documented multipliers: 1.25x for 5-minute writes ($3.75/M) and 2x for 1-hour writes ($6/M).
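The arithmetic above can be sketched in a few lines of Python. This is an illustration only: the function and constant names are hypothetical, and the 1.25x and 2x write multipliers are inferred from the $3.75/M and $6/M figures quoted in this answer.

```python
from decimal import Decimal

# Multipliers relative to the base input price.
# 0.1x is stated in the answer; 1.25x and 2x are implied by $3.75/M and $6/M on a $3/M base.
CACHE_READ_MULTIPLIER = Decimal("0.1")
CACHE_WRITE_5M_MULTIPLIER = Decimal("1.25")
CACHE_WRITE_1H_MULTIPLIER = Decimal("2")


def cache_prices(base_input_per_million: Decimal) -> dict:
    """Return per-million-token cache prices derived from a base input price."""
    return {
        "cache_read": base_input_per_million * CACHE_READ_MULTIPLIER,
        "cache_write_5m": base_input_per_million * CACHE_WRITE_5M_MULTIPLIER,
        "cache_write_1h": base_input_per_million * CACHE_WRITE_1H_MULTIPLIER,
    }


# Archived Claude Sonnet 3.7 base input price in USD per million tokens.
prices = cache_prices(Decimal("3.00"))
for name, price in prices.items():
    print(f"{name}: ${price:.2f}/M")
```

Decimal is used here so that the derived prices compare cleanly against the published figures without floating-point noise.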

Question 4

Did extended thinking cost extra?

Accepted Answer

No separate surcharge was published. Anthropic said standard and extended thinking modes had the same model price, with thinking tokens included in output billing. In practice, extended thinking could still raise total cost because it generated more billable output tokens at $15/M.
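To see why the bill could still grow, here is a rough sketch that treats thinking tokens as ordinary output tokens. The function name and the token counts are made up for illustration; only the $3/M and $15/M rates come from this page.

```python
# Archived Claude Sonnet 3.7 list prices in USD per million tokens.
INPUT_PER_M = 3.00
OUTPUT_PER_M = 15.00


def request_cost(input_tokens: int, answer_tokens: int, thinking_tokens: int = 0) -> float:
    """Estimate one request's cost, billing thinking tokens at the output rate."""
    billable_output = answer_tokens + thinking_tokens
    return (input_tokens * INPUT_PER_M + billable_output * OUTPUT_PER_M) / 1_000_000


# Hypothetical request: same prompt and answer length, with and without thinking.
without_thinking = request_cost(input_tokens=2_000, answer_tokens=500)
with_thinking = request_cost(input_tokens=2_000, answer_tokens=500, thinking_tokens=4_000)

print(f"standard: ${without_thinking:.4f}")
print(f"extended thinking: ${with_thinking:.4f}")
```

The per-token rate is identical in both calls; the difference comes entirely from the extra output tokens.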

Question 5

Was there a Batch API discount?

Accepted Answer

While the model was active, the standard Anthropic Batch API discount was 50% for asynchronous workloads. Applying that rule to Claude Sonnet 3.7's archived list price gives $1.50/M input and $7.50/M output. After retirement, new first-party batch jobs should target the replacement model instead.
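As a quick check of those figures, the 50% discount can be applied to the archived list prices like this. The helper name is hypothetical and the sketch assumes the discount applied uniformly to input and output tokens.

```python
# Batch discount as described above (50% off list price).
BATCH_DISCOUNT = 0.50


def batch_price(list_price_per_million: float, discount: float = BATCH_DISCOUNT) -> float:
    """Return the discounted per-million-token price for batch workloads."""
    return list_price_per_million * (1 - discount)


# Archived Claude Sonnet 3.7 list prices in USD per million tokens.
batch_input = batch_price(3.00)
batch_output = batch_price(15.00)

print(f"batch input: ${batch_input:.2f}/M, batch output: ${batch_output:.2f}/M")
```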

Question 6

What should I migrate to?

Accepted Answer

Use Claude Sonnet 4.6 for the closest migration path. It has the same $3/M input, $15/M output, and $0.30/M cache-hit rates, but Anthropic lists it as active and gives it a 1M context window.

Question 7

Does regional pricing apply to this archived model?

Accepted Answer

No first-party US-only inference multiplier is modeled here. Anthropic's current data-residency pricing applies to Claude Opus 4.6, Claude Sonnet 4.6, and later models; earlier models do not support the inference_geo parameter. Bedrock and Vertex AI had their own platform-specific policies while 3.7 was available.

Question 8

How accurate is the tokenizer estimate?

Accepted Answer

The live counter assumes roughly 4.875 English characters per token for Claude-class BPE tokenization. It is useful for planning historical spend, but actual invoices were based on Anthropic's server-side token counts. Code, tables, and non-English text can deviate from the English estimate.
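A character-based estimate of this kind might look like the sketch below. It is not the page's actual counter: the function names are hypothetical, and rounding up to a whole token is an assumption made here for conservative planning.

```python
import math

# Characters-per-token ratio quoted above for English text.
CHARS_PER_TOKEN = 4.875
# Archived Claude Sonnet 3.7 input price in USD per million tokens.
INPUT_PER_M = 3.00


def estimate_tokens(text: str) -> int:
    """Approximate the token count from character length (rounded up; an assumption)."""
    return math.ceil(len(text) / CHARS_PER_TOKEN)


def estimate_input_cost(text: str) -> float:
    """Approximate the input cost in USD for sending `text` as a prompt."""
    return estimate_tokens(text) * INPUT_PER_M / 1_000_000


sample = "Summarize the attached contract and list every termination clause."
print(estimate_tokens(sample), "tokens (estimated)")
print(f"${estimate_input_cost(sample):.6f} (estimated input cost)")
```

Because the ratio is tuned for English prose, treat the result as a planning figure rather than a substitute for server-side token counts.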

Model	Input /M	Cache hit /M	Output /M	Effective blended	Note	Context	Best for
Claude Sonnet 3.7 (this page)	$3.00	$0.30	$15.00	$1.92	archive · retired	200K	Historical invoices · migration comparisons
Claude Sonnet 4.6	$3.00	$0.30	$15.00	$1.92	official replacement	1M	Production agents · coding
Claude Sonnet 4.5	$3.00	$0.30	$15.00	$1.92	same list price	200K	Stable Sonnet migrations
Claude Opus 4.7	$5.00	$0.50	$25.00	$3.21	pricier	1M	Frontier reasoning · hard code
Claude Haiku 4.5	$1.00	$0.10	$5.00	$0.64	cheaper	200K	Support · classification
GPT-5.4	$2.50	$0.25	$15.00	$1.80	near parity	270K	Affordable coding · tool use
Gemini 2.5 Pro	$1.25	$0.13	$10.00	$1.10	cheaper	2M	Long-context document analysis
DeepSeek V4 Flash	$0.14	$0.00	$0.28	$0.05	deep budget	1M	Bulk RAG · low-cost generation

Claude Sonnet 3.7 API Pricing
