Claude Sonnet 3.7 API Pricing
Archive profile for Anthropic's first hybrid reasoning Sonnet. The last published launch price was $3/M input and $15/M output, with cache reads modeled at $0.30/M. Anthropic now lists the API model as retired, so use this page for historical invoices and migration comparisons, not new deployments.
Run the numbers.
Archive calculator pre-loaded with Claude Sonnet 3.7's last published API rates. Use it to reconcile older spend or compare migrations; share the URL to share the calculation.
Real-world presets.
Support agent ticket triage
Repo-wide bug fix
Technical reasoning task
Reading 100-page contracts
Price history.
Paste text. See tokens. See cost.
This is a chars-per-token approximation, not a real tokenizer. Actual tokens vary by language, code density, and tool-call overhead — counts are typically ±10–20% off for English prose, more for code or non-Latin scripts. For exact billing, use the vendor's official tokenizer.
| Model | Input /M | Output /M | Effective blended | Context | Best for |
|---|---|---|---|---|---|
| Claude Sonnet 3.7 Current | $3.00 cache $0.30 | $15.00 | $1.92 archive · retired | 200K | Historical invoices · migration comparisons |
| Claude Sonnet 4.6 | $3.00 cache $0.30 | $15.00 | $1.92 official replacement | 1M | Production agents · coding |
| Claude Sonnet 4.5 | $3.00 cache $0.30 | $15.00 | $1.92 same list price | 200K | Stable Sonnet migrations |
| Claude Opus 4.7 | $5.00 cache $0.50 | $25.00 | $3.21 pricier | 1M | Frontier reasoning · hard code |
| Claude Haiku 4.5 | $1.00 cache $0.10 | $5.00 | $0.64 cheaper | 200K | Support · classification |
| GPT-5.4 | $2.50 cache $0.25 | $15.00 | $1.80 near parity | 270K | Affordable coding · tool use |
| Gemini 2.5 Pro | $1.25 cache $0.13 | $10.00 | $1.10 cheaper | 2M | Long-context document analysis |
| DeepSeek V4 Flash | $0.14 cache $0.00 | $0.28 | $0.05 deep budget | 1M | Bulk RAG · low-cost generation |
Frequently asked.
Archive pricing questions, with historical token math separated from current model availability.
Q · 01 Is Claude Sonnet 3.7 still available on the Anthropic API? +
claude-3-7-sonnet-20250219 as retired on February 19, 2026. The recommended replacement is claude-sonnet-4-6, which keeps the same $3/M input and $15/M output headline price with a larger 1M context window.Q · 02 Why does this archive page still show a price? +
$3/M input and $15/M output, including thinking tokens. We keep that historical price so older invoices, benchmark cost notes, and migration comparisons still have a stable reference. It is not a recommendation to route new production traffic to a retired model.Q · 03 How was the cached input price calculated? +
0.1x the base input price. With a $3/M base input rate, that gives $0.30/M for cache hits. Historical cache writes follow the same documented multipliers: $3.75/M for 5-minute writes and $6/M for 1-hour writes.Q · 04 Did extended thinking cost extra? +
$15/M.Q · 05 Was there a Batch API discount? +
$1.50/M input and $7.50/M output. After retirement, new first-party batch jobs should target the replacement model instead.Q · 06 What should I migrate to? +
$3/M input, $15/M output, and $0.30/M cache-hit rates, but Anthropic lists it as active and gives it a 1M context window.Q · 07 Does regional pricing apply to this archived model? +
inference_geo parameter. Bedrock and Vertex AI had their own platform-specific policies while 3.7 was available.Q · 08 How accurate is the tokenizer estimate? +
4.875 English characters-per-token estimate for Claude-class BPE tokenization. It is useful for planning historical spend, but actual invoices were based on Anthropic's server-side token counts. Code, tables, and non-English text can deviate from the English estimate.