Last verified
VERIFIED PRICEPROMO FLAGSHIP1MTEXT MODELPROMPT CACHING

DeepSeek V4 Pro API Pricing

DeepSeek V4 Pro is the current DeepSeek thinking flagship with a 75% promo price. $0.435/M input, $0.87/M output, and $0.003625/M cached input. The promo runs through May 31, 2026; list price reverts afterward.

Input - per 1M tokens
$0.43/M
Source DeepSeek active
Output - per 1M tokens
$0.87/M
Use for cost checks active
Cached input
$0.00/M
Cache hit price discounted
Effective - agentic blend
$0.14/M
92/8 split - 82% cache
§ 01 / TERMINAL

Run the numbers.

Calculator pre-loaded with DeepSeek V4 Pro rates. Tweak spend, output mix, or cache hit rate to compare this model with nearby alternatives.

$ /mo
Workload split
Prompt cache hit rate
Tokens you can process
Words equivalent (English)
Effective rate
§ 02 / SCENARIOS

Real-world presets.

§ 03 / TAPE

Price history.

DeepSeek V4 Pro is active; listed at $0.435/$0.87 per M.

Input · $0.43/M
Output · $0.87/M
Cached · $0.00/M
APR 26 V4 Pro promo listed at $0.435/M input and $0.87/M outputMAY 18 Live DeepSeek pricing verified; 75% discount still listed
§ 04 / TOKENIZER

Paste text. See tokens. See cost.

Estimate · deepseek-tokenizer-estimate · ≈4.2 chars/token Auto-counts as you type

This is a chars-per-token approximation, not a real tokenizer. Actual tokens vary by language, code density, and tool-call overhead — counts are typically ±10–20% off for English prose, more for code or non-Latin scripts. For exact billing, use the vendor's official tokenizer.

Characters
Words
Tokens (estimated)
Cost as input · uncached
Cost as output · uncached
Cost as cached input
§ 05 / SHELF

Up against the shelf.

All models →
Model Input /M Output /M Effective blended Context Best for
DeepSeek V4 Pro $0.43 cache $0.00 $0.87 $0.14 current page 1M Low-cost reasoning
DeepSeek V4 Flash $0.14 cache $0.00 $0.28 $0.05 cheaper 1M Cheap DeepSeek workloads
GPT-5.4 mini $0.75 cache $0.07 $4.50 $0.54 pricier 400K Open-weight multimodal work
Claude Sonnet 4.6 $3.00 cache $0.30 $15.00 $1.92 pricier 1M Open-weight multimodal work
Gemini 2.5 Flash $0.30 cache $0.03 $2.50 $0.27 pricier 1M Open-weight multimodal work
Grok 4.3 $1.25 cache $0.20 $2.50 $0.56 pricier 1M Grok long-context agents

Frequently asked.

Short answers for teams checking DeepSeek V4 Pro pricing, status, and migration choices.

Q · 01 Is DeepSeek V4 Pro still available? +
DeepSeek V4 Pro is listed as active/current in the pricing snapshot and was checked against the vendor page used for this entry.
Q · 02 How much does DeepSeek V4 Pro cost? +
DeepSeek V4 Pro is listed at $0.435/M input and $0.87/M output.
Q · 03 Is cached-input pricing included? +
Yes. Cached input is shown at $0.003625/M.
Q · 04 What should teams compare it against? +
Use the shelf to compare nearby current models. Snapshot replacement: current same-provider models.
Q · 05 Why keep this page? +
These pages preserve live price checks, preview economics, and migration context in one consistent calculator format.