DeepSeek V4 Flash API Pricing
DeepSeek V4 Flash is the current cheap DeepSeek tier for both non-thinking and thinking modes. $0.14/M input, $0.28/M output, and $0.0028/M cached input. DeepSeek says old deepseek-chat and deepseek-reasoner names map into this V4 family for compatibility.
Run the numbers.
Calculator pre-loaded with DeepSeek V4 Flash rates. Tweak spend, output mix, or cache hit rate to compare this model with nearby alternatives.
Real-world presets.
Repo-wide bug fix
Reading 100-page contracts
Support agent ticket triage
Research planning turn
Price history.
Paste text. See tokens. See cost.
This is a chars-per-token approximation, not a real tokenizer. Actual tokens vary by language, code density, and tool-call overhead — counts are typically ±10–20% off for English prose, more for code or non-Latin scripts. For exact billing, use the vendor's official tokenizer.
| Model | Input /M | Output /M | Effective blended | Context | Best for |
|---|---|---|---|---|---|
| DeepSeek V4 Flash | $0.14 cache $0.00 | $0.28 | $0.05 current page | 1M | Cheap DeepSeek workloads |
| DeepSeek V4 Pro | $0.43 cache $0.00 | $0.87 | $0.14 pricier | 1M | Low-cost reasoning |
| GPT-5.4 mini | $0.75 cache $0.07 | $4.50 | $0.54 pricier | 400K | Open-weight multimodal work |
| Claude Sonnet 4.6 | $3.00 cache $0.30 | $15.00 | $1.92 pricier | 1M | Open-weight multimodal work |
| Gemini 2.5 Flash | $0.30 cache $0.03 | $2.50 | $0.27 pricier | 1M | Open-weight multimodal work |
| Grok 4.3 | $1.25 cache $0.20 | $2.50 | $0.56 pricier | 1M | Grok long-context agents |
Frequently asked.
Short answers for teams checking DeepSeek V4 Flash pricing, status, and migration choices.
Q · 01 Is DeepSeek V4 Flash still available? +
Q · 02 How much does DeepSeek V4 Flash cost? +
Q · 03 Is cached-input pricing included? +
Q · 04 What should teams compare it against? +
current same-provider models.