Last verified
COST DISRUPTORFOUNDED 2023HANGZHOU · CHINAMIT OPEN WEIGHTSHIGH-FLYER BACKED

DeepSeek API Pricing

The Chinese lab behind the DeepSeek open-weight models. Spun out of quant hedge fund High-Flyer in 2023 by Liang Wenfeng, it shocked the industry by matching frontier reasoning quality at a fraction of the cost — and shipping the weights under an MIT license.

Production models
2
+ MIT open weights
Founded
2023
by High-Flyer fund
Cheapest tier
$0.14/M
V4 Flash input
Cost vs frontier
~11×
cheaper input vs GPT-5.5
Context window
1M
both V4 models
Open weights
MIT
on Hugging Face
§ 01 / LINEUP

The full roster.

Side-by-side →
§ 02 / SHELF

All side-by-side.

Methodology →
Model Input /M Output /M Cached Context Max output Vision Tools Tier
DeepSeek V4 Flash $0.14 $0.28 $0.0−98% 1M Active
DeepSeek V4 Pro FLAGSHIP $0.43 $0.87 $0.0−99% 1M Frontier
DeepSeek V3 RETIRED Retired Apr 26, 2026 → deepseek-v4-flash Retired
DeepSeek R1 RETIRED Retired Apr 26, 2026 → deepseek-v4-pro Retired

The DeepSeek story so far.

A short, loud history: V3 → R1 → V4, with V4 arriving 484 days after V3. Sourced from DeepSeek's API docs, model cards, and contemporaneous coverage.

MAY 31 · 2026
V4 Pro 75% launch promo ends — list price reverts to $1.74/$3.48 per M (still ¼ of the original launch list)
PRICING
APR 26 · 2026
DeepSeek V4 Pro + V4 Flash launched (preview) — MIT open weights, 1.6T-param MoE, 1M context
RELEASE
APR 26 · 2026
Pricing reset — cache-hit input cut to 1/10 of launch; legacy deepseek-chat/deepseek-reasoner fold into V4; V3 + R1 retired
PRICING
JAN 20 · 2025
DeepSeek R1 released — o1-class reasoning at a fraction of the price; the consumer app tops the US App Store and rattles AI markets
RELEASE
DEC 26 · 2024
DeepSeek V3 released — 671B-param MoE open weights trained at a breakthrough-low compute cost
RELEASE
JUL · 2023
DeepSeek founded by Liang Wenfeng as an AI lab funded by quant hedge fund High-Flyer, Hangzhou
CORPORATE
§ 04 / ACCESS

Where to get it.

Methodology →
§ 04 / BEST FOR

Which DeepSeek for what.

More scenarios →
If you need frontier reasoning at minimal cost
DeepSeek V4 Pro
Profile →
If you want a cheap high-volume daily driver
DeepSeek V4 Flash
Profile →
If you process 1M-token long documents
DeepSeek V4 Flash
Profile →
If you must self-host for data control
V4 weights (MIT)
Hugging Face →
If you need non-China data residency
V4 on a Western host
Hosts →
If you're cost-cutting from GPT-5 / Claude
DeepSeek V4 Flash
Profile →
§ 06 / BACKGROUND

The company behind it.

www.deepseek.com →

DeepSeek was founded in July 2023 in Hangzhou by Liang Wenfeng, who also co-founded and runs the quantitative hedge fund High-Flyer (started in 2015–2016). DeepSeek is owned and funded by High-Flyer rather than by external venture capital — an unusual structure that let it train large models on a GPU cluster the fund had already built for trading.

The lab's signature is compute efficiency. Its mixture-of-experts (MoE) training recipes delivered frontier-class quality at a small fraction of the budgets reported by US labs. Liang has reportedly held a controlling personal stake (~84% as of 2024), and the team is famously lean — on the order of ~150 people, with many hired straight out of university.

DeepSeek's breakout moment came with V3 (December 2024) and then R1 (January 2025): R1 matched OpenAI o1-class reasoning at roughly 1/27 the price, the consumer app briefly topped the US App Store, and the release triggered a sharp sell-off in AI hardware stocks. In April 2026 the lab shipped the V4 family — a 1.6T-parameter MoE (V4 Pro) and a 284B MoE (V4 Flash), both with 1M-token context and MIT-licensed open weights.

Pricing is the headline: V4 Pro runs at $0.435/$0.87 per M under a 75% launch promo (reverting to $1.74/$3.48 after May 31, 2026), and V4 Flash at $0.14/$0.28 — roughly an order of magnitude below frontier US models, with cache-hit input cut to about 1/10 of base.

The trade-offs are real: models are text-only (no native vision or audio), the company is China-based with data stored in China by default, and US export-control and procurement-policy questions apply. The MIT weights are the escape hatch — teams that need US/EU residency can self-host. Versus OpenAI and Anthropic, DeepSeek trades multimodal breadth and Western data governance for radically lower cost and full open weights.

§ 07 / COMPETITORS

Other frontier labs.

All providers →

Frequently asked.

Practical questions about DeepSeek pricing, open weights, and data residency.

Q · 01 Which DeepSeek model should I start with? +
For most workloads: DeepSeek V4 Flash ($0.14/$0.28) — it covers both non-thinking and thinking modes cheaply. Step up to V4 Pro ($0.435/$0.87 on promo) for the hardest reasoning. Both ship a 1M-token context. See the use case picker above.
Q · 02 How much cheaper is DeepSeek than OpenAI or Anthropic? +
On input, V4 Pro is roughly 11× cheaper than GPT-5.5 ($0.435 vs $5) and ~34× cheaper on output ($0.87 vs $30). With cache-hit input at $0.0036/M the gap widens further. The honest caveat: DeepSeek is text-only and trails on multimodal.
Q · 03 Is there an off-peak discount? +
Not currently. DeepSeek previously ran off-peak (UTC-night) discounts, but the live pricing page no longer lists off-peak hours. Today the savings come from a flat 75% V4 Pro launch promo (through May 31, 2026) and a cache-hit input price cut to about 1/10 of base.
Q · 04 Are DeepSeek models open-weight? +
Yes. V4 Pro and V4 Flash weights are MIT-licensed on Hugging Face — fully self-hostable, commercially usable, with no per-token fee. This is what lets teams run DeepSeek outside China for data-residency reasons.
Q · 05 Where is my data stored, and is that a problem? +
Using the direct DeepSeek API, data is processed and stored in China by default, which is a blocker for many EU/US compliance and procurement teams. The workaround is to self-host the MIT weights on your own US/EU cloud, or use a Western third-party host. US export-control and policy questions also apply.
Q · 06 What happens after the 75% promo ends on May 31, 2026? +
V4 Pro reverts to its standard list of $1.74/M input and $3.48/M output (¼ of the original launch list). Even at list, that remains far below frontier US models. V4 Flash pricing ($0.14/$0.28) is not part of the promo.
Q · 07 How does DeepSeek compare to Qwen and Mistral? +
Qwen (Alibaba) is the closest Chinese peer with broader multimodal and multilingual coverage; Mistral is the EU open-weight peer with GDPR-native residency. DeepSeek's edge is the lowest cost-per-token at frontier reasoning quality, with the same China-data caveat as Qwen.