Last verified 2026-05-22

COST DISRUPTORFOUNDED 2023HANGZHOU · CHINAMIT OPEN WEIGHTSHIGH-FLYER BACKED

DeepSeek API Pricing

Q: Which DeepSeek model should I start with?

For most workloads: DeepSeek V4 Flash ($0.14/$0.28) — it covers both non-thinking and thinking modes cheaply. Step up to V4 Pro ($0.435/$0.87 on promo) for the hardest reasoning. Both ship a 1M-token context. See the use case picker above.

Q: How much cheaper is DeepSeek than OpenAI or Anthropic?

On input, V4 Pro is roughly 11× cheaper than GPT-5.5 ($0.435 vs $5) and ~34× cheaper on output ($0.87 vs $30). With cache-hit input at $0.0036/M the gap widens further. The honest caveat: DeepSeek is text-only and trails on multimodal.

Q: Is there an off-peak discount?

Not currently. DeepSeek previously ran off-peak (UTC-night) discounts, but the live pricing page no longer lists off-peak hours. Today the savings come from a flat 75% V4 Pro launch promo (through May 31, 2026) and a cache-hit input price cut to about 1/10 of base.

Q: Are DeepSeek models open-weight?

Yes. V4 Pro and V4 Flash weights are MIT-licensed on Hugging Face — fully self-hostable, commercially usable, with no per-token fee. This is what lets teams run DeepSeek outside China for data-residency reasons.

Q: Where is my data stored, and is that a problem?

Using the direct DeepSeek API, data is processed and stored in China by default, which is a blocker for many EU/US compliance and procurement teams. The workaround is to self-host the MIT weights on your own US/EU cloud, or use a Western third-party host. US export-control and policy questions also apply.

Q: What happens after the 75% promo ends on May 31, 2026?

V4 Pro reverts to its standard list of $1.74/M input and $3.48/M output (¼ of the original launch list). Even at list, that remains far below frontier US models. V4 Flash pricing ($0.14/$0.28) is not part of the promo.

The Chinese lab behind the DeepSeek open-weight models. Spun out of quant hedge fund High-Flyer in 2023 by Liang Wenfeng, it shocked the industry by matching frontier reasoning quality at a fraction of the cost — and shipping the weights under an MIT license.

Open DeepSeek Platform →View all models Calculate cost

Production models

+ MIT open weights

Founded

2023

by High-Flyer fund

Cheapest tier

$0.14/M

V4 Flash input

Cost vs frontier

~11×

cheaper input vs GPT-5.5

Context window

both V4 models

Methodology →

Model	Input /M	Output /M	Cached	Context	Max output	Vision	Tools	Tier
DeepSeek V4 Flash	$0.14	$0.28	$0.0−98%	1M	—	✗	✓	Active
DeepSeek V4 Pro FLAGSHIP	$0.43	$0.87	$0.0−99%	1M	—	✗	✓	Frontier
DeepSeek V3 RETIRED	—	—	—	Retired Apr 26, 2026	→ deepseek-v4-flash	✗	✗	Retired
DeepSeek R1 RETIRED	—	—	—	Retired Apr 26, 2026	→ deepseek-v4-pro	✗	✗	Retired

§ 03 / PRICE CURVE

Pricing across the lineup.

How DeepSeek priced 4 models · DEC 24 → APR 26.

oldest → newest →

Input · newest $0.43/M

Output · newest $0.87/M

Each point is a model at its listed $/M price.

The DeepSeek story so far.

A short, loud history: V3 → R1 → V4, with V4 arriving 484 days after V3. Sourced from DeepSeek's API docs, model cards, and contemporaneous coverage.

MAY 31 · 2026

V4 Pro 75% launch promo ends — list price reverts to $1.74/$3.48 per M (still ¼ of the original launch list)

PRICING

APR 26 · 2026

DeepSeek V4 Pro + V4 Flash launched (preview) — MIT open weights, 1.6T-param MoE, 1M context

RELEASE

APR 26 · 2026

Pricing reset — cache-hit input cut to 1/10 of launch; legacy deepseek-chat/deepseek-reasoner fold into V4; V3 + R1 retired

PRICING

JAN 20 · 2025

DeepSeek R1 released — o1-class reasoning at a fraction of the price; the consumer app tops the US App Store and rattles AI markets

RELEASE

DEC 26 · 2024

DeepSeek V3 released — 671B-param MoE open weights trained at a breakthrough-low compute cost

RELEASE

JUL · 2023

DeepSeek founded by Liang Wenfeng as an AI lab funded by quant hedge fund High-Flyer, Hangzhou

CORPORATE

§ 04 / ACCESS

Where to get it.

Methodology →

PRIMARY · DIRECT

DeepSeek Platform

platform.deepseek.com — direct API with an OpenAI-compatible endpoint, dev console, and billing. Prices are list-low; cache-hit input is ~1/10 of base.

Console + API →

OPEN-WEIGHT · SELF-HOST

Hugging Face

V4 Pro + V4 Flash weights ship under the MIT license on Hugging Face — fully self-hostable with no per-token fee and no China-routing of your data.

MIT license →

CONSUMER · APP

DeepSeek App

chat.deepseek.com plus iOS/Android apps — free consumer chat that briefly topped the US App Store in early 2025. No API; for end-user usage.

Free web + mobile →

THIRD-PARTY · HOSTS

Western GPU clouds

Because the weights are MIT-licensed, third-party inference hosts (Together, Fireworks, and others) serve DeepSeek V4 from US/EU regions — useful when China data residency is a blocker.

US / EU regions →

§ 04 / BEST FOR

Which DeepSeek for what.

More scenarios →

If you need frontier reasoning at minimal cost…

DeepSeek V4 Pro

Profile →

If you want a cheap high-volume daily driver…

DeepSeek V4 Flash

Profile →

If you process 1M-token long documents…

DeepSeek V4 Flash

Profile →

If you must self-host for data control…

V4 weights (MIT)

Hugging Face →

If you need non-China data residency…

V4 on a Western host

Hosts →

If you're cost-cutting from GPT-5 / Claude…

DeepSeek V4 Flash

Profile →

§ 06 / BACKGROUND

The company behind it.

www.deepseek.com →

DeepSeek was founded in July 2023 in Hangzhou by Liang Wenfeng, who also co-founded and runs the quantitative hedge fund High-Flyer (started in 2015–2016). DeepSeek is owned and funded by High-Flyer rather than by external venture capital — an unusual structure that let it train large models on a GPU cluster the fund had already built for trading.

The lab's signature is compute efficiency. Its mixture-of-experts (MoE) training recipes delivered frontier-class quality at a small fraction of the budgets reported by US labs. Liang has reportedly held a controlling personal stake (~84% as of 2024), and the team is famously lean — on the order of ~150 people, with many hired straight out of university.

DeepSeek's breakout moment came with V3 (December 2024) and then R1 (January 2025): R1 matched OpenAI o1-class reasoning at roughly 1/27 the price, the consumer app briefly topped the US App Store, and the release triggered a sharp sell-off in AI hardware stocks. In April 2026 the lab shipped the V4 family — a 1.6T-parameter MoE (V4 Pro) and a 284B MoE (V4 Flash), both with 1M-token context and MIT-licensed open weights.

Pricing is the headline: V4 Pro runs at $0.435/$0.87 per M under a 75% launch promo (reverting to $1.74/$3.48 after May 31, 2026), and V4 Flash at $0.14/$0.28 — roughly an order of magnitude below frontier US models, with cache-hit input cut to about 1/10 of base.

The trade-offs are real: models are text-only (no native vision or audio), the company is China-based with data stored in China by default, and US export-control and procurement-policy questions apply. The MIT weights are the escape hatch — teams that need US/EU residency can self-host. Versus OpenAI and Anthropic, DeepSeek trades multimodal breadth and Western data governance for radically lower cost and full open weights.

§ 07 / COMPETITORS

Other frontier labs.

All providers →

CHINESE PEER

Alibaba (Qwen)

Broad open-weight Qwen lineup with strong multilingual + multimodal coverage and Alibaba Cloud distribution. Comparable Chinese-data-residency questions apply.

Qwen3 family →

PRIMARY RIVAL

OpenAI

Frontier benchmarks + multimodal + ecosystem (GPT-5, ChatGPT, Azure). But ~10× pricier input, closed weights, and US data residency.

GPT-5 family →

FRONTIER LEADER

Anthropic

Leads coding + reasoning with Claude and zero-retention defaults. But far higher prices, no open weights, and no China-cost story.

Claude lineup →

OPEN-WEIGHT PEER

Mistral

EU-native open weights (Apache 2.0) with GDPR-first data residency. Comparable openness, but trails DeepSeek on raw cost-per-token.

EU data residency →

OPEN-WEIGHT LEADER

ByteDance (Doubao)

The Doubao model family from ByteDance — parent of TikTok and Douyin. Served through the Volcano Ark (火山方舟) platform on ByteDance's Volcano Engine cloud, Doubao powers China's most-used consumer AI app and undercuts most frontier labs on price.

14 models · from $0.02/M →

Frequently asked.

Practical questions about DeepSeek pricing, open weights, and data residency.

Q · 01 Which DeepSeek model should I start with? +

For most workloads: DeepSeek V4 Flash ($0.14/$0.28) — it covers both non-thinking and thinking modes cheaply. Step up to V4 Pro ($0.435/$0.87 on promo) for the hardest reasoning. Both ship a 1M-token context. See the use case picker above.

Q · 02 How much cheaper is DeepSeek than OpenAI or Anthropic? +

On input, V4 Pro is roughly 11× cheaper than GPT-5.5 ($0.435 vs $5) and ~34× cheaper on output ($0.87 vs $30). With cache-hit input at $0.0036/M the gap widens further. The honest caveat: DeepSeek is text-only and trails on multimodal.

Q · 03 Is there an off-peak discount? +

Not currently. DeepSeek previously ran off-peak (UTC-night) discounts, but the live pricing page no longer lists off-peak hours. Today the savings come from a flat 75% V4 Pro launch promo (through May 31, 2026) and a cache-hit input price cut to about 1/10 of base.

Q · 04 Are DeepSeek models open-weight? +

Yes. V4 Pro and V4 Flash weights are MIT-licensed on Hugging Face — fully self-hostable, commercially usable, with no per-token fee. This is what lets teams run DeepSeek outside China for data-residency reasons.

Q · 05 Where is my data stored, and is that a problem? +

Using the direct DeepSeek API, data is processed and stored in China by default, which is a blocker for many EU/US compliance and procurement teams. The workaround is to self-host the MIT weights on your own US/EU cloud, or use a Western third-party host. US export-control and policy questions also apply.

Q · 06 What happens after the 75% promo ends on May 31, 2026? +

V4 Pro reverts to its standard list of $1.74/M input and $3.48/M output (¼ of the original launch list). Even at list, that remains far below frontier US models. V4 Flash pricing ($0.14/$0.28) is not part of the promo.

Q · 07 How does DeepSeek compare to Qwen and Mistral? +

Qwen (Alibaba) is the closest Chinese peer with broader multimodal and multilingual coverage; Mistral is the EU open-weight peer with GDPR-native residency. DeepSeek's edge is the lowest cost-per-token at frontier reasoning quality, with the same China-data caveat as Qwen.

Reviewed by Yaroslav Vikhariev Founder · AI//COST · DeepSeek models tested · Pricing pulled from api-docs.deepseek.com

Methodology Report a correction More by Y.V.

DeepSeek API Pricing

The full roster.

DeepSeek V4 Flash

DeepSeek V4 Pro

DeepSeek V3 LAUNCHED DEC 2024

DeepSeek R1 LAUNCHED JAN 2025

All side-by-side.

Pricing across the lineup.

How DeepSeek priced 4 models · DEC 24 → APR 26.

The DeepSeek story so far.

Where to get it.

Which DeepSeek for what.

The company behind it.

Other frontier labs.

Alibaba (Qwen)

OpenAI

Anthropic

Mistral

Meta

ByteDance (Doubao)

Frequently asked.