Last verified 2026-06-08

VERIFIED PRICERETIRED MODEL128KTEXT MODELPROMPT CACHING

DeepSeek V3 API Pricing

Q: How much does DeepSeek V3 cost?

DeepSeek V3 is listed at $0.27/M input and $1.1/M output.

Q: Is cached-input pricing included?

Yes. Cached input is shown at $0.07/M.

DeepSeek V3 is retired from the current price table; this archive preserves the $0.27/$1.10 baseline. $0.27/M input, $1.1/M output, and $0.07/M cached input. It is kept for the Jan 2025 price shock and migration comparisons against V4 Flash.

Input - per 1M tokens

$0.27/M

Source DeepSeek retired

Output - per 1M tokens

$1.10/M

Use for cost checks retired

Cached input

$0.07/M

Cache hit price discounted

Effective - agentic blend

$0.19/M

92/8 split - 82% cache

§ 01 / TERMINAL

Run the numbers.

Calculator pre-loaded with DeepSeek V3 rates. Tweak spend, output mix, or cache hit rate to compare this model with nearby alternatives.

Spend

$ /mo

Workload split

Prompt cache hit rate

Tokens you can process

—

Words equivalent (English)

—

Effective rate

—

Open full calculator (all models · share URL · CSV) →

§ 02 / SCENARIOS

Real-world presets.

AGENT LOOP

Repo-wide bug fix

$0.016/task

81k in - 7k out~6,250 units/$100

LONG DOC

Reading 100-page contracts

$0.027/doc

175k in - 8k out~3,703 units/$100

RAG SUPPORT

Support agent ticket triage

$0.002/ticket

4k in - 1k out~50,000 units/$100

ASSISTANT TURN

Research planning turn

$0.003/turn

12k in - 2k out~33,333 units/$100

§ 03 / TOKENIZER

Paste text. See tokens. See cost.

Your text · live count

Calibrated · measured on the vendor's tokenizer · 2026-06-10 Auto-counts as you type

Counts use a chars-per-token calibration measured on the vendor's own published tokenizer (deepseek-ai/DeepSeek-V3.2-Exp, 2026-06-10). English prose is typically within a few percent; code and non-Latin scripts tokenize heavier. For billing-exact counts use the vendor's count-tokens API.

Characters 260

Words 39

Tokens (estimated) 49 tokens

Cost as input · uncached $0.000013 USD

Cost as output · uncached $0.000054 USD

Cost as cached input $0.000003 USD

§ 04 / SHELF

Up against the shelf.

All models →

Model	Input /M	Output /M	Effective blended	Context	Best for
DeepSeek V3	$0.27 cache $0.07	$1.10	$0.19 current page	128K	Cheap DeepSeek workloads
DeepSeek V4 Flash	$0.14 cache $0.00	$0.28	$0.05 cheaper	1M	Cheap DeepSeek workloads
DeepSeek V4 Pro	$0.43 cache $0.00	$0.87	$0.14 cheaper	1M	Low-cost reasoning
GPT-5.4 mini	$0.75 cache $0.07	$4.50	$0.54 pricier	400K	Open-weight multimodal work
Claude Sonnet 4.6	$3.00 cache $0.30	$15.00	$1.92 pricier	1M	Open-weight multimodal work
Gemini 2.5 Flash	$0.30 cache $0.03	$2.50	$0.27 pricier	1M	Open-weight multimodal work
Grok 4.3	$1.25 cache $0.20	$2.50	$0.56 pricier	1M	Grok long-context agents

§ 05 / DEEP LINKS

Specific scenarios.

All calculators →

By comparison

Compare DeepSeek → Compare Mistral → Compare Grok →

Frequently asked.

Short answers for teams checking DeepSeek V3 pricing, status, and migration choices.

Q · 01 Is DeepSeek V3 still available? +

DeepSeek V3 is retired. Keep this page for pricing history, old invoices, and migration planning.

Q · 02 How much does DeepSeek V3 cost? +

DeepSeek V3 is listed at $0.27/M input and $1.1/M output.

Q · 03 Is cached-input pricing included? +

Yes. Cached input is shown at $0.07/M.

Q · 04 What should teams compare it against? +

Use the shelf to compare nearby current models. Current replacement: deepseek-v4-flash non-thinking mode.

Q · 05 Why keep this page? +

These pages preserve live price checks, preview economics, and migration context in one consistent calculator format.

Reviewed by Yaroslav Vikhariev Founder - AI//COST - Last verified 2026-06-08 - Vendor pricing

Methodology Report a correction More by Y.V.