Last verified 2026-06-14

CODE BUDGET1M CONTEXTSINGAPORE BASELINELOW COST

Qwen3 Coder Flash API Pricing

Q: What is Qwen3 Coder Flash priced at?

Qwen3 Coder Flash is shown at CNY 2.202/M input and CNY 11.009/M output on Alibaba Cloud Model Studio's International/Singapore 0-32K deployment row. AI//COST converts that to $0.3256/M and $1.628/M.

Q: Which API model ID should I use?

Use the rolling model ID qwen3-coder-flash. Alibaba states its current capability is equivalent to qwen3-coder-flash-2025-07-28.

Qwen3 Coder Flash is Alibaba's code-specialized Qwen3 model for agentic software work. The official International / Singapore 0-32K row lists CNY 2.202/M input and CNY 11.009/M output, shown here as $0.3256/M and $1.628/M. Converted from CNY at 1 CNY = $0.14788 (Frankfurter latest weekday rate for 2026-06-12, checked 2026-06-14).

Input - per 1M tokens

$0.33/M

International 0-32K input tier 0-32K

Output - per 1M tokens

$1.63/M

Output 0-32K input tier 0-32K

Cache not itemized

$0.33/M

Cache discount noted, price not itemized not itemized

Effective - agentic blend

$0.43/M

92/8 split - no cache discount

§ 01 / TERMINAL

Run the numbers.

Live calculator pre-loaded with the official Qwen3 Coder Flash International/Singapore 0-32K token row, converted from CNY to USD. Alibaba publishes higher prices for longer input bands up to 1M context.

Spend

$ /mo

Workload split

Prompt cache hit rate

Tokens you can process

—

Words equivalent (English)

—

Effective rate

—

Open full calculator (all models · share URL · CSV) →

§ 02 / SCENARIOS

Real-world presets.

CODING AGENT

Repo patch

$0.052/task

120,000 in - 8,000 out~1,919 units/$100

CODE REVIEW

Pull request review

$0.020/review

45,000 in - 3,500 out~4,901 units/$100

TEST GEN

Unit test drafting

$0.010/file

18,000 in - 2,500 out~10,101 units/$100

CHATBOT

Developer assistant

$0.002/turn

3,500 in - 700 out~43,478 units/$100

§ 03 / TOKENIZER

Paste text. See tokens. See cost.

Your text · live count

Calibrated · measured on the vendor's tokenizer · 2026-06-10 Auto-counts as you type

Counts use a chars-per-token calibration measured on the vendor's own published tokenizer (Qwen/Qwen3.5-397B-A17B, 2026-06-10). English prose is typically within a few percent; code and non-Latin scripts tokenize heavier. For billing-exact counts use the vendor's count-tokens API.

Characters 457

Words 64

Tokens (estimated) 87 tokens

Cost as input · uncached $0.000028 USD

Cost as output · uncached $0.000142 USD

Cost as cached input $0.000028 USD

§ 04 / SHELF

Up against the shelf.

All models →

Model	Input /M	Output /M	Effective blended	Context	Best for
Qwen3 Coder Flash Current	$0.33	$1.63	$0.43 International 0-32K tier	1M	High-volume coding assistants
Qwen3 Max	$1.20	$6.00	$1.58 pricier	252K	Frontier Qwen proprietary reasoning
Qwen 3.5 Plus	$0.40	$2.40	$0.56 pricier	256K	General Qwen production workloads
Qwen 3.5 Flash	$0.10	$0.40	$0.12 cheaper	1M	Cheap long-context Qwen traffic
Qwen3 Coder Plus	$1.09	$5.43	$1.43 pricier	1M	Agentic coding and code review
QwQ Plus	$0.80	$2.40	$0.93 pricier	131K	Proprietary reasoning workloads
QwQ 32B	$0.29	$0.86	$0.33 cheaper	131K	Open reasoning on a budget
Qwen3 235B A22B	$0.70	$2.80	$0.87 pricier	131K	Open MoE reasoning baseline
GPT-5.4 mini	$0.75 cache $0.07	$4.50	$0.54 pricier	400K	Coding and computer-use workloads
Gemini 2.5 Flash	$0.30 cache $0.03	$2.50	$0.27 cheaper	1M	Low-latency multimodal RAG
DeepSeek V4 Flash	$0.14 cache $0.00	$0.28	$0.05 cheaper	1M	Ultra-cheap API throughput

Frequently asked.

Practical pricing questions, separated from calculator assumptions and regional tiers.

Q · 01 What is Qwen3 Coder Flash priced at? +

Qwen3 Coder Flash is shown at CNY 2.202/M input and CNY 11.009/M output on Alibaba Cloud Model Studio's International/Singapore 0-32K deployment row. AI//COST converts that to $0.3256/M and $1.628/M.

Q · 02 Does this page include higher context pricing tiers? +

The quote tiles use the 0-32K International/Singapore tier. Alibaba also lists 32K-128K, 128K-256K, and 256K-1M input bands for Qwen3 Coder Flash; the highest band reaches $1.7366/M input and $10.4192/M output after FX conversion.

Q · 03 Is prompt caching priced separately? +

Alibaba marks Qwen3 Coder Flash as eligible for Context Cache discount, but the pricing table does not publish a separate cache-read token amount. AI//COST therefore keeps cached input at $0.3256/M until Alibaba lists the exact cache-read price.

Q · 04 How is the effective price calculated? +

AI//COST uses the same 92/8 agentic blend everywhere. With no exact cache-read token price published for this row, Qwen3 Coder Flash's effective blended cost is $0.4298/M.

Q · 05 Is there a free quota or batch discount? +

Alibaba lists a 90-day activation free quota for many International Qwen rows, but not every open-source row includes one. Batch and context-cache support are model-specific; this page only publishes prices that are explicit in the vendor table.

Q · 06 Which API model ID should I use? +

Use the rolling model ID qwen3-coder-flash. Alibaba states its current capability is equivalent to qwen3-coder-flash-2025-07-28.

Reviewed by Yaroslav Vikhariev Founder - AI//COST - Pricing pulled daily from Alibaba Cloud Model Studio - Last verified Jun 14, 2026

Methodology Report a correction More by Y.V.