Last verified 2026-06-14

OPEN-WEIGHT GIANTQWEN SERIES 2023ALIBABA CLOUDAPACHE 2.0119 LANGUAGES

Qwen API Pricing

Q: Which Qwen model should I start with?

For most teams: Qwen 3.5 Plus ($0.40/$2.40) as a multimodal daily driver, or Qwen3 Max ($1.20/$6.00) for the proprietary frontier. Want to self-host? Use the open-weight Qwen 3.5 397B. Drop to Qwen 3.5 Flash ($0.10/$0.40) for cheap 1M-context volume. See the use case picker.

Q: Why are the prices here lower than what I'm billed?

Model Studio uses tiered pricing by input length. The rates shown are the 0–32K input range on the International (Singapore) region; longer prompts cost more (for example Qwen3 Coder Plus climbs from $1/$5 to $6/$60 at the 256K–1M tier). Batch API offers a 50% discount.

Q: Where is my data stored?

It depends on the endpoint. The International (Singapore) Model Studio region keeps non-China traffic in Singapore; the China-domestic region keeps data in mainland China. For full control — or to avoid both — self-host the Apache-2.0 weights on your own cloud. US export-control and procurement questions apply.

Q: How many languages does Qwen support?

Qwen3 was trained on roughly 36 trillion tokens across 119 languages and dialects — one of the widest multilingual footprints of any major model family, which is a key reason for its global open-weight adoption.

Q: What's the difference between Qwen3 Max and the open Qwen 3.5 models?

Qwen3 Max is the proprietary, cloud-only flagship (multimodal, 252K context, $1.20/$6.00). The Qwen 3.5 open MoE models (397B-A17B, 122B-A10B) are Apache-2.0 and self-hostable, at lower per-token cost. Choose Max for the managed top tier; choose 3.5 open for control and self-hosting.

The Qwen (Tongyi Qianwen) model family from Alibaba Cloud's Tongyi Lab. First released in 2023, Qwen is the most-downloaded open-weight model family in the world — most tiers ship under Apache 2.0 on Hugging Face and ModelScope, while the proprietary Max tier is served through Alibaba Cloud's Model Studio.

Open Model Studio →View all models Calculate cost

Production models

Qwen3 / Qwen3.7 text + multimodal

Qwen series

2023

Tongyi Lab

Cheapest tier

$0.05/M

Qwen3 VL Flash input

Open weights

Apache 2.0

HF + ModelScope

Context window

Qwen3.7 Max / Plus

Languages

119

trained on 36T tokens

§ 01 / LINEUP

The full roster.

Side-by-side →

REASONING

Qwen3 235B A22B Thinking 2507 LAUNCHED JUL 2025

Alibaba Model Studio Singapore/International row: CNY 1.688/M input and CNY 16.88/M thinking-mode output.

262K ctx · text-only →

TEXT

Qwen3 235B A22B Instruct 2507 LAUNCHED JUL 2025

Alibaba Model Studio Singapore/International row: CNY 1.688/M input and CNY 6.752/M output.

262K ctx · text-only →

FRONTIER · REASONING

Qwen3 235B A22B

Open-weight Qwen3 frontier — 235B total params, 22B active per token (MoE), Apache 2.0.

131K ctx · text-only →

REASONING

Qwen3 Next 80B A3B Thinking LAUNCHED SEP 2025

Alibaba Model Studio Singapore/International row: CNY 1.101/M input and CNY 8.807/M thinking-mode output.

262K ctx · text-only →

TEXT

Qwen3 Next 80B A3B Instruct LAUNCHED SEP 2025

Alibaba Model Studio Singapore/International row: CNY 1.101/M input and CNY 8.807/M output.

262K ctx · text-only →

BALANCED · MID-TIER

Qwen3 32B

Open-weight dense 32B Qwen3 - Apache 2.0.

131K ctx · text-only →

REASONING

Qwen3 30B A3B Thinking 2507 LAUNCHED JUL 2025

Alibaba Model Studio Singapore/International row: CNY 1.468/M input and CNY 17.614/M thinking-mode output.

262K ctx · text-only →

TEXT

Qwen3 30B A3B Instruct 2507 LAUNCHED JUL 2025

Alibaba Model Studio Singapore/International row: CNY 1.468/M input and CNY 5.871/M output.

262K ctx · text-only →

TEXT

Qwen3 30B A3B LAUNCHED APR 2025

Alibaba Model Studio Singapore/International row: CNY 1.468/M input and CNY 5.871/M output.

128K ctx · text-only →

FAST · LIGHTWEIGHT

Qwen3 14B

Open-weight dense 14B Qwen3 - Apache 2.0.

131K ctx · text-only →

TEXT

Qwen3 8B LAUNCHED APR 2025

Alibaba Model Studio Singapore/International row: CNY 1.321/M input and CNY 5.137/M output.

128K ctx · text-only →

FRONTIER · REASONING

Qwen3.7 Max LAUNCHED MAY 2026

International/Singapore row is the AI//COST baseline: CNY 18.736/M input and CNY 56.207/M output for 0-1M tokens, converted at 1 CNY = $0.14788 (Frankfurter latest weekday rate for 2026-06-12, chec…

Qwen3.7 Plus LAUNCHED JUN 2026

International/Singapore row is now the AI//COST baseline: CNY 2.998/M input and CNY 11.991/M output for 0-256K tokens; 256K-1M is CNY 8.993/M input and CNY 35.972/M output.

Qwen3.6 35B A3B LAUNCHED APR 2026

Alibaba Model Studio International/Singapore row verified 2026-06-30: CNY 2.810325/M input and CNY 16.86195/M output.

256K ctx · vision →

FRONTIER · REASONING

Qwen3.6 27B LAUNCHED APR 2026

Alibaba Model Studio International/Singapore row verified 2026-06-30: CNY 4.49652/M input and CNY 26.97912/M output.

256K ctx · vision →

BALANCED

Qwen3.6 Plus LAUNCHED APR 2026

Alibaba Model Studio Singapore/International row: CNY 3.7471/M input and CNY 22.4826/M output for the base token range; long-context row is CNY 14.9884/M input and CNY 44.965/M output.

Qwen3.6 Flash LAUNCHED APR 2026

Alibaba Model Studio International/Singapore row verified 2026-06-30: CNY 1.87355/M input and CNY 11.2413/M output; long-context row CNY 7.4942/M input and CNY 29.9758/M output.

Qwen 3.5 397B A17B

Open-weight Qwen 3.5 flagship MoE - 397B total parameters, 17B active per token.

256K ctx · text-only →

BALANCED · MID-TIER

Qwen 3.5 122B A10B

Open-weight Qwen 3.5 mid-tier MoE - 122B total parameters, 10B active per token.

256K ctx · text-only →

BALANCED · MID-TIER

Qwen 3.5 Plus

International deployment (Singapore).

256K ctx · vision →

FAST · LIGHTWEIGHT

Qwen 3.5 Flash

International deployment (Singapore).

Qwen3 Max

Listed prices are for Alibaba Cloud Model Studio International deployment (Singapore) at 0-32K input tokens: $1.20/M input and $6.00/M output.

252K ctx · vision →

BALANCED · MID-TIER

Qwen3 VL Plus

Vision-language sibling of Qwen 3.5 Plus on Model Studio.

256K ctx · vision →

FAST · LIGHTWEIGHT

Qwen3 VL Flash

Cheapest tier of the Qwen3 VL family on Model Studio.

256K ctx · vision →

BALANCED · MID-TIER

Qwen3 Coder Plus

International/Singapore Qwen3 Coder Plus row: CNY 7.339/M input and CNY 36.696/M output for 0-32K input, converted at 1 CNY = $0.14788 (Frankfurter latest weekday rate for 2026-06-12, checked 2026-…

Qwen3 Coder Flash

International/Singapore Qwen3 Coder Flash row: CNY 2.202/M input and CNY 11.009/M output for 0-32K input, converted at 1 CNY = $0.14788 (Frankfurter latest weekday rate for 2026-06-12, checked 2026…

Qwen3 Coder Next LAUNCHED MAR 2026

Alibaba Model Studio International row: $0.30/M input and $1.50/M output for 0-32K input; $0.50/$2.50 at 32K-128K and $0.80/$4.00 at 128K-256K.

262K ctx · text-only →

BALANCED · MID-TIER

QwQ Plus

Proprietary reasoning model in the QwQ family on Model Studio.

131K ctx · text-only →

PREVIEW · EARLY ACCESS

Qwen3.6 Max Preview LAUNCHED JUN 2026

Alibaba Model Studio Singapore/International row: CNY 9.742/M input and CNY 58.455/M output for the base token range; long-context row is CNY 14.988/M input and CNY 89.93/M output.

256K ctx · text-only →

DEPRECATED · 2026-07-11

QwQ 32B

Open-weight Qwen reasoning model — 32B params, Apache 2.0.

131K ctx · text-only →

DEPRECATED · RETIRING

Qwen Turbo LAUNCHED APR 2025

Deprecated text model.

LEGACY · STILL SUPPORTED

Qwen 2.5 72B Instruct LAUNCHED SEP 2024

Open-weight Qwen 2.5 flagship dense model — 72.7B params, Qwen Research License (non-Apache because of size).

131K ctx · text-only →

LEGACY · STILL SUPPORTED

Qwen 2.5 VL 72B Instruct LAUNCHED JAN 2025

Vision-language flagship of the Qwen 2.5 era — 72B params, strong on document and chart understanding, agentic UI.

131K ctx · vision →

LEGACY · STILL SUPPORTED

Qwen 2.5 32B Instruct LAUNCHED SEP 2024

Open-weight Qwen 2.5 mid-tier — 32B dense, Apache 2.0.

131K ctx · text-only →

LEGACY · STILL SUPPORTED

Qwen 2.5 Coder 32B Instruct LAUNCHED NOV 2024

Code-specialised Qwen 2.5 — 32B dense, Apache 2.0.

131K ctx · text-only →

LEGACY · STILL SUPPORTED

Qwen 2.5 14B Instruct LAUNCHED SEP 2024

Open-weight Qwen 2.5 14B dense — Apache 2.0.

131K ctx · text-only →

LEGACY · STILL SUPPORTED

Qwen 2.5 7B Instruct LAUNCHED SEP 2024

Open-weight Qwen 2.5 7B dense — Apache 2.0.

131K ctx · text-only →

LEGACY · STILL SUPPORTED

Qwen Max (2.5) LAUNCHED JAN 2025

Legacy proprietary Qwen 2.5 Max row.

32K ctx · text-only →

LEGACY · STILL SUPPORTED

Qwen Plus (2.5) LAUNCHED DEC 2025

Legacy proprietary Qwen Plus tier.

LEGACY · STILL SUPPORTED

Qwen Flash (2.5) LAUNCHED JUL 2025

Legacy proprietary Qwen Flash tier.

LEGACY · STILL SUPPORTED

Qwen VL Max LAUNCHED JAN 2024

Original Qwen vision-language flagship.

128K ctx · vision →

LEGACY · STILL SUPPORTED

Qwen VL Plus LAUNCHED JAN 2024

Original cost-efficient Qwen vision-language model.

128K ctx · vision →

§ 02 / SHELF

All side-by-side.

Methodology →

Model	Input /M	Output /M	Cached	Context	Max output	Vision	Tools	Tier
Qwen3 235B A22B Thinking 2507	$0.25	$2.49	—	262K	—	✗	✓	Active
Qwen3 235B A22B Instruct 2507	$0.25	$1.0	—	262K	—	✗	✓	Active
Qwen3 235B A22B	$0.7	$2.80	—	131K	—	✗	✓	Frontier
Qwen3 Next 80B A3B Thinking	$0.16	$1.30	—	262K	—	✗	✓	Active
Qwen3 Next 80B A3B Instruct	$0.16	$1.30	—	262K	—	✗	✓	Active
Qwen3 32B	$0.16	$0.64	—	131K	—	✗	✓	Mid
Qwen3 30B A3B Thinking 2507	$0.22	$2.60	—	262K	—	✗	✓	Active
Qwen3 30B A3B Instruct 2507	$0.22	$0.87	—	262K	—	✗	✓	Active
Qwen3 30B A3B	$0.22	$0.87	—	128K	—	✗	✓	Active
Qwen3 14B	$0.35	$1.40	—	131K	—	✗	✓	Light
Qwen3 8B	$0.2	$0.76	—	128K	—	✗	✓	Active
Qwen3.7 Max FLAGSHIP	$2.77	$8.31	—	1M	—	✗	✓	Frontier
Qwen3.7 Plus	$0.44	$1.77	—	1M	—	✓	✓	Active
Qwen3.6 35B A3B	$0.41	$2.48	$0.41	256K	—	✓	✓	Active
Qwen3.6 27B	$0.66	$3.98	$0.66	256K	—	✓	✓	Frontier
Qwen3.6 Plus	$0.55	$3.32	—	1M	—	✗	✓	Active
Qwen3.6 Flash	$0.28	$1.66	$0.28	1M	—	✓	✓	Active
Qwen 3.5 397B A17B	$0.6	$3.60	—	256K	—	✗	✓	Frontier
Qwen 3.5 122B A10B	$0.4	$3.20	—	256K	—	✗	✓	Mid
Qwen 3.5 Plus	$0.4	$2.40	—	256K	—	✓	✓	Mid
Qwen 3.5 Flash	$0.1	$0.4	—	1M	—	✓	✓	Light
Qwen3 Max	$1.20	$6.00	—	252K	—	✓	✓	Frontier
Qwen3 VL Plus	$0.2	$1.60	—	256K	—	✓	✓	Mid
Qwen3 VL Flash	$0.05	$0.4	—	256K	—	✓	✓	Light
Qwen3 Coder Plus	$1.09	$5.43	—	1M	—	✗	✓	Mid
Qwen3 Coder Flash	$0.33	$1.63	—	1M	—	✗	✓	Light
Qwen3 Coder Next	$0.3	$1.50	—	262K	—	✗	✓	Active
QwQ Plus	$0.8	$2.40	—	131K	—	✗	✓	Mid
Qwen3.6 Max Preview PREVIEW	$1.44	$8.64	—	256K	—	✗	✓	Preview
QwQ 32B DEPRECATED	$0.29	$0.86	—	131K	—	✗	✓	Deprecated
Qwen Turbo DEPRECATED	$0.05	$0.2	—	1M	—	✗	✓	Deprecated
Qwen 2.5 72B Instruct LEGACY	$1.40	$5.60	—	131K	—	✗	✓	Legacy
Qwen 2.5 VL 72B Instruct LEGACY	$2.80	$8.40	—	131K	—	✓	✓	Legacy
Qwen 2.5 32B Instruct LEGACY	$0.7	$2.80	—	131K	—	✗	✓	Legacy
Qwen 2.5 Coder 32B Instruct LEGACY	$0.29	$0.86	—	131K	—	✗	✓	Legacy
Qwen 2.5 14B Instruct LEGACY	$0.35	$1.40	—	131K	—	✗	✓	Legacy
Qwen 2.5 7B Instruct LEGACY	$0.17	$0.7	—	131K	—	✗	✓	Legacy
Qwen Max (2.5) LEGACY	$1.60	$6.40	—	32K	—	✗	✓	Legacy
Qwen Plus (2.5) LEGACY	$0.4	$1.20	—	1M	—	✗	✓	Legacy
Qwen Flash (2.5) LEGACY	$0.05	$0.4	—	1M	—	✗	✓	Legacy
Qwen VL Max LEGACY	$0.8	$3.20	—	128K	—	✓	✓	Legacy
Qwen VL Plus LEGACY	$0.21	$0.63	—	128K	—	✓	✓	Legacy

§ 03 / PRICE CURVE

Pricing across the lineup.

How Alibaba (Qwen) priced 20 models · APR 25 → JUN 26.

oldest → newest →

Input · newest $1.4/M

Output · newest $8.6/M

Each point is a model at its listed $/M price.

The Qwen story so far.

Major releases and the open-weight cadence from Alibaba's Tongyi Lab. Sourced from Qwen blog/changelog, Alibaba Cloud Model Studio, and Wikipedia.

JUN 14 - 2026

Qwen3.6 Max Preview and Qwen3.6 Plus added from Alibaba International pricing rows

PRICING

JUN 09 - 2026

AI//COST adds current Qwen3 Next and 2507 International pricing rows from Alibaba Model Studio

PRICING

MAY 21 · 2026

Qwen3.7 Max released — new text-only Max flagship with thinking enabled by default

RELEASE

JUN 01 · 2026

Qwen3.7 Plus released — multimodal Plus tier with GUI perception and visual reference-to-code workflows

RELEASE

APR · 2026

Qwen3.5-Omni + Qwen3.6-Plus released — newest tiers shipped as proprietary (cloud-only)

RELEASE

FEB 16 · 2026

Qwen 3.5 family launched — Plus, Flash, and open-weight 397B/122B MoE models

RELEASE

JAN 23 · 2026

Qwen3 Max released — proprietary multimodal flagship at $1.20/$6.00, 252K context

RELEASE

JAN · 2026

Qwen passes 200,000 derivative models — first open model family to hit the milestone

CORPORATE

APR · 2025

Qwen3 released — 8 sizes, trained on 36T tokens across 119 languages, Apache 2.0 open weights

RELEASE

MAR 06 · 2025

QwQ 32B full release — open-weight reasoning model with strong math/code scores for its size

RELEASE

2023

Qwen (Tongyi Qianwen) series launched by Alibaba Cloud's Tongyi Lab; open-sourcing begins

CORPORATE

§ 04 / ACCESS

Where to get it.

Methodology →

PRIMARY · CLOUD

Alibaba Cloud Model Studio

Model Studio (Bailian / DashScope) — managed API for the full Qwen catalog, including proprietary tiers. The International (Singapore) endpoint serves non-China traffic; pricing is tiered by input length.

Managed API →

OPEN-WEIGHT · SELF-HOST

Hugging Face + ModelScope

Most Qwen tiers ship Apache 2.0 open weights on Hugging Face and ModelScope — fully self-hostable. The most-downloaded open model family, with 300+ models released.

Apache 2.0 weights →

CONSUMER · WEB

Qwen Chat

chat.qwen.ai — free consumer assistant covering chat, vision, and coding. For end-user usage; no API access on this surface.

Free web →

CHINA · DOMESTIC

Model Studio (China)

The China-region Model Studio endpoint keeps data inside mainland China for domestic compliance — the other half of Qwen's dual-deployment model alongside Singapore.

China data residency →

§ 04 / BEST FOR

Which Alibaba (Qwen) for what.

More scenarios →

If you need multimodal GUI / coding-agent work…

Qwen3.7 Plus

Profile →

If you need the newest Qwen text-only flagship…

Qwen3.7 Max

Profile →

If you need the proprietary multilingual flagship…

Qwen3 Max

Profile →

If you want an open-weight frontier MoE to self-host…

Qwen 3.5 397B

Profile →

If you need transparent reasoning…

QwQ Plus

Profile →

If you're building agentic coding tools…

Qwen3 Coder Plus

Profile →

If you process documents, charts, or UI images…

Qwen3 VL Plus

Profile →

If you want cheap high-volume 1M context…

Qwen 3.5 Flash

Profile →

§ 06 / BACKGROUND

The company behind it.

qwen.ai →

Qwen — branded Tongyi Qianwen (通义千问) — is the large-model family from Alibaba Cloud's Tongyi Lab, which was established in 2022; the Qwen series was first released in 2023 and open-sourcing began that year. It is developed in Hangzhou under Alibaba Group, a public company (NYSE: BABA, HKEX: 9988).

Qwen's defining strategy is open weights at scale. The lab has released 300+ models spanning text, coding, vision, speech, and video, mostly under the permissive Apache 2.0 license on Hugging Face and ModelScope. By March 2026 the family had been downloaded over 940 million times and spawned 200,000+ derivative models — making Qwen the most-downloaded open-weight model family in the world.

Technically, Qwen3 (April 2025) was trained on roughly 36 trillion tokens across 119 languages and dialects, with both dense and mixture-of-experts (MoE) variants from 0.6B up to 397B parameters. The 2026 Qwen 3.5 generation continued the open MoE line (397B-A17B, 122B-A10B) alongside managed Plus/Flash tiers, while the top Qwen3 Max flagship is served as a proprietary, cloud-only model.

Distribution is dual-track: managed inference through Alibaba Cloud Model Studio (with separate International/Singapore and China-domestic endpoints), and self-hosting via open weights. Model Studio pricing is tiered by input length — the headline rates here are the 0–32K range on the Singapore region, and costs scale up for longer prompts.

The trade-offs: most models are text-or-vision with strong multilingual coverage, but the company is China-based, so the direct API stores data per its region (China or Singapore), and US export-control / procurement questions apply. Versus DeepSeek, Qwen offers a far broader lineup and multimodal range; versus OpenAI and Anthropic, it trades some frontier-benchmark lead for radically more open weights and lower cost.

§ 07 / COMPETITORS

Other frontier labs.

All providers →

CHINESE PEER

DeepSeek

Lowest cost-per-token at frontier reasoning quality with MIT open weights. But text-only and a far narrower lineup than Qwen's full-modality catalog.

$0.14/M input →

PRIMARY RIVAL

OpenAI

Frontier benchmarks + ecosystem (GPT-5, ChatGPT, Azure). But closed weights, US data residency, and multiples higher cost than Qwen's open tiers.

GPT-5 family →

FRONTIER LEADER

Anthropic

Leads coding + reasoning with Claude and zero-retention defaults. But no open weights and no low-cost / multilingual story to match Qwen.

Claude lineup →

OPEN-WEIGHT PEER

Mistral

EU-native open weights (Apache 2.0) with GDPR-first residency — the European answer to Qwen's openness, but a smaller catalog and no China-domestic option.

EU data residency →

CHINA CONSUMER LEADER

ByteDance (Doubao)

The Doubao model family from ByteDance — parent of TikTok and Douyin. Served through the Volcano Ark (火山方舟) platform on ByteDance's Volcano Engine cloud, Doubao powers China's most-used consumer AI app and undercuts most frontier labs on price.

14 models · from $0.02/M →

Frequently asked.

Practical questions about Qwen pricing, open weights, and deployment.

Q · 01 Which Qwen model should I start with? +

For most teams: Qwen 3.5 Plus ($0.40/$2.40) as a multimodal daily driver, or Qwen3 Max ($1.20/$6.00) for the proprietary frontier. Want to self-host? Use the open-weight Qwen 3.5 397B. Drop to Qwen 3.5 Flash ($0.10/$0.40) for cheap 1M-context volume. See the use case picker.

Q · 02 Are Qwen models really open-weight? +

Mostly yes. The dense and MoE Qwen3 / Qwen 3.5 models (e.g. 397B-A17B, 32B, QwQ 32B) ship under Apache 2.0 on Hugging Face and ModelScope — fully self-hostable and commercially usable. The top Qwen3 Max tier and some newest releases are proprietary, cloud-only. Always check each model's license.

Q · 03 Why are the prices here lower than what I'm billed? +

Model Studio uses tiered pricing by input length. The rates shown are the 0–32K input range on the International (Singapore) region; longer prompts cost more (for example Qwen3 Coder Plus climbs from $1/$5 to $6/$60 at the 256K–1M tier). Batch API offers a 50% discount.

Q · 04 Where is my data stored? +

It depends on the endpoint. The International (Singapore) Model Studio region keeps non-China traffic in Singapore; the China-domestic region keeps data in mainland China. For full control — or to avoid both — self-host the Apache-2.0 weights on your own cloud. US export-control and procurement questions apply.

Q · 05 How many languages does Qwen support? +

Qwen3 was trained on roughly 36 trillion tokens across 119 languages and dialects — one of the widest multilingual footprints of any major model family, which is a key reason for its global open-weight adoption.

Q · 06 What's the difference between Qwen3 Max and the open Qwen 3.5 models? +

Qwen3 Max is the proprietary, cloud-only flagship (multimodal, 252K context, $1.20/$6.00). The Qwen 3.5 open MoE models (397B-A17B, 122B-A10B) are Apache-2.0 and self-hostable, at lower per-token cost. Choose Max for the managed top tier; choose 3.5 open for control and self-hosting.

Q · 07 How does Qwen compare to DeepSeek and Llama? +

DeepSeek is cheaper per token but text-only with a narrow lineup; Qwen offers full-modality breadth (vision, coding, speech) and 300+ open models. Versus Meta's Llama, Qwen ships more sizes and modalities and tops global download charts, but Llama carries a Western governance posture some teams prefer.

Reviewed by Yaroslav Vikhariev Founder · AI//COST · Qwen models tested · Pricing pulled from Alibaba Cloud Model Studio

Methodology Report a correction More by Y.V.