Last verified 2026-07-11

OPEN MOE122B TOTAL10B ACTIVE256K CONTEXT

Qwen 3.5 122B A10B API Pricing

Q: What is Qwen 3.5 122B A10B priced at?

Qwen 3.5 122B A10B is shown at $0.4/M input and $3.2/M output in Alibaba Cloud Model Studio's International/Singapore deployment section.

Qwen 3.5 122B A10B is the mid-sized open-weight Qwen 3.5 MoE tier. Baseline International/Singapore rates are $0.40/M input and $3.20/M output. Pulled directly from alibabacloud.com daily.

Input - per 1M tokens

$0.40/M

Source Alibaba Cloud flat

Output - per 1M tokens

$3.20/M

Context 256K flat

Cache N/A

$0.40/M

Cache no dollar row not listed

Effective - agentic blend

$0.62/M

92/8 split - no cache

§ 01 / TERMINAL

Run the numbers.

Live calculator pre-loaded with current Qwen 3.5 122B A10B rates. Tweak spend, output mix, or cache assumptions and share the URL to share the calculation.

Spend

$ /mo

Workload split

Prompt cache hit rate

Tokens you can process

—

Words equivalent (English)

—

Effective rate

—

Open full calculator (all models · share URL · CSV) →

§ 02 / SCENARIOS

Real-world presets.

CHATBOT

Domain assistant

$0.004/turn

3.5k in - 800 out~25,000 units/$100

RAG

Policy answer

$0.016/query

25k in - 1.8k out~6,329 units/$100

AGENT

Planning task

$0.031/task

50k in - 3.5k out~3,205 units/$100

BULK

Report summary

$0.009/doc

12k in - 1.2k out~11,628 units/$100

§ 03 / TOKENIZER

Paste text. See tokens. See cost.

Your text · live count

Calibrated · measured on the vendor's tokenizer · 2026-06-10 Auto-counts as you type

Counts use a chars-per-token calibration measured on the vendor's own published tokenizer (Qwen/Qwen3.5-397B-A17B, 2026-06-10). English prose is typically within a few percent; code and non-Latin scripts tokenize heavier. For billing-exact counts use the vendor's count-tokens API.

Characters 434

Words 69

Tokens (estimated) 83 tokens

Cost as input · uncached $0.000033 USD

Cost as output · uncached $0.000266 USD

Cost as cached input $0.000033 USD

§ 04 / SHELF

Up against the shelf.

All models →

Model	Input /M	Output /M	Effective blended	Context	Best for
Qwen 3.5 122B A10B Current	$0.40	$3.20	$0.62 agentic 92/8	256K	Open MoE production chat
Qwen3 Max	$1.20	$6.00	$1.58 pricier	252K	Frontier Qwen proprietary reasoning
Qwen 3.5 Plus	$0.40	$2.40	$0.56 cheaper	256K	General Qwen production workloads
Qwen 3.5 Flash	$0.10	$0.40	$0.12 cheaper	1M	Cheap long-context Qwen traffic
Qwen3 235B A22B	$0.70	$2.80	$0.87 pricier	131K	Open MoE reasoning baseline
Qwen3 32B	$0.16	$0.64	$0.20 cheaper	131K	Open 32B chat and reasoning
Qwen3 14B	$0.35	$1.40	$0.43 cheaper	131K	Compact open Qwen reasoning
Qwen 3.5 397B A17B	$0.60	$3.60	$0.84 pricier	256K	Open MoE frontier workloads
QwQ 32B	$0.29	$0.86	$0.33 cheaper	131K	Open reasoning on a budget
GPT-5.4 mini	$0.75 cache $0.07	$4.50	$0.54 cheaper	400K	Coding and computer-use workloads

Frequently asked.

Practical pricing questions, separated from calculator assumptions and regional tiers.

Q · 01 What is Qwen 3.5 122B A10B priced at? +

Qwen 3.5 122B A10B is shown at $0.4/M input and $3.2/M output in Alibaba Cloud Model Studio's International/Singapore deployment section.

Q · 02 Does this page use International or Global pricing? +

This page uses Alibaba Cloud Model Studio International deployment pricing, where endpoint and data storage are in Singapore and inference resources are dynamically scheduled globally excluding Chinese Mainland. Global and Chinese Mainland sections can list different prices.

Q · 03 Is prompt caching priced separately? +

Alibaba marks context-cache support on some Qwen families, but this row does not publish a concrete cache-read dollar price. The calculator therefore treats cached input as the same $0.4/M baseline instead of inventing a discount.

Q · 04 How is the effective price calculated? +

AI//COST uses the same 92/8 agentic blend everywhere. With no separate cache-read price published for this row, Qwen 3.5 122B A10B's effective blended cost is $0.62/M.

Q · 05 Is there a free quota or batch discount? +

Alibaba lists a 90-day activation free quota for many International Qwen rows, but quota eligibility is model-specific. Batch Invocation is 50% off where the vendor row explicitly marks Batch support; this page only publishes the real-time list prices in the quote tiles.

Q · 06 Are regional prices different? +

Yes. Alibaba Cloud publishes separate International, Global, China (Hong Kong), EU, US, and Chinese Mainland deployment sections. AI//COST uses the International/Singapore baseline for Alibaba queue pages unless a page explicitly says otherwise.

Reviewed by Yaroslav Vikhariev Founder - AI//COST - Pricing pulled daily from alibabacloud.com - Last verified July 11, 2026

Methodology Report a correction More by Y.V.