OPEN MOE · 122B TOTAL · 10B ACTIVE · 256K CONTEXT
Qwen 3.5 122B A10B API Pricing
Qwen 3.5 122B A10B is the mid-sized open-weight Qwen 3.5 MoE tier. Baseline International/Singapore rates are $0.40/M input and $3.20/M output. Rates are pulled directly from alibabacloud.com daily.
Input - per 1M tokens
$0.40/M
Source Alibaba Cloud flat
Output - per 1M tokens
$3.20/M
Context 256K flat
Cached input - per 1M tokens
$0.40/M
No separate cache price listed
Effective - agentic blend
$0.62/M
92/8 split - no cache
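The blended figure in the tile above can be reproduced with a weighted average. The sketch below assumes that "92/8" means 92% of tokens are billed as input and 8% as output; that reading matches the published $0.62/M, but the page does not spell it out. The function name `blended_rate` is illustrative.

```python
# Rates from the tiles above (USD per 1M tokens).
INPUT_PER_M = 0.40
OUTPUT_PER_M = 3.20


def blended_rate(input_per_m: float, output_per_m: float,
                 input_share: float = 0.92) -> float:
    """Weighted average price per 1M tokens for an input/output token mix.

    input_share=0.92 is an assumed reading of the page's "92/8" split.
    """
    if not 0.0 <= input_share <= 1.0:
        raise ValueError("input_share must be between 0 and 1")
    return input_share * input_per_m + (1.0 - input_share) * output_per_m


rate = blended_rate(INPUT_PER_M, OUTPUT_PER_M)
print(f"${rate:.2f}/M")  # prints $0.62/M
```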
§ 01 / TERMINAL
Run the numbers.
Live calculator pre-loaded with current Qwen 3.5 122B A10B rates. Tweak spend, output mix, or cache assumptions, then share the URL to pass the calculation along.
Inputs: monthly spend ($/mo), workload split, prompt cache hit rate
Outputs: tokens you can process, words equivalent (English), effective rate
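As a rough illustration of what a calculator like this computes, the sketch below turns a monthly budget into a token count. The function names, the 92% input share, and the 0.75 words-per-token ratio are assumptions for illustration, not the page's actual implementation. Because no separate cache price is listed for this model, the cached rate equals the input rate and the cache hit rate has no effect on the result.

```python
INPUT_PER_M = 0.40    # USD per 1M input tokens
OUTPUT_PER_M = 3.20   # USD per 1M output tokens
CACHED_PER_M = 0.40   # no separate cache price listed, so same as input
WORDS_PER_TOKEN = 0.75  # assumption: rough rule of thumb for English prose


def effective_rate(input_share: float = 0.92,
                   cache_hit_rate: float = 0.0) -> float:
    """Blended USD per 1M tokens for a workload split and cache hit rate."""
    # Input tokens are billed at the cached rate for the cached fraction.
    input_cost = ((1.0 - cache_hit_rate) * INPUT_PER_M
                  + cache_hit_rate * CACHED_PER_M)
    return input_share * input_cost + (1.0 - input_share) * OUTPUT_PER_M


def tokens_for_budget(budget_usd: float, rate_per_m: float) -> float:
    """Number of tokens a budget buys at a given blended rate."""
    if rate_per_m <= 0:
        raise ValueError("rate_per_m must be positive")
    return budget_usd / rate_per_m * 1_000_000


rate = effective_rate(input_share=0.92, cache_hit_rate=0.5)
tokens = tokens_for_budget(100.0, rate)
print(f"effective rate: ${rate:.2f}/M")
print(f"tokens: {tokens:,.0f}")
print(f"words (approx.): {tokens * WORDS_PER_TOKEN:,.0f}")
```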
§ 02 / SCENARIOS
Real-world presets.
CHATBOT
Domain assistant
$0.004/turn
RAG
Policy answer
$0.016/query
AGENT
Planning task
$0.031/task
BULK
Report summary
$0.009/doc
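Per-request figures like the presets above come from multiplying token counts by the per-token rates. The sketch below shows that arithmetic; the token counts are hypothetical and are not the page's preset definitions, which are not published here.

```python
INPUT_PER_M = 0.40   # USD per 1M input tokens
OUTPUT_PER_M = 3.20  # USD per 1M output tokens


def request_cost(input_tokens: int, output_tokens: int) -> float:
    """USD cost of one request at the list rates, with no cache discount."""
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    return (input_tokens * INPUT_PER_M
            + output_tokens * OUTPUT_PER_M) / 1_000_000


# Hypothetical request: 2,000 prompt tokens and 500 completion tokens.
cost = request_cost(2_000, 500)
print(f"${cost:.4f} per request")  # prints $0.0024 per request
```

Output tokens cost eight times as much as input tokens at these rates, so the completion length tends to dominate the cost of short-prompt requests.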
§ 03 / TAPE
Price history.
Input · $0.40/M
Output · $3.20/M
Cached · $0.40/M
FEB 15 Qwen 3.5 122B A10B International row listed at $0.40/M input and $3.20/M output
MAY 19 Live verification kept $0.40/M input and $3.20/M output
§ 04 / TOKENIZER
Paste text. See tokens. See cost.
Estimate · qwen-tokenizer-estimate · ≈3.85 chars/token
This is a chars-per-token approximation, not a real tokenizer. Actual tokens vary by language, code density, and tool-call overhead — counts are typically ±10–20% off for English prose, more for code or non-Latin scripts. For exact billing, use the vendor's official tokenizer.
Outputs: characters, words, estimated tokens, cost as uncached input, cost as uncached output, cost as cached input
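The estimate described above can be approximated in a few lines. This sketch uses the ≈3.85 chars/token ratio stated for the estimator; the rounding rule, the whitespace-based word count, and the function names are assumptions, and the result carries the same accuracy caveats as the page's estimator.

```python
CHARS_PER_TOKEN = 3.85  # ratio stated for the page's estimator
INPUT_PER_M = 0.40      # USD per 1M input tokens
OUTPUT_PER_M = 3.20     # USD per 1M output tokens
CACHED_PER_M = 0.40     # no separate cache price listed, so same as input


def estimate_tokens(text: str) -> int:
    """Approximate token count from character length (not a real tokenizer)."""
    return round(len(text) / CHARS_PER_TOKEN)


def estimate_costs(text: str) -> dict:
    """Character, word, and token estimates plus cost at each rate."""
    tokens = estimate_tokens(text)
    return {
        "characters": len(text),
        "words": len(text.split()),  # whitespace-delimited words
        "tokens": tokens,
        "input_usd": tokens * INPUT_PER_M / 1_000_000,
        "output_usd": tokens * OUTPUT_PER_M / 1_000_000,
        "cached_input_usd": tokens * CACHED_PER_M / 1_000_000,
    }


for key, value in estimate_costs("Paste text. See tokens. See cost.").items():
    print(key, value)
```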
| Model | Input /M | Output /M | Effective blended /M | Context | Best for |
|---|---|---|---|---|---|
| Qwen 3.5 122B A10B (current) | $0.40 | $3.20 | $0.62 (agentic 92/8) | 256K | Open MoE production chat |
| Qwen3 Max | $1.20 | $6.00 | $1.58 (pricier) | 252K | Frontier Qwen proprietary reasoning |
| Qwen 3.5 Plus | $0.40 | $2.40 | $0.56 (cheaper) | 256K | General Qwen production workloads |
| Qwen 3.5 Flash | $0.10 | $0.40 | $0.12 (cheaper) | 1M | Cheap long-context Qwen traffic |
| Qwen3 235B A22B | $0.70 | $2.80 | $0.87 (pricier) | 131K | Open MoE reasoning baseline |
| Qwen3 32B | $0.16 | $0.64 | $0.20 (cheaper) | 131K | Open 32B chat and reasoning |
| Qwen3 14B | $0.35 | $1.40 | $0.43 (cheaper) | 131K | Compact open Qwen reasoning |
| Qwen 3.5 397B A17B | $0.60 | $3.60 | $0.84 (pricier) | 256K | Open MoE frontier workloads |
| QwQ 32B | $0.29 | $0.86 | $0.33 (cheaper) | 131K | Open reasoning on a budget |
| GPT-5.4 mini | $0.75 (cache $0.07) | $4.50 | $0.54 (cheaper) | 400K | Coding and computer-use workloads |
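The "cheaper" and "pricier" labels in the table appear to compare each model's blended rate against the current model's. The sketch below reproduces that comparison for three rows, assuming a 92% input and 8% output weighting and no cache discount. Rows with a separate cache price, such as GPT-5.4 mini, depend on a cache hit assumption the page does not state, so they are left out. All names are illustrative.

```python
# (model, input USD/M, output USD/M), copied from the table above.
CURRENT = ("Qwen 3.5 122B A10B", 0.40, 3.20)
OTHERS = [
    ("Qwen3 Max", 1.20, 6.00),
    ("Qwen 3.5 Plus", 0.40, 2.40),
    ("Qwen 3.5 Flash", 0.10, 0.40),
]


def blend(input_per_m: float, output_per_m: float,
          input_share: float = 0.92) -> float:
    """Blended USD per 1M tokens; the 92/8 weighting is an assumption."""
    return input_share * input_per_m + (1.0 - input_share) * output_per_m


def compare(current, others):
    """Label each model cheaper, pricier, or same relative to the current one."""
    base = blend(current[1], current[2])
    labels = {}
    for name, input_per_m, output_per_m in others:
        rate = blend(input_per_m, output_per_m)
        if rate > base:
            label = "pricier"
        elif rate < base:
            label = "cheaper"
        else:
            label = "same"
        labels[name] = (round(rate, 2), label)
    return labels


for name, (rate, label) in compare(CURRENT, OTHERS).items():
    print(f"{name}: ${rate:.2f}/M ({label})")
```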
Frequently asked.
Practical pricing questions, separated from calculator assumptions and regional tiers.
Q · 01 What is Qwen 3.5 122B A10B priced at?
Qwen 3.5 122B A10B is shown at $0.40/M input and $3.20/M output in Alibaba Cloud Model Studio's International/Singapore deployment section.
Q · 02 Does this page use International or Global pricing?
This page uses Alibaba Cloud Model Studio International deployment pricing, where endpoint and data storage are in Singapore and inference resources are dynamically scheduled globally excluding Chinese Mainland. Global and Chinese Mainland sections can list different prices.
Q · 03 Is prompt caching priced separately?
Alibaba marks context-cache support on some Qwen families, but this row does not publish a concrete cache-read dollar price. The calculator therefore treats cached input as the same $0.40/M baseline instead of inventing a discount.
Q · 04 How is the effective price calculated?
AI//COST uses the same 92/8 agentic blend everywhere. With no separate cache-read price published for this row, weighting input at 92% and output at 8% gives Qwen 3.5 122B A10B an effective blended cost of 0.92 × $0.40 + 0.08 × $3.20 ≈ $0.62/M.
Q · 05 Is there a free quota or batch discount?
Alibaba lists a 90-day activation free quota for many International Qwen rows, but quota eligibility is model-specific. Batch Invocation is 50% off where the vendor row explicitly marks Batch support; this page only publishes the real-time list prices in the quote tiles.
Q · 06 Are regional prices different?
Yes. Alibaba Cloud publishes separate International, Global, China (Hong Kong), EU, US, and Chinese Mainland deployment sections. AI//COST uses the International/Singapore baseline for Alibaba queue pages unless a page explicitly says otherwise.