Qwen Max API Pricing
Qwen Max is the legacy proprietary Qwen 2.5 Max listing, kept for backward-compatible workloads. Alibaba still lists it at $1.60/M input and $6.40/M output, while Qwen3 Max is the current replacement. Rates are pulled directly from alibabacloud.com daily.
Run the numbers.
Live calculator pre-loaded with current Qwen Max rates. Tweak spend, output mix, or cache assumptions, then share the URL to pass the calculation along.
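As a rough illustration of the spend side of such a calculation, the sketch below turns a budget into an approximate token volume at the Qwen Max rates listed on this page. The function name `tokens_for_budget` and its parameters are hypothetical, and it assumes no caching discount and a single fixed output share; it is not the calculator's actual code.

```python
# Rates from this page, in USD per million tokens.
INPUT_PER_M = 1.60
OUTPUT_PER_M = 6.40


def tokens_for_budget(budget_usd: float, output_share: float) -> float:
    """Return how many million tokens (input + output) a budget covers.

    output_share is the fraction of all tokens that are output tokens,
    e.g. 0.08 for an input-heavy mix. Assumes no cache discount.
    """
    if budget_usd < 0:
        raise ValueError("budget_usd must be non-negative")
    if not 0.0 <= output_share <= 1.0:
        raise ValueError("output_share must be between 0 and 1")
    # Price of one million tokens at this input/output mix.
    price_per_m = (1.0 - output_share) * INPUT_PER_M + output_share * OUTPUT_PER_M
    return budget_usd / price_per_m


if __name__ == "__main__":
    for share in (0.0, 0.08, 0.5):
        millions = tokens_for_budget(100.0, share)
        print(f"$100 at {share:.0%} output: about {millions:.1f}M tokens")
```

Raising the output share shrinks the token volume quickly, because output tokens cost four times as much as input tokens at these rates.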
Real-world presets.
Invoice replay
Older app assistant
Compatibility query
A/B comparison
Paste text. See tokens. See cost.
This is a chars-per-token approximation, not a real tokenizer. Actual tokens vary by language, code density, and tool-call overhead — counts are typically ±10–20% off for English prose, more for code or non-Latin scripts. For exact billing, use the vendor's official tokenizer.
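A chars-per-token estimate of this kind can be sketched in a few lines. The ratio of 4 characters per token is an assumption (a common rule of thumb for English prose, not a figure stated on this page), and the function names are hypothetical. The ±20% band mirrors the upper end of the error range quoted above.

```python
import math

# Assumption: roughly 4 characters per token for English prose.
CHARS_PER_TOKEN = 4.0
# Qwen Max input rate from this page, in USD per million tokens.
INPUT_PRICE_PER_M = 1.60


def estimate_tokens(text: str, chars_per_token: float = CHARS_PER_TOKEN) -> int:
    """Approximate the token count from the character count alone."""
    if not text:
        return 0
    return math.ceil(len(text) / chars_per_token)


def estimate_input_cost(text: str, price_per_m: float = INPUT_PRICE_PER_M) -> float:
    """Approximate input cost in USD for sending this text as a prompt."""
    return estimate_tokens(text) * price_per_m / 1_000_000


def token_range(text: str, tolerance: float = 0.20) -> tuple[int, int]:
    """Return a (low, high) band around the estimate to reflect its uncertainty."""
    n = estimate_tokens(text)
    return math.floor(n * (1 - tolerance)), math.ceil(n * (1 + tolerance))


if __name__ == "__main__":
    sample = "Paste text. See tokens. See cost."
    low, high = token_range(sample)
    print(f"~{estimate_tokens(sample)} tokens (roughly {low} to {high})")
    print(f"~${estimate_input_cost(sample):.8f} as input")
```

For code or non-Latin scripts the ratio would need to be lower than 4, which is why a real tokenizer remains the only reliable source for billing.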
| Model | Input /M | Output /M | Effective blended /M (agentic 92/8 input/output mix) | Context | Best for |
|---|---|---|---|---|---|
| Qwen Max (2.5), this page | $1.60 | $6.40 | $1.98 | 32K | Legacy Qwen 2.5 compatibility |
| Qwen3 Max | $1.20 | $6.00 | $1.58 cheaper | 252K | Frontier Qwen proprietary reasoning |
| Qwen 3.5 Plus | $0.40 | $2.40 | $0.56 cheaper | 256K | General Qwen production workloads |
| Qwen 3.5 Flash | $0.10 | $0.40 | $0.12 cheaper | 1M | Cheap long-context Qwen traffic |
| Qwen3 235B A22B | $0.70 | $2.80 | $0.87 cheaper | 131K | Open MoE reasoning baseline |
| Qwen3 32B | $0.16 | $0.64 | $0.20 cheaper | 131K | Open 32B chat and reasoning |
| Qwen3 14B | $0.35 | $1.40 | $0.43 cheaper | 131K | Compact open Qwen reasoning |
| Qwen 3.5 397B A17B | $0.60 | $3.60 | $0.84 cheaper | 256K | Open MoE frontier workloads |
| Qwen 3.5 122B A10B | $0.40 | $3.20 | $0.62 cheaper | 256K | Open MoE production chat |
Frequently asked.
Practical pricing questions, separated from calculator assumptions and regional tiers.
Q · 01 What is Qwen Max priced at?
$1.60/M input and $6.40/M output in Alibaba Cloud Model Studio's International/Singapore deployment section.
Q · 02 Does this page use International or Global pricing?
Q · 03 Is prompt caching priced separately?
$1.6/M baseline instead of inventing a discount.
Q · 04 How is the effective price calculated?
$1.98/M.
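The $1.98/M figure is consistent with a weighted average of the input and output rates at the 92/8 agentic mix shown in the comparison table. The sketch below reproduces that arithmetic; the function name `blended_price` is hypothetical, and it assumes the mix means 92% input tokens and 8% output tokens with no cache discount.

```python
def blended_price(input_per_m: float, output_per_m: float,
                  input_share: float = 0.92) -> float:
    """Weighted average price per million tokens for a given input share."""
    if not 0.0 <= input_share <= 1.0:
        raise ValueError("input_share must be between 0 and 1")
    return input_share * input_per_m + (1.0 - input_share) * output_per_m


# (input, output) rates in USD per million tokens, taken from the table on this page.
RATES = {
    "Qwen Max (2.5)": (1.60, 6.40),
    "Qwen3 Max": (1.20, 6.00),
    "Qwen 3.5 Flash": (0.10, 0.40),
}

if __name__ == "__main__":
    for name, (inp, out) in RATES.items():
        print(f"{name}: ${blended_price(inp, out):.2f}/M blended")
```

For Qwen Max this works out to 0.92 × $1.60 + 0.08 × $6.40 = $1.472 + $0.512 = $1.984, which rounds to $1.98/M.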