Qwen VL Max API Pricing
Qwen VL Max is the legacy flagship Qwen vision-language row for complex visual reasoning. Alibaba lists the International/Singapore baseline at $0.8/M input and $3.2/M output; the current snapshot is qwen-vl-max-2025-08-13. Pulled directly from alibabacloud.com daily.
Run the numbers.
Live calculator pre-loaded with current Qwen VL Max rates. Tweak spend, output mix, or cache assumptions and share the URL to share the calculation.
Real-world presets.
Invoice extraction
Screenshot inspection
Clip frame review
Dashboard explain
Price history.
Paste text. See tokens. See cost.
This is a chars-per-token approximation, not a real tokenizer. Actual tokens vary by language, code density, and tool-call overhead — counts are typically ±10–20% off for English prose, more for code or non-Latin scripts. For exact billing, use the vendor's official tokenizer.
| Model | Input /M | Output /M | Effective blended | Context | Best for |
|---|---|---|---|---|---|
| Qwen VL Max Current | $0.80 | $3.20 | $0.99 agentic 92/8 | 128K | Legacy visual reasoning workloads |
| Qwen3 VL Plus | $0.20 | $1.60 | $0.31 cheaper | 256K | Current vision and document understanding |
| Qwen3 VL Flash | $0.05 | $0.40 | $0.08 cheaper | 256K | Cheap high-volume vision tasks |
| Qwen VL Plus | $0.21 | $0.63 | $0.24 cheaper | 32K | Legacy low-cost vision apps |
| Qwen 3.5 Plus | $0.40 | $2.40 | $0.56 cheaper | 256K | Current Qwen production default |
| GPT-5.4 mini | $0.75 cache $0.07 | $4.50 | $0.54 cheaper | 400K | OpenAI mini coding and CUA |
| Gemini 2.5 Flash | $0.30 cache $0.03 | $2.50 | $0.27 cheaper | 1M | Google long-context Flash workloads |
Frequently asked.
Practical pricing questions, separated from calculator assumptions and regional tiers.
Q · 01 What is Qwen VL Max priced at? +
$0.8/M input and $3.2/M output in Alibaba Cloud Model Studio's International/Singapore deployment section. The page stores USD per-million-token pricing.Q · 02 What replaced Qwen VL Max? +
Q · 03 Does this page use International or Global pricing? +
Q · 04 Is prompt caching priced separately? +
$0.8/M baseline instead of inventing a discount.Q · 05 How is the effective price calculated? +
$0.99/M.