Qwen2.5 VL 72B Instruct API Pricing
Qwen2.5 VL 72B Instruct is Alibaba's legacy Qwen2.5 vision-language flagship, listed here for compatibility and invoice checks. Alibaba lists the International/Singapore baseline at $2.80/M input tokens and $8.40/M output tokens; newer workloads should compare Qwen3 VL Plus or Qwen3 VL Flash. Prices are pulled directly from alibabacloud.com daily.
Run the numbers.
Live calculator pre-loaded with current Qwen2.5 VL 72B Instruct rates. Tweak spend, output mix, or cache assumptions, then share the URL to pass the calculation along.
Real-world presets.
Invoice extraction
Screenshot inspection
Dashboard explain
Visual knowledge answer
Paste text. See tokens. See cost.
This is a chars-per-token approximation, not a real tokenizer. Actual tokens vary by language, code density, and tool-call overhead — counts are typically ±10–20% off for English prose, more for code or non-Latin scripts. For exact billing, use the vendor's official tokenizer.
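As a rough illustration of the approximation described above, the estimate can be sketched in a few lines. The 4-characters-per-token ratio is an assumption (a common rule of thumb for English prose, not a value stated on this page), and the function names are hypothetical; the rates are the ones listed here.

```python
import math

# Assumed ratio: roughly 4 characters per token for English prose.
# This is a rule of thumb, not a value taken from this page.
CHARS_PER_TOKEN = 4.0

# Rates listed on this page, in USD per million tokens.
INPUT_USD_PER_M = 2.80
OUTPUT_USD_PER_M = 8.40


def estimate_tokens(text: str, chars_per_token: float = CHARS_PER_TOKEN) -> int:
    """Approximate a token count from character length (not a real tokenizer)."""
    if not text:
        return 0
    return math.ceil(len(text) / chars_per_token)


def estimate_cost_usd(tokens: int, usd_per_million: float) -> float:
    """Convert a token count to USD at a per-million-token rate."""
    return tokens * usd_per_million / 1_000_000


if __name__ == "__main__":
    prompt = "Extract the invoice number, total, and due date from this scan."
    tokens = estimate_tokens(prompt)
    print(tokens, estimate_cost_usd(tokens, INPUT_USD_PER_M))
```

Because the ratio is only a heuristic, code or non-Latin text may need a smaller `chars_per_token` value; image inputs are not covered by this sketch at all.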
| Model | Input /M | Output /M | Effective blended | Context | Best for |
|---|---|---|---|---|---|
| Qwen2.5 VL 72B Instruct (current) | $2.80 | $8.40 | $3.25 (agentic 92/8) | 131K | Legacy visual reasoning compatibility |
| Qwen3 VL Plus | $0.20 | $1.60 | $0.31 (cheaper) | 256K | Current vision and document understanding |
| Qwen3 VL Flash | $0.05 | $0.40 | $0.08 (cheaper) | 256K | Cheap high-volume vision tasks |
| Qwen VL Max | $0.80 | $3.20 | $0.99 (cheaper) | 128K | Legacy Qwen vision flagship |
| Qwen VL Plus | $0.21 | $0.63 | $0.24 (cheaper) | 128K | Legacy low-cost vision apps |
| Qwen3 235B A22B | $0.70 | $2.80 | $0.87 (cheaper) | 131K | Current open MoE reasoning baseline |
| GPT-5.4 mini | $0.75 (cache $0.07) | $4.50 | $0.54 (cheaper) | 400K | OpenAI mini coding and CUA |
| Gemini 2.5 Flash | $0.30 (cache $0.03) | $2.50 | $0.27 (cheaper) | 1M | Google long-context Flash workloads |
Frequently asked.
Practical pricing questions, separated from calculator assumptions and regional tiers.
Q · 01 What is Qwen2.5 VL 72B Instruct priced at?
$2.8/M input and $8.4/M output in Alibaba Cloud Model Studio's International/Singapore deployment section. The page stores USD per-million-token pricing.
Q · 02 What replaced Qwen2.5 VL 72B Instruct?
Q · 03 Does this page use International or Global pricing?
Q · 04 Is prompt caching priced separately?
$2.8/M baseline instead of inventing a discount.
Q · 05 How is the effective price calculated?
$3.25/M.
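The $3.25/M effective figure is consistent with weighting the listed rates 92% input to 8% output, which is how the "agentic 92/8" label in the comparison table appears to be meant. The sketch below works under that assumption; the function name and the default share are illustrative, not taken from the page's calculator.

```python
def blended_price(input_usd_per_m: float,
                  output_usd_per_m: float,
                  input_share: float = 0.92) -> float:
    """Weighted average of input and output rates, in USD per million tokens.

    input_share is the assumed fraction of tokens billed as input;
    the remainder is billed as output. Cache discounts are ignored.
    """
    if not 0.0 <= input_share <= 1.0:
        raise ValueError("input_share must be between 0 and 1")
    return input_share * input_usd_per_m + (1.0 - input_share) * output_usd_per_m


if __name__ == "__main__":
    # Qwen2.5 VL 72B Instruct: 0.92 * 2.80 + 0.08 * 8.40 = 3.248
    print(round(blended_price(2.80, 8.40), 2))  # prints 3.25
```

Since the sketch ignores cached-input pricing, it will not reproduce the blended figures for rows that list a cache rate.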