Qwen2.5 72B Instruct API Pricing
Qwen2.5 72B Instruct is Alibaba's legacy open-weight dense flagship from the Qwen2.5 family, listed here for compatibility and invoice checks. Alibaba lists the International/Singapore baseline at $1.40/M input tokens and $5.60/M output tokens; for newer workloads, compare Qwen3 235B A22B. Prices are pulled directly from alibabacloud.com daily.
Run the numbers.
Live calculator pre-loaded with current Qwen2.5 72B Instruct rates. Tweak spend, output mix, or cache assumptions, then share the URL to pass the calculation along.
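The arithmetic behind a calculator like this can be sketched in a few lines. This is a minimal sketch, not the page's actual calculator: the function name `request_cost` and the sample token counts are hypothetical, and only the $1.40/M and $5.60/M rates come from the listing above. Cache assumptions are left out because this page shows no separate cached rate for the model.

```python
# Rates from the listing above, in USD per million tokens.
INPUT_PER_M = 1.40
OUTPUT_PER_M = 5.60


def request_cost(input_tokens: int, output_tokens: int) -> float:
    """Return the USD cost of a workload at the listed rates."""
    return (input_tokens * INPUT_PER_M + output_tokens * OUTPUT_PER_M) / 1_000_000


# Hypothetical workload: 10M input tokens and 1M output tokens.
cost = request_cost(10_000_000, 1_000_000)
print(f"${cost:.2f}")  # prints $19.60
```

Because output tokens cost four times as much as input tokens here, the output mix tends to move the total more than the same change in input volume.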
Real-world presets.
Support assistant
Knowledge base answer
Repository review
Intent routing
Price history.
Paste text. See tokens. See cost.
This estimate uses a chars-per-token approximation, not a real tokenizer. Actual token counts vary by language, code density, and tool-call overhead; estimates are typically off by ±10–20% for English prose, and by more for code or non-Latin scripts. For exact billing, use the vendor's official tokenizer.
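A chars-per-token estimate of this kind can be sketched as below. The ratio of 4 characters per token is an assumption (a common rule of thumb for English prose), not the value this page uses, and `estimate_tokens` and `estimate_input_cost` are hypothetical names.

```python
import math

# Assumed rule of thumb for English prose; not the page's actual ratio.
CHARS_PER_TOKEN = 4.0
# Input rate from the listing, in USD per million tokens.
INPUT_PER_M = 1.40


def estimate_tokens(text: str, chars_per_token: float = CHARS_PER_TOKEN) -> int:
    """Approximate the token count from the character count, rounding up."""
    if not text:
        return 0
    return math.ceil(len(text) / chars_per_token)


def estimate_input_cost(text: str) -> float:
    """Approximate the USD cost of sending `text` as input."""
    return estimate_tokens(text) * INPUT_PER_M / 1_000_000


sample = "How much does this prompt cost?"
print(estimate_tokens(sample), f"${estimate_input_cost(sample):.8f}")
```

Code and non-Latin scripts usually pack fewer characters into each token, so a lower `chars_per_token` would be the safer guess for those inputs.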
| Model | Input ($/M) | Output ($/M) | Effective blended ($/M) | Context | Best for |
|---|---|---|---|---|---|
| Qwen 2.5 72B Instruct (current) | $1.40 | $5.60 | $1.74 (agentic 92/8) | 131K | Legacy 72B open Qwen compatibility |
| Qwen 2.5 32B Instruct | $0.70 | $2.80 | $0.87 (cheaper) | 131K | Legacy 32B open chat workloads |
| Qwen 2.5 14B Instruct | $0.35 | $1.40 | $0.43 (cheaper) | 131K | Legacy compact open Qwen apps |
| Qwen 2.5 7B Instruct | $0.17 | $0.70 | $0.22 (cheaper) | 131K | Legacy small open Qwen deployments |
| Qwen3 235B A22B | $0.70 | $2.80 | $0.87 (cheaper) | 131K | Current open MoE reasoning baseline |
| Qwen3 32B | $0.16 | $0.64 | $0.20 (cheaper) | 131K | Current open 32B chat and reasoning |
| Qwen Max (2.5) | $1.60 | $6.40 | $1.98 (pricier) | 32K | Comparable production workloads |
| GPT-5.4 mini | $0.75 (cache $0.07) | $4.50 | $0.54 (cheaper) | 400K | OpenAI mini coding and CUA |
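The effective blended column is consistent with a weighted average of the input and output rates. The sketch below assumes that "agentic 92/8" means 92% input tokens and 8% output tokens; that reading is inferred from the table's numbers rather than stated by the source, and `blended_price` is a hypothetical helper.

```python
def blended_price(input_per_m: float, output_per_m: float,
                  input_share: float = 0.92) -> float:
    """Weighted average price per million tokens for a given input share."""
    return input_share * input_per_m + (1.0 - input_share) * output_per_m


# (input, output) rates in USD per million tokens, copied from the table.
rates = {
    "Qwen 2.5 72B Instruct": (1.40, 5.60),
    "Qwen3 235B A22B": (0.70, 2.80),
    "Qwen Max (2.5)": (1.60, 6.40),
}

baseline = blended_price(*rates["Qwen 2.5 72B Instruct"])
for name, (inp, out) in rates.items():
    price = blended_price(inp, out)
    if price < baseline:
        label = "cheaper"
    elif price > baseline:
        label = "pricier"
    else:
        label = "baseline"
    print(f"{name}: ${price:.2f} ({label})")
```

Under this assumption the three rows come out at $1.74, $0.87, and $1.98, matching the table. The GPT-5.4 mini row also lists a cache rate, so its blended figure presumably depends on a cache assumption that this sketch does not model.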
Frequently asked.
Practical pricing questions, separated from calculator assumptions and regional tiers.
Q · 01 What is Qwen2.5 72B Instruct priced at?
$1.4/M input and $5.6/M output in Alibaba Cloud Model Studio's International/Singapore deployment section. The page stores USD per-million-token pricing.
Q · 02 What replaced Qwen2.5 72B Instruct?
Q · 03 Does this page use International or Global pricing?
Q · 04 Is prompt caching priced separately?
$1.4/M baseline instead of inventing a discount.
Q · 05 How is the effective price calculated?
$1.74/M.