Qwen2.5 14B Instruct API Pricing
Qwen2.5 14B Instruct is Alibaba's legacy open-weight 14B Qwen2.5 row for compatibility and invoice checks. Alibaba lists the International/Singapore baseline at $0.35/M input and $1.4/M output; newer workloads should compare Qwen3 14B. Pulled directly from alibabacloud.com daily.
Run the numbers.
Live calculator pre-loaded with current Qwen2.5 14B Instruct rates. Tweak spend, output mix, or cache assumptions and share the URL to share the calculation.
Real-world presets.
Support assistant
Knowledge base answer
Repository review
Intent routing
Price history.
Paste text. See tokens. See cost.
This is a chars-per-token approximation, not a real tokenizer. Actual tokens vary by language, code density, and tool-call overhead — counts are typically ±10–20% off for English prose, more for code or non-Latin scripts. For exact billing, use the vendor's official tokenizer.
| Model | Input /M | Output /M | Effective blended | Context | Best for |
|---|---|---|---|---|---|
| Qwen 2.5 14B Instruct Current | $0.35 | $1.40 | $0.43 agentic 92/8 | 131K | Legacy compact open Qwen apps |
| Qwen 2.5 32B Instruct | $0.70 | $2.80 | $0.87 pricier | 131K | Legacy 32B open chat workloads |
| Qwen 2.5 7B Instruct | $0.17 | $0.70 | $0.22 cheaper | 131K | Legacy small open Qwen deployments |
| Qwen3 14B | $0.35 | $1.40 | $0.43 same effective | 131K | Current compact open Qwen reasoning |
| Qwen3 32B | $0.16 | $0.64 | $0.20 cheaper | 131K | Current open 32B chat and reasoning |
| Qwen 3.5 Flash | $0.10 | $0.40 | $0.12 cheaper | 1M | Cheap long-context Qwen traffic |
| GPT-5.4 mini | $0.75 cache $0.07 | $4.50 | $0.54 pricier | 400K | OpenAI mini coding and CUA |
Frequently asked.
Practical pricing questions, separated from calculator assumptions and regional tiers.
Q · 01 What is Qwen2.5 14B Instruct priced at? +
$0.35/M input and $1.4/M output in Alibaba Cloud Model Studio's International/Singapore deployment section. The page stores USD per-million-token pricing.Q · 02 What replaced Qwen2.5 14B Instruct? +
Q · 03 Does this page use International or Global pricing? +
Q · 04 Is prompt caching priced separately? +
$0.35/M baseline instead of inventing a discount.Q · 05 How is the effective price calculated? +
$0.43/M.