Qwen2.5 Coder 32B Instruct API Pricing
Qwen2.5 Coder 32B Instruct is Alibaba's legacy Qwen2.5 code-specialized 32B row for compatibility and invoice checks. The live vendor table lists $0.287/M input and $0.861/M output. Pulled directly from alibabacloud.com daily.
Run the numbers.
Live calculator pre-loaded with current Qwen2.5 Coder 32B Instruct rates. Tweak spend, output mix, or cache assumptions and share the URL to share the calculation.
Real-world presets.
Repo implementation
Pull request review
Knowledge base answer
Support assistant
Price history.
Paste text. See tokens. See cost.
This is a chars-per-token approximation, not a real tokenizer. Actual tokens vary by language, code density, and tool-call overhead — counts are typically ±10–20% off for English prose, more for code or non-Latin scripts. For exact billing, use the vendor's official tokenizer.
| Model | Input /M | Output /M | Effective blended | Context | Best for |
|---|---|---|---|---|---|
| Qwen2.5 Coder 32B Instruct Current | $0.29 | $0.86 | $0.33 agentic 92/8 | 131K | Legacy code-specialized Qwen 2.5 |
| Qwen3 Coder Plus | $1.00 | $5.00 | $1.32 pricier | 1M | Current Qwen code flagship |
| Qwen3 Coder Flash | $0.30 | $1.50 | $0.40 pricier | 1M | Cheap current code workloads |
| Qwen2.5 32B Instruct | $0.70 | $2.80 | $0.87 pricier | 131K | Legacy 32B general chat |
| Qwen3 32B | $0.16 | $0.64 | $0.20 cheaper | 131K | Current open 32B reasoning |
| DeepSeek V4 Flash | $0.14 cache $0.00 | $0.28 | $0.05 cheaper | 1M | Budget coding and reasoning |
Frequently asked.
Practical pricing questions, separated from calculator assumptions and regional tiers.
Q · 01 What is Qwen2.5 Coder 32B Instruct priced at? +
$0.287/M input and $0.861/M output on the live Alibaba Cloud pricing table. This page stores USD per-million-token pricing.Q · 02 How is the effective price calculated? +
$0.33/M.Q · 03 Is prompt caching priced separately? +
$0.287/M baseline instead of inventing a discount.Q · 04 Are regional prices different? +
Q · 05 Is there a batch discount? +
Q · 06 How accurate is the tokenizer estimate? +
qwen-tokenizer-estimate chars-per-token estimate for English text. It is useful for rough planning, but actual billing comes from the vendor API usage fields and can differ for Chinese, code, or mixed-language prompts.