Last verified
OPEN MOE131K CONTEXTTHINKING MODEAPACHE 2.0

Qwen3 235B A22B API Pricing

Qwen3 235B A22B is Alibaba's open-weight frontier MoE with 235B total parameters and 22B active per token. Baseline International/Singapore rates are $0.7/M input and $2.8/M output. Pulled directly from alibabacloud.com daily.

Input - per 1M tokens
$0.70/M
Source Alibaba Cloud flat
Output - per 1M tokens
$2.80/M
Context 131K flat
Cache N/A
$0.70/M
Cache not listed not listed
Effective - agentic blend
$0.87/M
92/8 split - 82% cache
§ 01 / TERMINAL

Run the numbers.

Live calculator pre-loaded with current Qwen3 235B A22B rates. Tweak spend, output mix, or cache assumptions and share the URL to share the calculation.

$ /mo
Workload split
Prompt cache hit rate
Tokens you can process
Words equivalent (English)
Effective rate
§ 02 / SCENARIOS

Real-world presets.

§ 03 / TAPE

Price history.

Listed at $0.7/M input and $2.8/M output on Alibaba's International baseline.

Input · $0.70/M
Output · $2.8/M
Cached · $0.70/M
APR 29 Open-weight Qwen3 baseline $0.70/M input and $2.80/M non-thinking outputMAY 19 Live verification kept $0.7/M input and $2.8/M output
§ 04 / TOKENIZER

Paste text. See tokens. See cost.

Estimate · qwen-tokenizer-estimate · ≈3.85 chars/token Auto-counts as you type

This is a chars-per-token approximation, not a real tokenizer. Actual tokens vary by language, code density, and tool-call overhead — counts are typically ±10–20% off for English prose, more for code or non-Latin scripts. For exact billing, use the vendor's official tokenizer.

Characters
Words
Tokens (estimated)
Cost as input · uncached
Cost as output · uncached
Cost as cached input
§ 05 / SHELF

Up against the shelf.

All models →
Model Input /M Output /M Effective blended Context Best for
Qwen3 235B A22B Current $0.70 $2.80 $0.87 agentic 92/8 131K Open MoE reasoning baseline
Qwen3 Max $1.20 $6.00 $1.58 pricier 252K Frontier Qwen proprietary reasoning
Qwen 3.5 Plus $0.40 $2.40 $0.56 cheaper 256K General Qwen production workloads
Qwen 3.5 Flash $0.10 $0.40 $0.12 cheaper 1M Cheap long-context Qwen traffic
Qwen3 Coder Plus $1.00 $5.00 $1.32 pricier 1M Agentic coding and code review
Qwen3 Coder Flash $0.30 $1.50 $0.40 cheaper 1M High-volume coding assistants
QwQ Plus $0.80 $2.40 $0.93 pricier 131K Proprietary reasoning workloads
QwQ 32B $0.29 $0.86 $0.33 cheaper 131K Open reasoning on a budget
GPT-5.4 mini $0.75 cache $0.07 $4.50 $0.54 cheaper 400K Coding and computer-use workloads
Gemini 2.5 Flash $0.30 cache $0.03 $2.50 $0.27 cheaper 1M Low-latency multimodal RAG
DeepSeek V4 Flash $0.14 cache $0.00 $0.28 $0.05 cheaper 1M Ultra-cheap API throughput

Frequently asked.

Practical pricing questions, separated from calculator assumptions and regional tiers.

Q · 01 What is Qwen3 235B A22B priced at? +
Qwen3 235B A22B is shown at $0.7/M input and $2.8/M output on Alibaba Cloud Model Studio's International/Singapore deployment section.
Q · 02 Does this page include higher context pricing tiers? +
The quote tiles use the baseline tier for the queue. Alibaba publishes higher long-context tiers for some Qwen rows; for example Qwen3 Coder Plus rises above the 0-32K band, and Qwen3 235B A22B has a separate thinking-mode output price where applicable.
Q · 03 Is prompt caching priced separately? +
The Alibaba table separates Qwen3 235B A22B non-thinking and thinking-mode output. This page uses the baseline non-thinking output price $2.80/M; thinking-mode CoT plus response is listed separately at $8.40/M.
Q · 04 How is the effective price calculated? +
AI//COST uses the same 92/8 agentic blend everywhere. With no exact cache-read dollar price published for this row, Qwen3 235B A22B's effective blended cost is $0.87/M.
Q · 05 Is there a free quota or batch discount? +
Alibaba lists a 90-day activation free quota for many International Qwen rows, but not every open-source row includes one. Batch and context-cache support are model-specific; this page only publishes prices that are explicit in the vendor table.
Q · 06 Are regional prices different? +
Yes. Alibaba Cloud publishes separate International, Global, China (Hong Kong), EU, US, and Chinese Mainland deployment sections. AI//COST uses the International/Singapore baseline for Alibaba queue pages unless the page says otherwise.