Last verified 2026-07-11

LOW COST1M CONTEXTBATCH -50%TEXT + VISION

Qwen 3.5 Flash API Pricing

Q: What is Qwen 3.5 Flash priced at?

Qwen 3.5 Flash is shown at $0.1/M input and $0.4/M output. The page stores USD per-million-token baseline pricing from alibabacloud.com.

Qwen 3.5 Flash is Qwen 3.5 cheap long-context tier. Baseline rates are $0.1/M input and $0.4/M output. Pulled directly from alibabacloud.com and re-verified against the pricing page.

Input - per 1M tokens

$0.10/M

Source alibabacloud.com flat

Output - per 1M tokens

$0.40/M

Context 1M flat

Cache N/A

$0.10/M

Cache vendor row not listed

Effective - agentic blend

$0.12/M

92/8 split - 82% cache

§ 01 / TERMINAL

Run the numbers.

Live calculator pre-loaded with Qwen 3.5 Flash rates. Tweak spend, output mix, or cache assumptions to compare it with sibling models.

Spend

$ /mo

Workload split

Prompt cache hit rate

Tokens you can process

—

Words equivalent (English)

—

Effective rate

—

Open full calculator (all models · share URL · CSV) →

§ 02 / SCENARIOS

Real-world presets.

VISION

Invoice extraction

$0.001/doc

6,000 in - 800 out~111,111 units/$100

CHATBOT

Support assistant

$0.001/turn

2,500 in - 600 out~200,000 units/$100

RAG

Knowledge base answer

$0.001/query

9,000 in - 1,000 out~76,923 units/$100

BULK

Image QA review

$0.001/item

3,000 in - 400 out~200,000 units/$100

§ 03 / TOKENIZER

Paste text. See tokens. See cost.

Your text · live count

Calibrated · measured on the vendor's tokenizer · 2026-06-10 Auto-counts as you type

Counts use a chars-per-token calibration measured on the vendor's own published tokenizer (Qwen/Qwen3.5-397B-A17B, 2026-06-10). English prose is typically within a few percent; code and non-Latin scripts tokenize heavier. For billing-exact counts use the vendor's count-tokens API.

Characters 381

Words 61

Tokens (estimated) 73 tokens

Cost as input · uncached $0.000007 USD

Cost as output · uncached $0.000029 USD

Cost as cached input $0.000007 USD

§ 04 / SHELF

Up against the shelf.

All models →

Model	Input /M	Output /M	Effective blended	Context	Best for
Qwen 3.5 Flash Current	$0.10	$0.40	$0.12 agentic 92/8	1M	Bulk chat and long-context RAG
Qwen3 Max	$1.20	$6.00	$1.58 pricier	252K	Frontier Qwen reasoning
Qwen 3.5 Plus	$0.40	$2.40	$0.56 pricier	256K	General Qwen production workloads
Qwen 3.5 Flash Current	$0.10	$0.40	$0.12 agentic 92/8	1M	Bulk chat and long-context RAG
Qwen3 VL Plus	$0.20	$1.60	$0.31 pricier	256K	Vision and document understanding
Qwen3 VL Flash	$0.05	$0.40	$0.08 cheaper	256K	Vision and document understanding
GPT-5.4 mini	$0.75 cache $0.07	$4.50	$0.54 pricier	400K	Mistral production workloads
Gemini 2.5 Flash	$0.30 cache $0.03	$2.50	$0.27 pricier	1M	Mistral production workloads

Frequently asked.

Practical pricing questions, separated from calculator assumptions.

Q · 01 What is Qwen 3.5 Flash priced at? +

Qwen 3.5 Flash is shown at $0.1/M input and $0.4/M output. The page stores USD per-million-token baseline pricing from alibabacloud.com.

Q · 02 Does this page include higher context pricing tiers? +

Alibaba publishes tiered pricing for several Qwen models. This page uses the baseline Singapore / International tier from the queue and snapshot; higher-token tiers are noted in the source page and can be added as a variant later.

Q · 03 Is prompt caching priced separately? +

No separate cache-read price is published for this row, so the calculator treats cached input as $0.1/M.

Q · 04 How is the effective price calculated? +

AI//COST uses the same 92/8 agentic blend everywhere. For Qwen 3.5 Flash, that gives $0.12/M with only documented cache discounts included.

Q · 05 Is there a batch discount? +

Alibaba lists Batch Invocation at 50% off for supported Qwen rows.

Q · 06 Are regional prices different? +

Yes. Alibaba Cloud publishes separate International, Global, US, EU, China (Hong Kong), and Chinese Mainland deployment sections. AI//COST uses the International / Singapore baseline for this queue unless a page explicitly says otherwise.

Reviewed by Yaroslav Vikhariev Founder - AI//COST - Pricing pulled daily from alibabacloud.com - Last verified July 11, 2026

Methodology Report a correction More by Y.V.