Last verified
OPEN 8B128K CONTEXTHYBRID MODESINTERNATIONAL PRICETEXT + CODE

Qwen3 8B API Pricing

Qwen3 8B is an Alibaba Qwen3 text model priced from the Singapore/International row of Model Studio. The verified rate is $0.1951/M input and $0.7586/M output, converted from CNY at 1 CNY = $0.14768 The Qwen3 launch blog lists Qwen3 8B as an Apache 2.0 dense model with 128K context.

Input - per 1M tokens
$0.20/M
Source Alibaba Model Studio flat
Output - per 1M tokens
$0.76/M
Mode non-thinking baseline flat
Thinking output $2.28/M
$2.28/M
Thinking chain + answer reasoning
Effective - agentic blend
$0.24/M
92/8 split - no cache
§ 01 / TERMINAL

Run the numbers.

Live calculator pre-loaded with the verified International Qwen3 8B rates. Use it for invoice checks, agent traces, and scenario planning before large Model Studio runs.

$ /mo
Workload split
Prompt cache hit rate
Tokens you can process
Words equivalent (English)
Effective rate
Open full calculator (all models · share URL · CSV) →
§ 02 / SCENARIOS

Real-world presets.

§ 03 / TAPE

Price history.

Qwen3 8B is listed at $0.1951/M input and $0.7586/M output on Alibaba's International Model Studio row.

Input · $0.20/M
Output · $0.76/M
Cached · $0.20/M
APR 29 Qwen3 8B entered the Qwen3 API/catalog lineageJUN 09 Live International pricing verified at $0.1951/M input and $0.7586/M output
§ 04 / TOKENIZER

Paste text. See tokens. See cost.

Calibrated · measured on the vendor's tokenizer · 2026-06-10 Auto-counts as you type

Counts use a chars-per-token calibration measured on the vendor's own published tokenizer (Qwen/Qwen3.5-397B-A17B, 2026-06-10). English prose is typically within a few percent; code and non-Latin scripts tokenize heavier. For billing-exact counts use the vendor's count-tokens API.

Characters 397
Words 58
Tokens (estimated) 76 tokens
Cost as input · uncached $0.000015 USD
Cost as output · uncached $0.000058 USD
Cost as cached input $0.000015 USD
§ 05 / SHELF

Up against the shelf.

All models →
Model Input /M Output /M Effective blended Context Best for
Qwen3 8B Current $0.20 $0.76 $0.24 agentic 92/8 128K Small open Qwen3 deployment
Qwen3 Next 80B A3B Thinking $0.16 $1.30 $0.25 thinking 128K Qwen3 sibling
Qwen3 Next 80B A3B Instruct $0.16 $1.30 $0.25 sibling 128K Qwen3 sibling
Qwen3 235B A22B Thinking 2507 $0.25 $2.49 $0.43 thinking 128K Qwen3 sibling
Qwen3 235B A22B Instruct 2507 $0.25 $1.00 $0.31 sibling 128K Qwen3 sibling
Qwen3 30B A3B Thinking 2507 $0.22 $2.60 $0.41 thinking 128K Qwen3 sibling
Qwen3 30B A3B Instruct 2507 $0.22 $0.87 $0.27 sibling 128K Qwen3 sibling
Qwen3 30B A3B $0.22 $0.87 $0.27 sibling 128K Qwen3 sibling
Qwen3.7 Plus $0.44 $1.77 $0.55 sibling 1M Current Plus tier
Qwen3.7 Max $2.77 $8.31 $3.21 sibling 1M Current Max flagship
§ 06 / DEEP LINKS

Specific scenarios.

All calculators →

Audit links

Frequently asked.

Short answers for teams comparing Qwen3 8B against other current Qwen3 text models.

Q · 01 What is the Qwen3 8B input price? +
Alibaba's Singapore/International pricing row lists 1.321 CNY per 1M input tokens, converted here to $0.1951/M at 1 CNY = $0.14768.
Q · 02 What is the Qwen3 8B output price? +
Qwen3 8B is shown at $0.7586/M output on this page. The original vendor row is 5.137 CNY per 1M non-thinking output tokens.
Q · 03 Does Alibaba list prompt-cache pricing for this row? +
No exact cache-read token price is listed in the verified row, so the calculator keeps cache disabled instead of inventing a discount.
Q · 04 Why does this page use International pricing? +
AI//COST uses the Singapore/International row for Alibaba pages because it is the relevant baseline for non-mainland deployment and differs from some US/EU global rows.
Q · 05 Is this a text LLM page only? +
Yes. This page covers the text/code API model. Audio, image, and video models are intentionally handled separately.
Q · 06 Where did the context window come from? +
Context comes from Qwen's official model/blog documentation: Qwen3 launch blog lists Qwen3-8B with 128K context. Pricing comes separately from Alibaba Model Studio.