Last verified 2026-05-18

DEPRECATED · MULTIMODAL · LOW COST · NO CACHE SKU

Mistral Small 3.2 API Pricing

Q: What is Mistral Small 3.2 priced at?

Mistral Small 3.2 is listed at $0.10/M input and $0.30/M output. These are the USD per-million-token baseline rates published on mistral.ai.

Deprecated: Mistral removed Small 3.2 from the live pricing page and moved it to the legacy table (verified 2026-07-11). The recommended replacement is Mistral Small 4. The last-listed baseline rates, $0.10/M input and $0.30/M output, are retained here for reference.

Input - per 1M tokens

$0.10/M

Source: mistral.ai - flat rate

Output - per 1M tokens

$0.30/M

Context: 256K - flat rate

Cache - per 1M tokens

$0.10/M

No vendor cache row listed; cached input is charged at the input rate

Effective - agentic blend

$0.12/M

92/8 input/output split - 82% cache hit rate

§ 01 / TERMINAL

Run the numbers.

Live calculator pre-loaded with Mistral Small 3.2 rates. Tweak spend, output mix, or cache assumptions to compare it with sibling models.
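
As a rough illustration of what such a calculator computes, the sketch below turns a monthly spend, an output share, and a cache hit rate into a token budget using the rates on this page. The function names are hypothetical, and the words-per-token ratio is an assumption back-computed from the sample counts in the tokenizer section (60 words for 75 tokens), not a published constant.

```python
# Rates from this page (USD per 1M tokens). No cache row is listed, so cached
# input is charged at the normal input rate.
INPUT_PER_M = 0.10
OUTPUT_PER_M = 0.30
CACHED_INPUT_PER_M = 0.10

# Assumption: words-per-token ratio back-computed from the sample counts in
# the tokenizer section (60 words / 75 tokens), not a published constant.
WORDS_PER_TOKEN = 60 / 75


def effective_rate_per_m(output_share: float, cache_hit_rate: float) -> float:
    """Blended USD per 1M tokens for a given output share and cache hit rate."""
    input_share = 1.0 - output_share
    input_rate = ((1.0 - cache_hit_rate) * INPUT_PER_M
                  + cache_hit_rate * CACHED_INPUT_PER_M)
    return input_share * input_rate + output_share * OUTPUT_PER_M


def tokens_for_budget(spend_usd: float, output_share: float = 0.08,
                      cache_hit_rate: float = 0.82) -> float:
    """Total tokens (input plus output) that a given spend covers."""
    rate = effective_rate_per_m(output_share, cache_hit_rate)
    return spend_usd / rate * 1_000_000


if __name__ == "__main__":
    rate = effective_rate_per_m(0.08, 0.82)
    tokens = tokens_for_budget(100.0)
    print(f"effective rate: ${rate:.3f}/M")
    print(f"tokens for $100: {tokens:,.0f}")
    print(f"words (English, approx.): {tokens * WORDS_PER_TOKEN:,.0f}")
```

Because this model has no cache discount, the cache hit rate does not change the result here; it only matters for models that publish a cheaper cache-read price.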

§ 02 / SCENARIOS

Real-world presets.

VISION

Invoice extraction

$0.0008/doc

6,000 in - 800 out · ~125,000 units/$100

CHATBOT

Support assistant

$0.0004/turn

2,500 in - 600 out · ~250,000 units/$100

RAG

Knowledge base answer

$0.0012/query

9,000 in - 1,000 out · ~83,333 units/$100

BULK

Image QA review

$0.0004/item

3,000 in - 400 out · ~250,000 units/$100
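
The per-unit costs in these presets follow directly from the token counts and the listed rates. The sketch below reproduces the arithmetic without rounding; the function names are hypothetical. The unrounded unit counts come out slightly different from the approximate figures above, which appear to be derived from a per-unit cost rounded to four decimal places (an inference from the numbers, not something the page states).

```python
# Listed baseline rates for Mistral Small 3.2 (USD per 1M tokens).
INPUT_PER_M = 0.10
OUTPUT_PER_M = 0.30

# Preset name -> (input tokens, output tokens) per unit, as listed above.
SCENARIOS = {
    "Invoice extraction": (6_000, 800),
    "Support assistant": (2_500, 600),
    "Knowledge base answer": (9_000, 1_000),
    "Image QA review": (3_000, 400),
}


def cost_per_unit(input_tokens: int, output_tokens: int) -> float:
    """Uncached cost in USD of one document, turn, query, or item."""
    return (input_tokens * INPUT_PER_M + output_tokens * OUTPUT_PER_M) / 1_000_000


def units_per_budget(unit_cost: float, budget_usd: float = 100.0) -> int:
    """Whole units that fit in the budget at the unrounded unit cost."""
    return int(budget_usd / unit_cost)


if __name__ == "__main__":
    for name, (tokens_in, tokens_out) in SCENARIOS.items():
        unit = cost_per_unit(tokens_in, tokens_out)
        print(f"{name}: ${unit:.5f}/unit, {units_per_budget(unit):,} units per $100")
```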

§ 03 / TOKENIZER

Paste text. See tokens. See cost.

Counts use a chars-per-token calibration measured on the vendor's own published tokenizer (mistralai/Mistral-Nemo-Instruct-2407, 2026-06-10). English prose is typically within a few percent; code and non-Latin scripts tokenize heavier. For billing-exact counts use the vendor's count-tokens API.

Characters 381

Words 60

Tokens (estimated) 75 tokens

Cost as input · uncached $0.000008 USD

Cost as output · uncached $0.000023 USD

Cost as cached input $0.000008 USD
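
A chars-per-token estimate of this kind can be sketched in a few lines. The ratio below is an assumption back-computed from the sample counts above (381 characters for 75 tokens); the calibration constant the page actually uses is not given, and the function names are hypothetical. For billing-exact counts, the vendor's tokenizer remains the reference.

```python
# Assumption: ratio back-computed from the sample above (381 chars -> 75 tokens).
# The page's real calibration constant is not published here.
CHARS_PER_TOKEN = 381 / 75

# Listed rates (USD per 1M tokens); cached input has no separate price.
INPUT_PER_M = 0.10
OUTPUT_PER_M = 0.30
CACHED_INPUT_PER_M = 0.10


def estimate_tokens(char_count: int) -> int:
    """Estimate the token count from a character count (English prose)."""
    return round(char_count / CHARS_PER_TOKEN)


def cost_usd(tokens: int, rate_per_m: float) -> float:
    """Cost in USD of `tokens` tokens at a per-million-token rate."""
    return tokens * rate_per_m / 1_000_000


if __name__ == "__main__":
    tokens = estimate_tokens(381)
    print(f"tokens (estimated): {tokens}")
    print(f"as input:        ${cost_usd(tokens, INPUT_PER_M):.7f}")
    print(f"as output:       ${cost_usd(tokens, OUTPUT_PER_M):.7f}")
    print(f"as cached input: ${cost_usd(tokens, CACHED_INPUT_PER_M):.7f}")
```

Code and non-Latin scripts tokenize heavier, so a single ratio like this will undercount for those inputs.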

§ 04 / SHELF

Up against the shelf.

Model	Input /M	Output /M	Effective blended	Context	Best for
Mistral Small 3.2 (this page)	$0.10	$0.30	$0.12 agentic 92/8	256K	Low-cost multimodal chat
Mistral Large 3	$0.50	$1.50	$0.58 pricier	128K	Mistral production workloads
Mistral Small 4	$0.15	$0.60	$0.19 pricier	256K	Low-cost multimodal chat
Magistral Medium	$2.00	$5.00	$2.24 pricier	256K	Transparent reasoning
Magistral Small	$0.50	$1.50	$0.58 pricier	256K	Transparent reasoning
Ministral 3 14B	$0.20	$0.20	$0.20 pricier	128K	Mistral production workloads
Ministral 3 3B	$0.10	$0.10	$0.10 cheaper	128K	Mistral production workloads
DeepSeek V4 Flash	$0.14 cache $0.00	$0.28	$0.05 cheaper	1M	Mistral production workloads
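
The "Effective blended" column for the Mistral rows can be reproduced from the input and output rates with the 92/8 split and 82% cache hit rate. The sketch below is a minimal version under two assumptions: none of these rows has a cache discount (the table lists none for them), and the cheaper/pricier label is a plain comparison against the Small 3.2 blended rate. The DeepSeek row is left out because its cache price is shown only as a rounded $0.00.

```python
from typing import Optional

# Model -> (input USD/M, output USD/M), copied from the table above.
SHELF = {
    "Mistral Small 3.2": (0.10, 0.30),
    "Mistral Large 3": (0.50, 1.50),
    "Mistral Small 4": (0.15, 0.60),
    "Magistral Medium": (2.00, 5.00),
    "Magistral Small": (0.50, 1.50),
    "Ministral 3 14B": (0.20, 0.20),
    "Ministral 3 3B": (0.10, 0.10),
}


def effective(input_per_m: float, output_per_m: float,
              cached_per_m: Optional[float] = None,
              input_share: float = 0.92, cache_hit: float = 0.82) -> float:
    """Blended USD/M: input share (split cached/uncached) plus output share."""
    if cached_per_m is None:
        # Assumption: no published cache row means cached input costs the same.
        cached_per_m = input_per_m
    input_rate = (1.0 - cache_hit) * input_per_m + cache_hit * cached_per_m
    return input_share * input_rate + (1.0 - input_share) * output_per_m


def label(model: str, baseline: str = "Mistral Small 3.2") -> str:
    """Compare a model's blended rate with the baseline model's."""
    ours, theirs = effective(*SHELF[baseline]), effective(*SHELF[model])
    if theirs > ours:
        return "pricier"
    return "cheaper" if theirs < ours else "same"


if __name__ == "__main__":
    for name, rates in SHELF.items():
        print(f"{name}: ${effective(*rates):.2f}/M ({label(name)})")
```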

Frequently asked.

Practical pricing questions, separated from calculator assumptions.

Q · 01 What is Mistral Small 3.2 priced at?

Mistral Small 3.2 is listed at $0.10/M input and $0.30/M output. These are the USD per-million-token baseline rates published on mistral.ai.

Q · 02 Does this page include higher context pricing tiers?

This page uses the public API baseline for the model row. If Mistral publishes separate long-context tiers later, the snapshot entry should be re-verified and updated.

Q · 03 Is prompt caching priced separately?

No separate cache-read price is published for this row, so the calculator charges cached input at the standard $0.10/M input rate.

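Q · 04 How is the effective price calculated?

AI//COST applies the same 92/8 input/output agentic blend to every model. For Mistral Small 3.2 that is 0.92 × $0.10 + 0.08 × $0.30 = $0.116/M, shown as $0.12/M. Only documented cache discounts are included, and none is published for this model.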

Q · 05 Is there a batch discount?

Mistral pricing does not list a separate per-model batch discount in this row.

Q · 06 Are regional prices different?

Mistral's public pricing page lets you view USD and EUR. AI//COST stores the USD baseline and handles currency variants in calculator URLs.

Reviewed by Yaroslav Vikhariev, Founder, AI//COST - Pricing pulled daily from mistral.ai - Last verified May 18, 2026
