Last verified
OPEN MODELMULTIMODALLOW COSTNO CACHE SKU

Mistral Small 3.2 API Pricing

Mistral Small 3.2 is open multimodal small model. Baseline rates are $0.1/M input and $0.3/M output. Pulled directly from mistral.ai and re-verified against the pricing page.

Input - per 1M tokens
$0.10/M
Source mistral.ai flat
Output - per 1M tokens
$0.30/M
Context 256K flat
Cache N/A
$0.10/M
Cache vendor row not listed
Effective - agentic blend
$0.12/M
92/8 split - 82% cache
§ 01 / TERMINAL

Run the numbers.

Live calculator pre-loaded with Mistral Small 3.2 rates. Tweak spend, output mix, or cache assumptions to compare it with sibling models.

$ /mo
Workload split
Prompt cache hit rate
Tokens you can process
Words equivalent (English)
Effective rate
§ 02 / SCENARIOS

Real-world presets.

§ 03 / TAPE

Price history.

Listed at $0.1/M input and $0.3/M output.

Input · $0.10/M
Output · $0.30/M
Cached · $0.10/M
JUN 20 Launch price $0.1/M input and $0.3/M outputMAY 18 Live verification kept $0.1/M and $0.3/M
§ 04 / TOKENIZER

Paste text. See tokens. See cost.

Estimate · mistral-tokenizer-estimate · ≈3.85 chars/token Auto-counts as you type

This is a chars-per-token approximation, not a real tokenizer. Actual tokens vary by language, code density, and tool-call overhead — counts are typically ±10–20% off for English prose, more for code or non-Latin scripts. For exact billing, use the vendor's official tokenizer.

Characters
Words
Tokens (estimated)
Cost as input · uncached
Cost as output · uncached
Cost as cached input
§ 05 / SHELF

Up against the shelf.

All models →
Model Input /M Output /M Effective blended Context Best for
Mistral Small 3.2 Current $0.10 $0.30 $0.12 agentic 92/8 256K Low-cost multimodal chat
Mistral Large 3 $0.50 $1.50 $0.58 pricier 128K Mistral production workloads
Mistral Small 4 $0.15 $0.60 $0.19 pricier 256K Low-cost multimodal chat
Magistral Medium $2.00 $5.00 $2.24 pricier 256K Transparent reasoning
Magistral Small $0.50 $1.50 $0.58 pricier 256K Transparent reasoning
Ministral 3 14B $0.20 $0.20 $0.20 pricier 128K Mistral production workloads
Ministral 3 3B $0.10 $0.10 $0.10 cheaper 128K Mistral production workloads
DeepSeek V4 Flash $0.14 cache $0.00 $0.28 $0.05 cheaper 1M Mistral production workloads

Frequently asked.

Practical pricing questions, separated from calculator assumptions.

Q · 01 What is Mistral Small 3.2 priced at? +
Mistral Small 3.2 is shown at $0.1/M input and $0.3/M output. The page stores USD per-million-token baseline pricing from mistral.ai.
Q · 02 Does this page include higher context pricing tiers? +
This page uses the public API baseline for the model row. If Mistral publishes separate long-context tiers later, the snapshot entry should be re-verified and updated.
Q · 03 Is prompt caching priced separately? +
No separate cache-read price is published for this row, so the calculator treats cached input as $0.1/M.
Q · 04 How is the effective price calculated? +
AI//COST uses the same 92/8 agentic blend everywhere. For Mistral Small 3.2, that gives $0.12/M with only documented cache discounts included.
Q · 05 Is there a batch discount? +
Mistral pricing does not list a separate per-model batch discount in this row.
Q · 06 Are regional prices different? +
Mistral's public pricing page lets you view USD and EUR. AI//COST stores the USD baseline and handles currency variants in calculator URLs.