VERIFIED PRICECHEAP OPEN MODEL256KTEXT + VISIONNO CACHE SKU
Mistral Small 4 API Pricing
Mistral Small 4 is the cheap multimodal Mistral tier at $0.15/$0.60 per million tokens. $0.15/M input, $0.6/M output. The model card lists 256K context and a hybrid instruct, reasoning, and coding profile.
Input - per 1M tokens
$0.15/M
Source Mistral active
Output - per 1M tokens
$0.60/M
Use for cost checks active
Cached input
$0.15/M
Cache no separate SKU not listed
Effective - agentic blend
$0.19/M
92/8 split - 82% cache
§ 01 / TERMINAL
Run the numbers.
Calculator pre-loaded with Mistral Small 4 rates. Tweak spend, output mix, or cache hit rate to compare this model with nearby alternatives.
$ /mo
Workload split
Prompt cache hit rate
Tokens you can process
—
Words equivalent (English)
—
Effective rate
—
§ 02 / SCENARIOS
Real-world presets.
AGENT LOOP
Repo-wide bug fix
$0.016/task
LONG DOC
Reading 100-page contracts
$0.031/doc
RAG SUPPORT
Support agent ticket triage
$0.001/ticket
ASSISTANT TURN
Research planning turn
$0.003/turn
§ 03 / TAPE
Price history.
Input · $0.15/M
Output · $0.60/M
Cached · $0.15/M
MAR 16 Model card price at $0.15/M input and $0.60/M outputMAY 18 Live Mistral model-card pricing verified
§ 04 / TOKENIZER
Paste text. See tokens. See cost.
Estimate · mistral-tokenizer-estimate · ≈4.2 chars/token Auto-counts as you type
This is a chars-per-token approximation, not a real tokenizer. Actual tokens vary by language, code density, and tool-call overhead — counts are typically ±10–20% off for English prose, more for code or non-Latin scripts. For exact billing, use the vendor's official tokenizer.
Characters —
Words —
Tokens (estimated) —
Cost as input · uncached —
Cost as output · uncached —
Cost as cached input —
| Model | Input /M | Output /M | Effective blended | Context | Best for |
|---|---|---|---|---|---|
| Mistral Small 4 | $0.15 cache $0.15 | $0.60 | $0.19 current page | 256K | Low-cost Mistral workloads |
| Mistral Large 3 | $0.50 cache $0.50 | $1.50 | $0.58 pricier | 256K | Open-weight multimodal work |
| Mistral Medium 3.5 | $1.50 cache $1.50 | $7.50 | $1.98 pricier | 256K | Agentic and coding workloads |
| Devstral 2 | $0.40 cache $0.40 | $2.00 | $0.53 pricier | 256K | Coding and software agents |
| GPT-5.4 mini | $0.75 cache $0.07 | $4.50 | $0.54 pricier | 400K | Open-weight multimodal work |
| Claude Sonnet 4.6 | $3.00 cache $0.30 | $15.00 | $1.92 pricier | 1M | Open-weight multimodal work |
Frequently asked.
Short answers for teams checking Mistral Small 4 pricing, status, and migration choices.
Q · 01 Is Mistral Small 4 still available? +
Mistral Small 4 is listed as active/current in the pricing snapshot and was checked against the vendor page used for this entry.
Q · 02 How much does Mistral Small 4 cost? +
Mistral Small 4 is listed at $0.15/M input and $0.6/M output.
Q · 03 Is cached-input pricing included? +
No separate cached-input SKU is listed for this entry, so the calculator treats cached input as undiscounted.
Q · 04 What should teams compare it against? +
Use the shelf to compare nearby current models. Snapshot replacement:
current same-provider models.Q · 05 Why keep this page? +
These pages preserve live price checks, preview economics, and migration context in one consistent calculator format.