CODE AGENT · OPEN MODEL · LOW COST · TEXT + VISION
Devstral Small 2 API Pricing
Devstral Small 2 is an open coding-agent model. Baseline rates are $0.10/M input and $0.30/M output, pulled directly from mistral.ai and re-verified against the pricing page.
Input, per 1M tokens: $0.10/M (source: mistral.ai, flat)
Output, per 1M tokens: $0.30/M (context: 256K, flat)
Cache: N/A, treated as $0.10/M (the vendor lists no cache row)
Effective, agentic blend: $0.12/M (92/8 split, 82% cache)
§ 01 / TERMINAL
Run the numbers.
Live calculator pre-loaded with Devstral Small 2 rates. Tweak spend, output mix, or cache assumptions to compare it with sibling models.
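The calculator's "tokens you can process" figure can be approximated offline. The sketch below is an assumption about how such a figure is derived (budget divided by a blended per-million rate), not the page's actual implementation; the function names and the $100 budget are hypothetical, and the words-equivalent output is not modeled because the page does not state its words-per-token ratio.

```python
def tokens_for_budget(budget_usd: float, rate_usd_per_million: float) -> float:
    """Return how many tokens a monthly budget covers at a blended rate."""
    if rate_usd_per_million <= 0:
        raise ValueError("rate must be positive")
    return budget_usd / rate_usd_per_million * 1_000_000


def split_tokens(total_tokens: float, input_share: float = 0.92) -> tuple[float, float]:
    """Split a token total into (input, output) using the workload split."""
    return total_tokens * input_share, total_tokens * (1 - input_share)


if __name__ == "__main__":
    # Hypothetical $100/mo budget at the page's $0.12/M effective rate.
    total = tokens_for_budget(100.0, 0.12)
    tokens_in, tokens_out = split_tokens(total)
    print(f"total={total:,.0f} input={tokens_in:,.0f} output={tokens_out:,.0f}")
```

Changing `input_share` mirrors the workload-split control. A different cache assumption would change the blended rate passed in, not this function.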
§ 02 / SCENARIOS
Real-world presets.
VISION · Invoice extraction · $0.001/doc
CHATBOT · Support assistant · <$0.001/turn
RAG · Knowledge base answer · $0.001/query
BULK · Image QA review · <$0.001/item
§ 03 / TAPE
Price history.
Input · $0.10/M
Output · $0.30/M
Cached · $0.10/M
DEC 01 · Launch price $0.10/M input and $0.30/M output
MAY 18 · Live verification kept $0.10/M and $0.30/M
§ 04 / TOKENIZER
Paste text. See tokens. See cost.
Estimate · mistral-tokenizer-estimate · ≈3.85 chars/token
This is a chars-per-token approximation, not a real tokenizer. Actual token counts vary by language, code density, and tool-call overhead; estimates are typically off by ±10–20% for English prose, and by more for code or non-Latin scripts. For exact billing, use the vendor's official tokenizer.
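The same approximation is easy to reproduce locally. This is a minimal sketch assuming the ≈3.85 chars/token ratio and the baseline rates quoted on this page; the rounding rule and the function names are assumptions, and it is no more accurate than the estimator described above.

```python
CHARS_PER_TOKEN = 3.85    # ratio quoted on this page; an approximation
INPUT_USD_PER_M = 0.10    # baseline input rate, USD per 1M tokens
OUTPUT_USD_PER_M = 0.30   # baseline output rate, USD per 1M tokens


def estimate_tokens(text: str) -> int:
    """Estimate the token count from character length (not a real tokenizer)."""
    return round(len(text) / CHARS_PER_TOKEN)


def cost_usd(tokens: int, usd_per_million: float) -> float:
    """Convert a token count to USD at a per-million-token rate."""
    return tokens * usd_per_million / 1_000_000


if __name__ == "__main__":
    sample = "def add(a, b):\n    return a + b\n" * 100
    n = estimate_tokens(sample)
    print(f"tokens~{n}")
    print(f"as input:  ${cost_usd(n, INPUT_USD_PER_M):.6f}")
    print(f"as output: ${cost_usd(n, OUTPUT_USD_PER_M):.6f}")
```

Because no separate cache-read price is listed for this model, the cached-input cost equals the uncached input cost under these assumptions.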
| Model | Input /M | Output /M | Effective blended | Context | Best for |
|---|---|---|---|---|---|
| Devstral Small 2 (current) | $0.10 | $0.30 | $0.12 (agentic 92/8) | 256K | Open coding agents |
| Mistral Large 3 | $0.50 | $1.50 | $0.58 (pricier) | 128K | Mistral production workloads |
| Mistral Small 4 | $0.15 | $0.60 | $0.19 (pricier) | 256K | Low-cost multimodal chat |
| Magistral Medium | $2.00 | $5.00 | $2.24 (pricier) | 256K | Transparent reasoning |
| Magistral Small | $0.50 | $1.50 | $0.58 (pricier) | 256K | Transparent reasoning |
| Ministral 3 14B | $0.20 | $0.20 | $0.20 (pricier) | 128K | Mistral production workloads |
| Ministral 3 3B | $0.10 | $0.10 | $0.10 (cheaper) | 128K | Mistral production workloads |
| DeepSeek V4 Flash | $0.14 (cache $0.00) | $0.28 | $0.05 (cheaper) | 1M | Mistral production workloads |
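The "Effective blended" column can be reproduced with a simple weighted average. The page does not state its exact formula, so the sketch below is an assumption: 92% of tokens are input and 8% output, 82% of input tokens hit the cache, and the cached rate falls back to the input rate when no cache price is listed. The function name and parameters are hypothetical.

```python
from typing import Optional


def effective_rate(
    input_rate: float,
    output_rate: float,
    cached_rate: Optional[float] = None,
    input_share: float = 0.92,
    cache_hit: float = 0.82,
) -> float:
    """Blend input, cached-input, and output rates (USD per 1M tokens)."""
    if cached_rate is None:
        # Assumption: with no listed cache price, cached input costs the same.
        cached_rate = input_rate
    blended_input = cache_hit * cached_rate + (1 - cache_hit) * input_rate
    return input_share * blended_input + (1 - input_share) * output_rate


if __name__ == "__main__":
    rows = {
        "Devstral Small 2": (0.10, 0.30),
        "Mistral Large 3": (0.50, 1.50),
        "Mistral Small 4": (0.15, 0.60),
        "Magistral Medium": (2.00, 5.00),
    }
    for name, (inp, out) in rows.items():
        print(f"{name}: ${effective_rate(inp, out):.2f}/M")
```

For Devstral Small 2 this gives 0.92 × $0.10 + 0.08 × $0.30 = $0.116, which rounds to the $0.12 shown in the table. The cache hit rate has no effect on rows without a discounted cache price.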
Frequently asked.
Practical pricing questions, separated from calculator assumptions.
Q · 01 What is Devstral Small 2 priced at?
Devstral Small 2 is shown at $0.10/M input and $0.30/M output. The page stores USD per-million-token baseline pricing from mistral.ai.
Q · 02 Does this page include higher context pricing tiers?
This page uses the public API baseline for the model row. If Mistral publishes separate long-context tiers later, the snapshot entry should be re-verified and updated.
Q · 03 Is prompt caching priced separately?
No separate cache-read price is published for this row, so the calculator treats cached input as $0.10/M.
Q · 04 How is the effective price calculated?
AI//COST uses the same 92/8 agentic blend everywhere. For Devstral Small 2, that gives $0.12/M with only documented cache discounts included.
Q · 05 Is there a batch discount?
Mistral pricing does not list a separate per-model batch discount in this row.
Q · 06 Are regional prices different?
Mistral's public pricing page lets you view USD and EUR. AI//COST stores the USD baseline and handles currency variants in calculator URLs.