REASONING MODEL · 128K CONTEXT · PREMIER TIER · NO CACHE SKU

Magistral Medium API Pricing

Magistral Medium is Mistral's current reasoning model, with live pricing verified from the official model card. Baseline token rates are $2/M input and $5/M output; no separate prompt-cache price is published for this row. Prices are pulled from official Mistral sources and the project pricing snapshot.

Input, per 1M tokens: $2.00/M (source: Mistral; flat)
Output, per 1M tokens: $5.00/M (context: 128K; flat)
Cache: N/A; cached input shown at $2.00/M (no separate SKU listed)
Effective, agentic blend: $2.24/M (92/8 split, 82% cache)
§ 01 / TERMINAL

Run the numbers.

Live calculator pre-loaded with Magistral Medium rates. Tweak spend, output mix, or cache assumptions to compare this row against current Mistral replacements.
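
As a rough illustration of the arithmetic behind a budget calculator like this one, the sketch below converts a monthly spend into a token count at a blended per-million rate. The function name `tokens_for_budget` and its parameters are hypothetical and not taken from the site; the default rate is this page's $2.24/M effective figure, and the live calculator may round differently.

```python
def tokens_for_budget(monthly_usd: float, rate_per_million: float = 2.24) -> float:
    """Return how many tokens a monthly budget buys at a blended $/1M-token rate.

    Both the name and the default are illustrative assumptions; 2.24 is the
    effective agentic-blend rate quoted on this page for Magistral Medium.
    """
    if monthly_usd < 0:
        raise ValueError("monthly_usd must be non-negative")
    if rate_per_million <= 0:
        raise ValueError("rate_per_million must be positive")
    # Dollars divided by dollars-per-million-tokens gives millions of tokens.
    return monthly_usd / rate_per_million * 1_000_000


if __name__ == "__main__":
    # Compare the same budget at this row's rate and at a cheaper shelf rate.
    for label, rate in [("Magistral Medium", 2.24), ("Mistral Large 3", 0.58)]:
        print(f"{label}: {tokens_for_budget(500, rate):,.0f} tokens for $500/mo")
```

Passing a different `rate_per_million` is one way to compare this row against a replacement model at the same spend.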

§ 03 / TAPE

Price history.

Held at $2/M input and $5/M output since launch.

Input · $2/M
Output · $5/M
Cached · $2/M
SEP 18 · Launch price: $2/M input and $5/M output
MAY 18 · Live verification kept $2/M and $5/M
§ 04 / TOKENIZER

Paste text. See tokens. See cost.

Estimate · mistral-tokenizer-estimate · ≈3.85 chars/token

This is a chars-per-token approximation, not a real tokenizer. Actual token counts vary by language, code density, and tool-call overhead; estimates are typically off by 10–20% for English prose, and by more for code or non-Latin scripts. For exact billing, use the vendor's official tokenizer.
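
The approximation described above can be sketched in a few lines. This is a minimal example, assuming the page's ≈3.85 chars/token ratio and the $2/M input and $5/M output rates; the names `estimate_tokens` and `estimate_cost` are hypothetical, and rounding to the nearest token is an assumption, since the page does not say how its estimator rounds.

```python
CHARS_PER_TOKEN = 3.85    # approximate ratio quoted on this page
INPUT_USD_PER_M = 2.00    # Magistral Medium input rate, $ per 1M tokens
OUTPUT_USD_PER_M = 5.00   # Magistral Medium output rate, $ per 1M tokens


def estimate_tokens(text: str) -> int:
    """Estimate tokens from character count (not a real tokenizer)."""
    # Rounding to the nearest whole token is an assumption of this sketch.
    return round(len(text) / CHARS_PER_TOKEN)


def estimate_cost(text: str) -> dict:
    """Return the estimated token count and its cost as input or as output."""
    tokens = estimate_tokens(text)
    return {
        "characters": len(text),
        "tokens": tokens,
        "as_input_usd": tokens * INPUT_USD_PER_M / 1_000_000,
        "as_output_usd": tokens * OUTPUT_USD_PER_M / 1_000_000,
    }


if __name__ == "__main__":
    sample = "Magistral Medium is priced per million tokens. " * 100
    print(estimate_cost(sample))
```

Because no separate cache SKU is listed for this row, the cost as cached input would equal `as_input_usd` in this sketch.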

§ 05 / SHELF

Up against the shelf.

| Model | Input /M | Output /M | Effective blended | Context | Best for |
|---|---|---|---|---|---|
| Magistral Medium (current) | $2.00 | $5.00 | $2.24 (agentic 92/8) | 128K | Transparent reasoning |
| Mistral Large 3 | $0.50 | $1.50 | $0.58 (cheaper) | 256K | Frontier multimodal work |
| Mistral Medium 3.5 | $1.50 | $7.50 | $1.98 (cheaper) | 256K | General Mistral workloads |
| Mistral Small 4 | $0.15 | $0.60 | $0.19 (cheaper) | 256K | Low-cost chat and support |
| Codestral | $0.30 | $0.90 | $0.35 (cheaper) | 256K | Code generation and agents |
| Devstral 2 | $0.40 | $2.00 | $0.53 (cheaper) | 256K | Code generation and agents |
| DeepSeek V4 Flash | $0.14 (cache $0.00) | $0.28 | $0.05 (cheaper) | 1M | Budget coding and bulk reasoning |

Frequently asked.

Practical pricing questions, with archived prices separated from current availability.

Q · 01 What is Magistral Medium priced at?
Magistral Medium is shown here at $2/M input and $5/M output. The current official model card shows the same headline rate.
Q · 02 Is Magistral Medium still available?
Magistral Medium is listed as an active Mistral model today. The page keeps the live price and calculator assumptions together for invoice checks.
Q · 03 Does Mistral list prompt caching for this model?
No separate prompt-cache SKU is published for this row. The calculator therefore treats cache-hit input as the base input rate, $2/M, so the effective blend does not assume a hidden discount.
Q · 04 What does the effective price mean?
The effective tile uses the site-wide agentic blend: 92% input tokens, 8% output tokens, and no cache discount unless a vendor lists one. For Magistral Medium, that gives 0.92 × $2 + 0.08 × $5 = $2.24/M.
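
The blend described in this answer can be sketched as a small function. This is an illustration rather than the site's actual implementation: the name `effective_rate` is hypothetical, and applying the 82% cache figure to the input share is an assumption about how the tile is computed. When no cached price is listed, the sketch falls back to the base input rate, so the hit rate has no effect on the result.

```python
from typing import Optional


def effective_rate(
    input_price: float,
    output_price: float,
    cached_price: Optional[float] = None,
    input_share: float = 0.92,      # site-wide agentic blend: 92% input tokens
    cache_hit_rate: float = 0.82,   # assumed to apply to input tokens only
) -> float:
    """Blended $/1M tokens for a given input/output split and cache hit rate."""
    if cached_price is None:
        # No separate cache SKU: cache hits cost the same as uncached input.
        cached_price = input_price
    # Average input price across cached and uncached input tokens.
    input_blend = cache_hit_rate * cached_price + (1 - cache_hit_rate) * input_price
    output_share = 1 - input_share
    return input_share * input_blend + output_share * output_price


if __name__ == "__main__":
    print(round(effective_rate(2.00, 5.00), 2))  # 2.24 for Magistral Medium
```

Using the rates from the comparison table, `effective_rate(0.50, 1.50)` rounds to the $0.58 shown for Mistral Large 3, which suggests the same 92/8 blend is applied across the Mistral rows.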
Q · 05 Where does the archive price come from?
For active rows, the price comes from Mistral's current pricing or model card. For retired rows, AI//COST preserves the verified snapshot price and uses Mistral's current docs to verify deprecation, retirement, and replacement status.
Q · 06 Is there a batch discount?
Mistral exposes batching in the API docs, but no separate per-model batch rate is listed for this row. The calculator keeps the baseline token prices visible and avoids inventing a discount.