Last verified 2026-07-11

REASONING MODEL · 128K CONTEXT · PREMIER TIER · NO CACHE SKU

Magistral Medium API Pricing


Magistral Medium is Mistral's current reasoning model, with live pricing verified against the official model card. Baseline token rates are $2/M input and $5/M output; no separate prompt-cache price is published for this row. Prices are pulled from official Mistral sources and the project pricing snapshot.

Input, per 1M tokens: $2.00/M (source: Mistral, flat)

Output, per 1M tokens: $5.00/M (context: 128K, flat)

Cache: N/A, billed at $2.00/M (no separate SKU listed)

Effective, agentic blend: $2.24/M (92/8 split, 82% cache)

§ 01 / TERMINAL

Run the numbers.

Live calculator pre-loaded with Magistral Medium rates. Tweak spend, output mix, or cache assumptions to compare this row against current Mistral replacements.
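As a rough illustration of what the calculator does with a monthly spend figure, the sketch below converts a budget into a token count at a blended rate. The function name and the $100 budget are hypothetical; the $2.24/M rate is the effective blend quoted on this page. This is not the site's actual calculator code.

```python
def tokens_for_budget(spend_usd: float, effective_rate_per_m: float) -> int:
    """Whole tokens affordable at a blended USD-per-million-token rate."""
    if effective_rate_per_m <= 0:
        raise ValueError("effective rate must be positive")
    return int(spend_usd / effective_rate_per_m * 1_000_000)


# Hypothetical $100/month budget at the page's $2.24/M effective rate.
budget_tokens = tokens_for_budget(100.0, 2.24)
print(f"{budget_tokens:,} tokens")  # 44,642,857 tokens
```

Changing the workload split or cache assumption only changes the effective rate passed in; the conversion itself stays the same.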


§ 02 / SCENARIOS

Real-world presets.

CHATBOT

Support assistant

$0.0065/turn

2,000 in, 500 out · ~15,384 units/$100

CODING AGENT

Repo task iteration

$0.125/task

50,000 in, 5,000 out · ~800 units/$100

LONG DOC

Contract review

$0.370/doc

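180,000 in, 2,000 out · ~270 units/$100 (180,000 input tokens exceeds the 128K context listed above, so this preset cannot run as a single request)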

RAG

Knowledge base answer

$0.021/query

8,000 in, 1,000 out · ~4,761 units/$100
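The preset figures above follow from the baseline rates alone. A minimal sketch, assuming uncached pricing at $2/M input and $5/M output and that units per $100 are rounded down; the function names are illustrative, not part of any Mistral or AI//COST API.

```python
INPUT_PER_M = 2.00   # USD per 1M input tokens (from this page)
OUTPUT_PER_M = 5.00  # USD per 1M output tokens (from this page)


def request_cost(tokens_in: int, tokens_out: int) -> float:
    """Uncached cost in USD of one request."""
    return (tokens_in * INPUT_PER_M + tokens_out * OUTPUT_PER_M) / 1_000_000


def units_per_budget(tokens_in: int, tokens_out: int, budget_usd: float = 100.0) -> int:
    """Whole requests that fit in the budget (rounded down)."""
    return int(budget_usd / request_cost(tokens_in, tokens_out))


presets = {
    "chatbot": (2_000, 500),
    "coding agent": (50_000, 5_000),
    "long doc": (180_000, 2_000),
    "rag": (8_000, 1_000),
}
for name, (tokens_in, tokens_out) in presets.items():
    cost = request_cost(tokens_in, tokens_out)
    print(f"{name}: ${cost:.4f}/unit, ~{units_per_budget(tokens_in, tokens_out):,} units/$100")
```

Run as written, this gives $0.0065, $0.1250, $0.3700, and $0.0210 per unit, with 15,384, 800, 270, and 4,761 units per $100.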

§ 03 / TOKENIZER

Paste text. See tokens. See cost.

Your text · live count

Calibrated · measured on the vendor's tokenizer · 2026-06-10

Counts use a chars-per-token calibration measured on the vendor's own published tokenizer (mistralai/Mistral-Nemo-Instruct-2407, 2026-06-10). English prose is typically within a few percent; code and non-Latin scripts use more tokens per character. For billing-exact counts use the vendor's count-tokens API.

Characters 408

Words 65

Tokens (estimated) 80 tokens

Cost as input · uncached $0.000160 USD

Cost as output · uncached $0.000400 USD

Cost as cached input $0.000160 USD
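The sample figures above can be reproduced with a simple chars-per-token estimate. In the sketch below, the 5.1 ratio is an assumption inferred from this one sample (408 characters to 80 tokens), not the site's published calibration constant, and the helper names are hypothetical.

```python
# Assumption: ratio inferred from the sample above (408 chars -> 80 tokens).
CHARS_PER_TOKEN = 5.1


def estimate_tokens(text: str) -> int:
    """Approximate token count from character length."""
    return round(len(text) / CHARS_PER_TOKEN)


def cost_usd(tokens: int, rate_per_m: float) -> float:
    """Cost in USD of `tokens` at a USD-per-1M-token rate."""
    return tokens * rate_per_m / 1_000_000


sample = "x" * 408  # stand-in for a 408-character passage
tokens = estimate_tokens(sample)
print(tokens)                             # 80
print(f"{cost_usd(tokens, 2.00):.6f}")    # 0.000160 as input
print(f"{cost_usd(tokens, 5.00):.6f}")    # 0.000400 as output
```

An estimate like this is only suitable for budgeting; exact billing counts still require the vendor's tokenizer.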

§ 04 / SHELF

Up against the shelf.


Model	Input /M	Output /M	Effective blended	Context	Best for
Magistral Medium (current)	$2.00	$5.00	$2.24 agentic 92/8	128K	Transparent reasoning
Mistral Large 3	$0.50	$1.50	$0.58 cheaper	256K	Frontier multimodal work
Mistral Medium 3.5	$1.50	$7.50	$1.98 cheaper	256K	General Mistral workloads
Mistral Small 4	$0.15	$0.60	$0.19 cheaper	256K	Low-cost chat and support
Codestral	$0.30	$0.90	$0.35 cheaper	256K	Code generation and agents
Devstral 2	$0.40	$2.00	$0.53 cheaper	256K	Code generation and agents
DeepSeek V4 Flash	$0.14 cache $0.00	$0.28	$0.05 cheaper	1M	Budget coding and bulk reasoning

Frequently asked.

Practical pricing questions, with archived prices separated from current availability.

Q · 01 What is Magistral Medium priced at?

Magistral Medium is shown here at $2/M input and $5/M output. The current official model card shows the same headline rate.

Q · 02 Is Magistral Medium still available?

Magistral Medium is listed as an active Mistral model today. The page keeps the live price and calculator assumptions together for invoice checks.

Q · 03 Does Mistral list prompt caching for this model?

No separate prompt-cache SKU is published for this row. The calculator therefore treats cache-hit input as the base input rate, $2/M, so the effective blend does not assume a hidden discount.

Q · 04 What does the effective price mean?

The effective tile uses the site-wide agentic blend: 92% input tokens, 8% output tokens, and no cache discount unless a vendor lists one. For Magistral Medium, that gives $2.24/M.
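The blend can be written out as a small function. This is a sketch under stated assumptions: the 92/8 split and the 82% cache-hit figure come from this page, but the exact way the site applies the cache rate is assumed here (a cache-hit share of input billed at the cached rate, which falls back to the base input rate when no cache SKU is listed). `effective_rate` is an illustrative name.

```python
from typing import Optional


def effective_rate(
    input_per_m: float,
    output_per_m: float,
    cached_input_per_m: Optional[float] = None,
    input_share: float = 0.92,
    cache_hit_rate: float = 0.82,
) -> float:
    """Blended USD per 1M tokens for a fixed input/output split."""
    if cached_input_per_m is None:
        # No cache SKU listed: cached input is billed at the base input rate.
        cached_input_per_m = input_per_m
    blended_input = (
        (1 - cache_hit_rate) * input_per_m + cache_hit_rate * cached_input_per_m
    )
    return input_share * blended_input + (1 - input_share) * output_per_m


# Magistral Medium: 0.92 * $2.00 + 0.08 * $5.00
print(f"{effective_rate(2.00, 5.00):.2f}")  # 2.24
# Mistral Large 3 row from the comparison table
print(f"{effective_rate(0.50, 1.50):.2f}")  # 0.58
```

Because the cached rate defaults to the base rate, the cache-hit assumption has no effect on rows without a cache SKU, which matches the "no hidden discount" behavior described above.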

Q · 05 Where does the archive price come from?

For active rows, the price comes from Mistral's current pricing or model card. For retired rows, AI//COST preserves the verified snapshot price and uses Mistral's current docs to verify deprecation, retirement, and replacement status.

Q · 06 Is there a batch discount?

Mistral documents batching in its API docs, but no separate per-model batch rate is published for this row. The calculator keeps the baseline token prices visible and avoids inventing a discount.

Reviewed by Yaroslav Vikhariev, Founder, AI//COST · Pricing pulled daily from mistral.ai · Last verified July 11, 2026
