Last verified 2026-05-18
EDGE FLAGSHIP · 128K CONTEXT · TEXT + VISION · OPEN WEIGHTS

Ministral 3 14B API Pricing

Ministral 3 14B is Mistral's current edge flagship. Baseline rates are $0.20/M input and $0.20/M output, verified against the official Mistral pricing page; no separate prompt-cache price is published for this row. Figures come from official Mistral sources and the project pricing snapshot.

Input - per 1M tokens
$0.20/M
Source: Mistral · flat
Output - per 1M tokens
$0.20/M
Context: 128K · flat
Cache - N/A
$0.20/M
Cache: no separate SKU listed
Effective - agentic blend
$0.20/M
92/8 split · 82% cache
§ 01 / TERMINAL

Run the numbers.

Live calculator pre-loaded with Ministral 3 14B rates. Tweak spend, output mix, or cache assumptions to compare it with current Mistral replacements.

Inputs: monthly spend ($/mo), workload split, prompt cache hit rate.
Outputs: tokens you can process, words equivalent (English), effective rate.
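
The calculator's arithmetic can be approximated offline. The sketch below is not the page's actual implementation: the function name `estimate`, the way the cache hit rate is folded into the input rate, and the 0.75 words-per-token ratio are all assumptions made for illustration. Only the $0.20/M rates and the 92/8 and 82% defaults come from this page.

```python
from dataclasses import dataclass

# Rates listed on this page, in USD per 1M tokens.
INPUT_RATE = 0.20
OUTPUT_RATE = 0.20
CACHED_RATE = 0.20  # no separate cache SKU is published, so cached input = input

# Assumption (not from the page): rough English words-per-token ratio.
WORDS_PER_TOKEN = 0.75


@dataclass
class Estimate:
    effective_rate: float  # USD per 1M blended tokens
    tokens: float          # tokens the budget covers
    words: float           # rough English word equivalent


def estimate(monthly_usd: float,
             input_share: float = 0.92,
             cache_hit_rate: float = 0.82) -> Estimate:
    """Convert a monthly budget into a token volume (hypothetical helper)."""
    if not 0.0 <= input_share <= 1.0:
        raise ValueError("input_share must be between 0 and 1")
    if not 0.0 <= cache_hit_rate <= 1.0:
        raise ValueError("cache_hit_rate must be between 0 and 1")

    # Assumed blend: cache hits pay the cached rate, misses pay the input rate.
    input_rate = cache_hit_rate * CACHED_RATE + (1 - cache_hit_rate) * INPUT_RATE
    effective = input_share * input_rate + (1 - input_share) * OUTPUT_RATE

    tokens = monthly_usd / effective * 1_000_000
    return Estimate(effective, tokens, tokens * WORDS_PER_TOKEN)


if __name__ == "__main__":
    result = estimate(100.0)
    print(f"effective rate: ${result.effective_rate:.2f}/M")
    print(f"tokens: {result.tokens:,.0f}")
    print(f"words (approx.): {result.words:,.0f}")
```

Because input, output, and cached input are all $0.20/M for this model, changing the workload split or cache hit rate does not move the effective rate; a $100 monthly budget covers roughly 500M tokens under any mix.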
§ 02 / SCENARIOS

Real-world presets.

§ 03 / TAPE

Price history.

Listed at $0.20/M input and $0.20/M output.

Input · $0.20/M
Output · $0.20/M
Cached · $0.20/M
JAN 01 · Launch price $0.20/M input and $0.20/M output
MAY 18 · Live verification kept $0.20/M input and $0.20/M output
§ 04 / TOKENIZER

Paste text. See tokens. See cost.

Estimate · mistral-tokenizer-estimate · ≈3.85 chars/token

This is a chars-per-token approximation, not a real tokenizer. Actual tokens vary by language, code density, and tool-call overhead — counts are typically ±10–20% off for English prose, more for code or non-Latin scripts. For exact billing, use the vendor's official tokenizer.

Outputs: characters, words, tokens (estimated), cost as uncached input, cost as uncached output, cost as cached input.
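
The chars-per-token estimate is simple enough to reproduce by hand. The sketch below assumes rounding to the nearest token and whitespace-based word counting; neither detail is stated on this page, and the function names are hypothetical. It shares the limits described above: it is an approximation, not a tokenizer.

```python
# Approximation quoted on this page; not a real tokenizer.
CHARS_PER_TOKEN = 3.85

# USD per 1M tokens. Input, output, and cached input are all listed at the
# same rate for this model, so one constant covers the three cost outputs.
RATE_PER_M = 0.20


def estimate_tokens(text: str) -> int:
    """Estimate the token count from character length.

    Assumption: round to the nearest whole token.
    """
    return round(len(text) / CHARS_PER_TOKEN)


def estimate_cost(text: str) -> dict:
    """Return the figures the estimator panel lists, for one piece of text."""
    tokens = estimate_tokens(text)
    return {
        "characters": len(text),
        "words": len(text.split()),  # assumption: whitespace-delimited words
        "tokens_estimated": tokens,
        "cost_usd": tokens * RATE_PER_M / 1_000_000,
    }


if __name__ == "__main__":
    sample = "Ministral 3 14B is Mistral's current edge flagship."
    for key, value in estimate_cost(sample).items():
        print(f"{key}: {value}")
```

For billing-grade numbers, swap `estimate_tokens` for a call to the vendor's official tokenizer and keep the cost arithmetic unchanged.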
§ 05 / SHELF

Up against the shelf.

| Model | Input /M | Output /M | Effective blended | Context | Best for |
| --- | --- | --- | --- | --- | --- |
| Ministral 3 14B (Current) | $0.20 | $0.20 | $0.20 (agentic 92/8) | 128K | Edge inference and compact agents |
| Ministral 3 8B | $0.15 | $0.15 | $0.15 (cheaper) | 128K | Edge inference and compact agents |
| Ministral 3 3B | $0.10 | $0.10 | $0.10 (cheaper) | 128K | Edge inference and compact agents |
| Mistral Small 4 | $0.15 | $0.60 | $0.19 (cheaper) | 128K | Low-cost chat and support |
| Codestral | $0.30 | $0.90 | $0.35 (pricier) | 128K | Coding and code agents |
| Mistral Large 3 | $0.50 | $1.50 | $0.58 (pricier) | 128K | General Mistral workloads |
| DeepSeek V4 Flash | $0.14 (cache $0.00) | $0.28 | $0.05 (cheaper) | 1M | Budget coding and bulk reasoning |

Frequently asked.

Practical pricing questions, with archive status separated from current pricing.

Q · 01 What is Ministral 3 14B priced at?
Ministral 3 14B is listed at $0.20/M input and $0.20/M output. This rate is the live baseline used in the calculator.
Q · 02 Is Ministral 3 14B still available?
Ministral 3 14B is active in the current Mistral lineup, with pricing checked against the vendor pricing page on 2026-05-18.
Q · 03 Does this model have prompt-cache pricing?
Mistral does not publish a separate cache-read SKU for this row. The calculator therefore sets cached input to $0.20/M, the same as normal input, and does not invent a discount.
Q · 04 How is the effective price calculated?
AI//COST uses the same agentic blend everywhere: 92% input, 8% output, and only published cache discounts. For Ministral 3 14B, input, output, and cached input are all $0.20/M, so the blend comes out to $0.20/M regardless of the split.
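
As a worked illustration of that blend, the sketch below uses one plausible formula: cache hits pay the cached rate, misses pay the input rate, and the result is weighted 92/8 against output. The exact formula AI//COST uses is not published on this page, so the function `effective_rate` and its parameter names are assumptions. With the shelf-table rates it gives the same two-decimal figures as that table for the Mistral rows that have no cache discount.

```python
from typing import Optional


def effective_rate(input_rate: float,
                   output_rate: float,
                   cached_rate: Optional[float] = None,
                   input_share: float = 0.92,
                   cache_hit_rate: float = 0.82) -> float:
    """Blended USD price per 1M tokens (hypothetical helper).

    If no cache price is published, cached input is billed at the normal
    input rate, so the cache hit rate has no effect on the result.
    """
    if cached_rate is None:
        cached_rate = input_rate
    blended_input = (cache_hit_rate * cached_rate
                     + (1 - cache_hit_rate) * input_rate)
    return input_share * blended_input + (1 - input_share) * output_rate


if __name__ == "__main__":
    # (input, output) rates as listed in the shelf table, USD per 1M tokens.
    shelf = {
        "Ministral 3 14B": (0.20, 0.20),
        "Mistral Small 4": (0.15, 0.60),
        "Codestral": (0.30, 0.90),
        "Mistral Large 3": (0.50, 1.50),
    }
    for model, (inp, out) in shelf.items():
        print(f"{model}: ${effective_rate(inp, out):.2f}/M")
```

Passing a published `cached_rate` lower than the input rate is the only way the blend drops below the plain 92/8 weighting, which matches the "only published cache discounts" rule.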
Q · 05 Why keep an archive page?
Archive pages are useful for invoice checks, migration planning, and comparing how model pricing compressed over time. They are clearly labeled so retired API rows do not look like current recommendations.
Q · 06 Are there regional surcharges?
The Mistral public pricing page is denominated in USD and EUR. This page stores the USD baseline; currency conversion and localized calculator variants are handled elsewhere on AI//COST.