Last verified 2026-06-08

M3 FRONTIER1M CONTEXTMULTIMODALPROMPT CACHINGSTANDARD + PRIORITY

MiniMax M3 API Pricing

Q: What is the standard MiniMax M3 API price?

MiniMax's official pay-as-you-go pricing page lists MiniMax-M3 at $0.30/M input, $1.20/M output, and $0.06/M prompt-cache read for the standard ≤512K input-token row.

Q: What happens above 512K input tokens?

MiniMax lists a separate >512K standard row at $0.60/M input, $2.40/M output, and $0.12/M cache read. The pricing page says input tokens above 512K are available in limited quantity for a limited time, with public availability expected soon.

Q: Does MiniMax M3 have Priority pricing?

Yes. The Priority service-tier rows list $0.45/M input, $1.80/M output, and $0.09/M cache read for ≤512K, and $0.90/M input, $3.60/M output, and $0.18/M cache read above 512K.

Q: What context window does MiniMax M3 support?

MiniMax's text-generation docs list 1,000,000 tokens for MiniMax M3. The model page says the API supports up to a 1M-token context window with a guaranteed minimum of 512K tokens.

MiniMax M3 is MiniMax's newest frontier model for coding, agents, multimodal input, and long-context tasks. The official pricing page lists the standard ≤512K row at $0.30/M input, $1.20/M output, and $0.06/M cached input; >512K and Priority rows cost more.

Input - per 1M tokens

$0.30/M

Standard ≤512K input row base tier

Output - per 1M tokens

$1.20/M

Standard ≤512K input row 4x input

Cached input - prompt cache read

$0.06/M

Cache read standard row 80% off

Effective - agentic blend

$0.19/M

92/8 split - 82% cache

§ 01 / TERMINAL

Run the numbers.

Live calculator pre-loaded with MiniMax M3 standard ≤512K rates. MiniMax also lists long-context >512K and Priority service-tier rows; use those for very long or high-priority traffic.

Spend

$ /mo

Workload split

Prompt cache hit rate

Tokens you can process

—

Words equivalent (English)

—

Effective rate

—

Open full calculator (all models · share URL · CSV) →

§ 02 / SCENARIOS

Real-world presets.

CODING AGENT

Repo repair task

$0.020/task

60k in / 2k out~4,901 units/$100

OFFICE

Analyst memo draft

$0.016/memo

40k in / 3k out~6,410 units/$100

Web research brief

$0.011/brief

25k in / 3k out~9,009 units/$100

CHATBOT

Product support turn

$0.003/turn

6k in / 1k out~33,333 units/$100

§ 03 / TOKENIZER

Paste text. See tokens. See cost.

Your text · live count

Calibrated · measured on the vendor's tokenizer · 2026-06-10 Auto-counts as you type

Counts use a chars-per-token calibration measured on the vendor's own published tokenizer (MiniMaxAI/MiniMax-M2.1, 2026-06-10). English prose is typically within a few percent; code and non-Latin scripts tokenize heavier. For billing-exact counts use the vendor's count-tokens API.

Characters 443

Words 70

Tokens (estimated) 83 tokens

Cost as input · uncached $0.00002 USD

Cost as output · uncached $0.0001 USD

Cost as cached input $0.000005 USD

§ 04 / SHELF

Up against the shelf.

All models →

Model	Input /M	Output /M	Effective blended	Context	Best for
MiniMax M3 Current	$0.30 cache $0.06	$1.20	$0.19 standard ≤512K	1M	MiniMax frontier coding and agent loops
MiniMax M2.7	$0.30 cache $0.06	$1.20	$0.19 same standard price	205K	Previous MiniMax flagship loops
MiniMax M2.7 Highspeed	$0.60 cache $0.06	$2.40	$0.34 pricier MiniMax sibling	205K	Faster MiniMax flagship loops
DeepSeek V4 Pro	$0.43 cache $0.0036	$0.87	$0.14 verified shelf sibling	1M	Low-cost reasoning workloads
Gemini 2.5 Flash	$0.30 cache $0.03	$2.50	$0.27 verified shelf sibling	1M	Global multimodal budget workloads
GPT-5.4 mini	$0.75 cache $0.07	$4.50	$0.54 verified shelf sibling	400K	OpenAI subagent workloads

Frequently asked.

Practical MiniMax M3 pricing questions, with live MiniMax list rates separated from workload assumptions.

Q · 01 What is the standard MiniMax M3 API price? +

MiniMax's official pay-as-you-go pricing page lists MiniMax-M3 at $0.30/M input, $1.20/M output, and $0.06/M prompt-cache read for the standard ≤512K input-token row.

Q · 02 What happens above 512K input tokens? +

MiniMax lists a separate >512K standard row at $0.60/M input, $2.40/M output, and $0.12/M cache read. The pricing page says input tokens above 512K are available in limited quantity for a limited time, with public availability expected soon.

Q · 03 Does MiniMax M3 have Priority pricing? +

Yes. The Priority service-tier rows list $0.45/M input, $1.80/M output, and $0.09/M cache read for ≤512K, and $0.90/M input, $3.60/M output, and $0.18/M cache read above 512K.

Q · 04 What context window does MiniMax M3 support? +

MiniMax's text-generation docs list 1,000,000 tokens for MiniMax M3. The model page says the API supports up to a 1M-token context window with a guaranteed minimum of 512K tokens.

Q · 05 Is MiniMax M3 multimodal? +

Yes. MiniMax describes M3 as native multimodal; the text-generation docs list multimodal chat input with text, image, and video content parts.

Q · 06 When was this price last checked? +

This page was verified against MiniMax's official pay-as-you-go pricing page on 2026-06-08.

Reviewed by Yaroslav Vikhariev Founder - AI//COST - Pricing pulled daily from platform.minimax.io - Last verified Jun 08, 2026

Methodology Report a correction More by Y.V.