MiniMax M3 API Pricing
MiniMax M3 is MiniMax's newest frontier model for coding, agents, multimodal input, and long-context tasks. The official pricing page lists the standard ≤512K row at $0.30/M input, $1.20/M output, and $0.06/M cached input; >512K and Priority rows cost more.
Run the numbers.
Live calculator pre-loaded with MiniMax M3 standard ≤512K rates. MiniMax also lists long-context >512K and Priority service-tier rows; use those for very long or high-priority traffic.
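The arithmetic behind a calculator like this can be sketched in a few lines of Python. This is a minimal illustration, not the page's actual calculator: the names `Rates`, `STANDARD_LE_512K`, and `request_cost` are hypothetical, and the sketch assumes that cached input tokens are billed at the cached rate instead of the normal input rate. Check the vendor's billing documentation before relying on that assumption.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Rates:
    """USD per one million tokens (hypothetical container for list rates)."""
    input_per_m: float
    output_per_m: float
    cached_per_m: float


# Standard <=512K row as quoted on this page.
STANDARD_LE_512K = Rates(input_per_m=0.30, output_per_m=1.20, cached_per_m=0.06)


def request_cost(input_tokens: int, output_tokens: int,
                 cached_tokens: int = 0,
                 rates: Rates = STANDARD_LE_512K) -> float:
    """Estimate the USD cost of one request.

    Assumption: `cached_tokens` is the portion of `input_tokens` served
    from the prompt cache and is billed at the cached rate only.
    """
    if min(input_tokens, output_tokens, cached_tokens) < 0:
        raise ValueError("token counts must be non-negative")
    if cached_tokens > input_tokens:
        raise ValueError("cached_tokens cannot exceed input_tokens")
    fresh_tokens = input_tokens - cached_tokens
    total = (fresh_tokens * rates.input_per_m
             + cached_tokens * rates.cached_per_m
             + output_tokens * rates.output_per_m)
    return total / 1_000_000
```

For example, `request_cost(100_000, 10_000, cached_tokens=50_000)` prices 50K fresh input tokens, 50K cached input tokens, and 10K output tokens at the standard row, which comes to about $0.03.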
Real-world presets.
Repo repair task
Analyst memo draft
Web research brief
Product support turn
Price history.
Paste text. See tokens. See cost.
Counts use a chars-per-token calibration measured on the vendor's own published tokenizer (MiniMaxAI/MiniMax-M2.1, 2026-06-10). English prose is typically within a few percent; code and non-Latin scripts tokenize heavier. For billing-exact counts use the vendor's count-tokens API.
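A chars-per-token estimate of this kind can be sketched as follows. The page does not publish its calibration value, so `ASSUMED_CHARS_PER_TOKEN = 4.0` is a placeholder assumption, and the function names are hypothetical. Treat the result as a rough estimate only; code and non-Latin scripts will usually need a lower ratio.

```python
import math

# Placeholder assumption: the page's real calibration value is not published.
ASSUMED_CHARS_PER_TOKEN = 4.0

# Standard <=512K input rate quoted on this page, in USD per million tokens.
STANDARD_INPUT_RATE_PER_M = 0.30


def estimate_tokens(text: str,
                    chars_per_token: float = ASSUMED_CHARS_PER_TOKEN) -> int:
    """Estimate a token count from character length, rounding up."""
    if chars_per_token <= 0:
        raise ValueError("chars_per_token must be positive")
    return math.ceil(len(text) / chars_per_token)


def estimate_input_cost(text: str,
                        rate_per_m: float = STANDARD_INPUT_RATE_PER_M) -> float:
    """Estimate the USD input cost of sending `text` as a prompt."""
    return estimate_tokens(text) * rate_per_m / 1_000_000
```

Rounding up keeps the estimate conservative for short inputs; for billing-exact numbers the vendor's token counting remains the reference.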
| Model | Input /M | Cached input /M | Output /M | Effective blended /M | Context | Best for |
|---|---|---|---|---|---|---|
| MiniMax M3 (current) | $0.30 | $0.06 | $1.20 | $0.19 (standard ≤512K) | 1M | MiniMax frontier coding and agent loops |
| MiniMax M2.7 | $0.30 | $0.06 | $1.20 | $0.19 (same standard price) | 205K | Previous MiniMax flagship loops |
| MiniMax M2.7 Highspeed | $0.60 | $0.06 | $2.40 | $0.34 (pricier MiniMax sibling) | 205K | Faster MiniMax flagship loops |
| DeepSeek V4 Pro | $0.43 | $0.00 | $0.87 | $0.14 (verified shelf sibling) | 1M | Low-cost reasoning workloads |
| Gemini 2.5 Flash | $0.30 | $0.03 | $2.50 | $0.27 (verified shelf sibling) | 1M | Global multimodal budget workloads |
| GPT-5.4 mini | $0.75 | $0.07 | $4.50 | $0.54 (verified shelf sibling) | 400K | OpenAI subagent workloads |
Frequently asked.
Practical MiniMax M3 pricing questions, with live MiniMax list rates separated from workload assumptions.
Q · 01 What is the standard MiniMax M3 API price?
MiniMax-M3 is listed at $0.30/M input, $1.20/M output, and $0.06/M prompt-cache read for the standard ≤512K input-token row.
Q · 02 What happens above 512K input tokens?
The rates rise to $0.60/M input, $2.40/M output, and $0.12/M cache read. The pricing page says input tokens above 512K are available in limited quantity for a limited time, with public availability expected soon.
Q · 03 Does MiniMax M3 have Priority pricing?
Yes. Priority is listed at $0.45/M input, $1.80/M output, and $0.09/M cache read for ≤512K, and $0.90/M input, $3.60/M output, and $0.18/M cache read above 512K.
Q · 04 What context window does MiniMax M3 support?
1,000,000 tokens for MiniMax M3. The model page says the API supports up to a 1M-token context window with a guaranteed minimum of 512K tokens.
Q · 05 Is MiniMax M3 multimodal?
MiniMax M3 is described as a model for multimodal input, alongside coding, agents, and long-context tasks.
Q · 06 When was this price last checked?
2026-06-08.
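The four rate rows quoted in Q · 01 through Q · 03 can be collected into a small lookup. The sketch below is illustrative only and rests on three assumptions: that "512K" means 512,000 tokens (it could instead mean 524,288), that the whole request is billed at the row selected by its input-token count, and that cached tokens are billed at the cache-read rate in place of the input rate. The names `RATE_ROWS`, `select_rates`, and `request_cost` are hypothetical.

```python
from typing import NamedTuple


class Rates(NamedTuple):
    """USD per one million tokens."""
    input_per_m: float
    output_per_m: float
    cache_read_per_m: float


# List rates as quoted in the FAQ above.
RATE_ROWS = {
    ("standard", "le_512k"): Rates(0.30, 1.20, 0.06),
    ("standard", "gt_512k"): Rates(0.60, 2.40, 0.12),
    ("priority", "le_512k"): Rates(0.45, 1.80, 0.09),
    ("priority", "gt_512k"): Rates(0.90, 3.60, 0.18),
}

# Assumption: 512K is read as 512,000 tokens.
LONG_CONTEXT_THRESHOLD = 512_000


def select_rates(input_tokens: int, tier: str = "standard") -> Rates:
    """Pick the rate row from the service tier and input-token count."""
    if tier not in ("standard", "priority"):
        raise ValueError(f"unknown tier: {tier!r}")
    if input_tokens < 0:
        raise ValueError("input_tokens must be non-negative")
    band = "le_512k" if input_tokens <= LONG_CONTEXT_THRESHOLD else "gt_512k"
    return RATE_ROWS[(tier, band)]


def request_cost(input_tokens: int, output_tokens: int,
                 cached_tokens: int = 0, tier: str = "standard") -> float:
    """Estimate USD cost; the whole request uses one row (assumption)."""
    if output_tokens < 0 or not 0 <= cached_tokens <= input_tokens:
        raise ValueError("invalid token counts")
    rates = select_rates(input_tokens, tier)
    fresh_tokens = input_tokens - cached_tokens
    total = (fresh_tokens * rates.input_per_m
             + cached_tokens * rates.cache_read_per_m
             + output_tokens * rates.output_per_m)
    return total / 1_000_000
```

Under these assumptions, a 600K-input, 10K-output request on the standard tier falls in the >512K row and costs about $0.384, while a 100K-input, 10K-output request on Priority costs about $0.063.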