MiniMax M2-her API Pricing
MiniMax M2-her is MiniMax's dialogue and role-play M2 variant. The official pricing page lists $0.3/M input, $1.2/M output, with no cache-read price published for this row. Pulled directly from platform.minimax.io daily.
Run the numbers.
Live calculator pre-loaded with current MiniMax M2-her rates. Tweak workload split and cache hit rate, then share the URL to share the calculation.
Real-world presets.
Character conversation
Story arc session
NPC reply batch
Persona support chat
Price history.
Paste text. See tokens. See cost.
This is a chars-per-token approximation, not a real tokenizer. Actual tokens vary by language, code density, and tool-call overhead — counts are typically ±10–20% off for English prose, more for code or non-Latin scripts. For exact billing, use the vendor's official tokenizer.
| Model | Input /M | Output /M | Effective blended | Context | Best for |
|---|---|---|---|---|---|
| MiniMax M2-her Current | $0.30 | $1.20 | $0.37 agentic 92/8 | 64K | Role-play and long dialogue |
| MiniMax M2 | $0.30 cache $0.03 | $1.20 | $0.17 cheaper MiniMax sibling | 205K | Original M2 coding agents |
| MiniMax M2.1 | $0.30 cache $0.03 | $1.20 | $0.17 cheaper MiniMax sibling | 205K | Stable coding and office agents |
| MiniMax M2.5 | $0.30 cache $0.03 | $1.20 | $0.17 cheaper MiniMax sibling | 205K | Value-first MiniMax agent loops |
| MiniMax M2.7 | $0.30 cache $0.06 | $1.20 | $0.19 cheaper MiniMax sibling | 205K | MiniMax flagship agent loops |
| GPT-5.4 mini | $0.75 cache $0.07 | $4.50 | $0.54 verified shelf sibling | 400K | OpenAI subagent workloads |
| DeepSeek V4 Pro | $0.43 cache $0.00 | $0.87 | $0.14 verified shelf sibling | 1M | Low-cost reasoning workloads |
| Gemini 2.5 Flash | $0.30 cache $0.03 | $2.50 | $0.27 verified shelf sibling | 1M | Global multimodal budget workloads |
Frequently asked.
Practical MiniMax M2-her pricing questions, with live MiniMax list rates separated from workload assumptions.
Q · 01 What is the standard MiniMax M2-her API price? +
MiniMax-M2-her at $0.3/M input and $1.2/M output. No cache-read price is published for this row. AI//COST stores those direct USD list prices without currency conversion.Q · 02 Does MiniMax publish prompt caching for this model? +
MiniMax M2-her on the pay-as-you-go page. This page therefore uses the base input price for cached-token math and labels cache as not available.Q · 03 What context window does MiniMax M2-her support? +
M2-her with a 64 K context window for role-play and multi-turn dialogue. This is smaller than the 204,800-token context listed for the coding-oriented M2 line.Q · 04 Is there a Batch API discount? +
MiniMax M2-her. Treat the public $0.3/M input and $1.2/M output rates as the default unless your account has a negotiated enterprise agreement.Q · 05 When was this price last checked? +
May 23, 2026. The same page currently lists the active text rows for M2.7, M2.5, M2.1, M2, and M2-her.Q · 06 How accurate is the tokenizer estimate? +
minimax-tokenizer-estimate chars-per-token approximation for English planning. Real billing depends on MiniMax server-side tokenization and can differ for Chinese, code, and mixed-language prompts.