Last verified 2026-07-11

LEGACY CODING · 205K CONTEXT · TEXT ONLY · PROMPT CACHING · 60 TPS TARGET

MiniMax M2.1 API Pricing

MiniMax M2.1 is MiniMax's stable M2.1 coding snapshot. The official pricing page lists $0.30/M input, $1.20/M output, and $0.03/M cached input. Prices are pulled daily from platform.minimax.io.

Input - per 1M tokens

$0.30/M

Direct USD vendor row · base tier

Output - per 1M tokens

$1.20/M

Direct USD vendor row · 4x input

Cached input - prompt cache read

$0.03/M

Cache read is 90% off input · cache write $0.375/M

Effective - agentic blend

$0.17/M

92/8 input/output split · 82% cache hit rate
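The $0.17/M figure can be reproduced from the list prices above. A minimal sketch, assuming the blend is a token-weighted average in which 92% of tokens are input, 8% are output, and 82% of input tokens are billed at the cache-read rate. The function name and parameters are illustrative, not part of any MiniMax or AI//COST API, and cache-write charges are ignored:

```python
def blended_rate(input_price, output_price, cache_read_price,
                 input_share=0.92, cache_hit_rate=0.82):
    """Return an effective USD price per 1M tokens for a mixed workload."""
    # Input tokens are split between cache hits and uncached reads.
    effective_input = (cache_hit_rate * cache_read_price
                       + (1 - cache_hit_rate) * input_price)
    # Weight input and output by their share of total tokens.
    return input_share * effective_input + (1 - input_share) * output_price


rate = blended_rate(input_price=0.30, output_price=1.20, cache_read_price=0.03)
print(f"${rate:.2f}/M")  # $0.17/M
```

With the same weights, the M2.1 Highspeed list prices ($0.60 input, $2.40 output, $0.03 cache read) come out at about $0.31/M.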

§ 01 / TERMINAL

Run the numbers.

Live calculator pre-loaded with current MiniMax M2.1 rates. Adjust the workload split and cache hit rate, then share the URL to pass the calculation along.
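As a rough illustration of the kind of arithmetic such a calculator performs, a monthly budget divided by an effective rate gives the token volume it buys. This is a sketch under stated assumptions: the 92/8 split and the list prices are the ones quoted on this page, the cache hit rates are example inputs, and the function names are hypothetical rather than the calculator's actual code.

```python
def effective_rate(input_share, cache_hit_rate,
                   input_price=0.30, output_price=1.20, cache_read_price=0.03):
    """Effective USD per 1M tokens for a given workload split and cache hit rate."""
    effective_input = (cache_hit_rate * cache_read_price
                       + (1 - cache_hit_rate) * input_price)
    return input_share * effective_input + (1 - input_share) * output_price


def tokens_for_budget(budget_usd, rate_per_million):
    """Total tokens a budget buys at the given effective rate."""
    return budget_usd / rate_per_million * 1_000_000


# Show how the cache hit rate changes what $100/month buys at a 92/8 split.
for hit_rate in (0.0, 0.5, 0.82):
    rate = effective_rate(input_share=0.92, cache_hit_rate=hit_rate)
    tokens = tokens_for_budget(100, rate)
    print(f"cache {hit_rate:.0%}: ${rate:.3f}/M, about {tokens / 1e6:.0f}M tokens")
```

At the 82% hit rate this works out to roughly 594M tokens for $100; with no cache hits the same budget buys noticeably fewer.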

§ 02 / SCENARIOS

Real-world presets.

CODING AGENT

Repo repair task

$0.020/task

60k in / 2k out · ~4,901 units/$100

OFFICE

Analyst memo draft

$0.016/memo

40k in / 3k out · ~6,410 units/$100

Web research brief

$0.011/brief

25k in / 3k out · ~9,009 units/$100

CHATBOT

Product support turn

$0.003/turn

6k in / 1k out · ~33,333 units/$100
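The preset figures above are consistent with plain list-price arithmetic with no cache discount applied. A minimal sketch that reproduces them. The helper name is illustrative, and rounding units down to whole tasks is an assumption that happens to match the displayed counts:

```python
import math

INPUT_PER_M = 0.30   # USD per 1M input tokens (list price)
OUTPUT_PER_M = 1.20  # USD per 1M output tokens (list price)


def task_cost(input_tokens, output_tokens):
    """Uncached cost in USD for one task at list prices."""
    return (input_tokens * INPUT_PER_M + output_tokens * OUTPUT_PER_M) / 1_000_000


presets = {
    "Repo repair task": (60_000, 2_000),
    "Analyst memo draft": (40_000, 3_000),
    "Web research brief": (25_000, 3_000),
    "Product support turn": (6_000, 1_000),
}

for name, (tokens_in, tokens_out) in presets.items():
    cost = task_cost(tokens_in, tokens_out)
    units = math.floor(100 / cost)  # whole tasks that fit in $100
    print(f"{name}: ${cost:.4f} each, ~{units:,} per $100")
```

Because these presets bill every input token at the full $0.30/M, workloads with a high cache hit rate would land below the per-task prices shown.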

§ 03 / TOKENIZER

Paste text. See tokens. See cost.

Counts use a chars-per-token calibration measured on the vendor's own published tokenizer (MiniMaxAI/MiniMax-M2.1, 2026-06-10). English prose is typically within a few percent; code and non-Latin scripts tokenize heavier. For billing-exact counts use the vendor's count-tokens API.

Characters: 429

Words: 70

Tokens (estimated): 80

Cost as input · uncached: $0.000024 USD

Cost as output · uncached: $0.000096 USD

Cost as cached input: $0.000002 USD
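The three cost lines follow directly from the estimated token count and the list prices. A sketch of that arithmetic. The chars-per-token ratio below is back-derived from this one sample (429 characters, 80 tokens) purely for illustration and is not the widget's published calibration, and the function names are hypothetical:

```python
# USD per 1M tokens, taken from the list prices on this page.
PRICES_PER_M = {"input": 0.30, "output": 1.20, "cached_input": 0.03}

# Hypothetical ratio inferred from the sample above, not an official constant.
SAMPLE_CHARS_PER_TOKEN = 429 / 80


def estimate_tokens(char_count, chars_per_token=SAMPLE_CHARS_PER_TOKEN):
    """Approximate a token count from a character count."""
    return round(char_count / chars_per_token)


def cost_usd(tokens, kind):
    """Cost of `tokens` billed at the given price kind."""
    return tokens * PRICES_PER_M[kind] / 1_000_000


tokens = estimate_tokens(429)
for kind in PRICES_PER_M:
    print(f"{kind}: ${cost_usd(tokens, kind):.6f}")
```

A single ratio like this is only a planning aid; code and non-Latin scripts would need a different ratio or the real tokenizer.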

§ 04 / SHELF

Up against the shelf.

Model	Input /M	Cache read /M	Output /M	Effective blended /M	Context	Best for	Note
MiniMax M2.1 (current)	$0.30	$0.03	$1.20	$0.17	205K	Stable coding and office agents	Agentic 92/8 blend
MiniMax M2.5	$0.30	$0.03	$1.20	$0.17	205K	Value-first MiniMax agent loops	Same effective tier
MiniMax M2.1 Highspeed	$0.60	$0.03	$2.40	$0.31	205K	Lower-latency M2.1 agents	Pricier MiniMax sibling
MiniMax M2	$0.30	$0.03	$1.20	$0.17	205K	Original M2 coding agents	Same effective tier
MiniMax M2-her	$0.30	not listed	$1.20	$0.37	64K	Role-play and long dialogue	Pricier MiniMax sibling
MiniMax M2.7	$0.30	$0.06	$1.20	$0.19	205K	MiniMax flagship agent loops	Pricier MiniMax sibling
GPT-5.4 mini	$0.75	$0.07	$4.50	$0.54	400K	OpenAI subagent workloads	Verified shelf sibling
DeepSeek V4 Pro	$0.43	$0.00	$0.87	$0.14	1M	Low-cost reasoning workloads	Verified shelf sibling

Frequently asked.

Practical MiniMax M2.1 pricing questions, with live MiniMax list rates separated from workload assumptions.

Q · 01 What is the standard MiniMax M2.1 API price?

MiniMax's official pay-as-you-go pricing page lists MiniMax-M2.1 at $0.30/M input and $1.20/M output, with cache reads at $0.03/M. AI//COST stores those direct USD list prices without currency conversion.

Q · 02 Does MiniMax publish prompt caching for this model?

MiniMax lists prompt-cache reads at $0.03/M and cache writes at $0.375/M for MiniMax M2.1. The quote board uses the read price because repeated cache hits drive recurring workload cost.
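To see why the read price dominates, compare the cost of a shared prompt prefix with and without caching. This is a sketch under simplifying assumptions that the source does not confirm: the first request pays the cache-write rate on the prefix, every later request pays the cache-read rate, and cache expiry is ignored. The function names and the 50,000-token prefix are illustrative.

```python
INPUT_PER_M = 0.30         # USD per 1M uncached input tokens
CACHE_READ_PER_M = 0.03    # USD per 1M cache-read tokens
CACHE_WRITE_PER_M = 0.375  # USD per 1M cache-write tokens


def prefix_cost_uncached(prefix_tokens, requests):
    """Every request pays the full input price on the prefix."""
    return prefix_tokens * requests * INPUT_PER_M / 1_000_000


def prefix_cost_cached(prefix_tokens, requests):
    """Assumed model: one cache write, then cache reads on later requests."""
    if requests == 0:
        return 0.0
    write = prefix_tokens * CACHE_WRITE_PER_M
    reads = prefix_tokens * (requests - 1) * CACHE_READ_PER_M
    return (write + reads) / 1_000_000


for n in (1, 2, 10, 100):
    plain = prefix_cost_uncached(50_000, n)
    cached = prefix_cost_cached(50_000, n)
    print(f"{n:>3} requests: uncached ${plain:.4f}, cached ${cached:.4f}")
```

Under these assumptions caching costs slightly more on a single request and is already cheaper by the second, after which nearly all prefix spend is billed at the read rate.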

Q · 03 What context window does MiniMax M2.1 support?

MiniMax's text-generation docs list 204,800 tokens of context for this M2-family text model. This page rounds that to 205K for display consistency.

Q · 04 Is there a Batch API discount?

MiniMax's current pay-as-you-go table does not publish a separate Batch API discount for MiniMax M2.1. Treat the public $0.30/M input and $1.20/M output rates as the default unless your account has a negotiated enterprise agreement.

Q · 05 When was this price last checked?

This page was verified against MiniMax's official pay-as-you-go pricing page on May 23, 2026. The same page currently lists the active text rows for M2.7, M2.5, M2.1, M2, and M2-her.

Q · 06 How accurate is the tokenizer estimate?

The browser widget uses a chars-per-token approximation (minimax-tokenizer-estimate) intended for English-language planning. Real billing depends on MiniMax's server-side tokenization and can differ for Chinese, code, and mixed-language prompts.

Reviewed by Yaroslav Vikhariev, Founder, AI//COST · Pricing pulled daily from platform.minimax.io · Last verified July 11, 2026
