Last verified
MEDICAL REASONING32K CONTEXTTEXT ONLYSPLIT IO PRICINGMEDICAL SEARCH SURCHARGE

Baichuan-M2-Plus API Pricing

Baichuan-M2-Plus is the earlier medical reasoning row that still carries the same live price as Baichuan-M3. The official pricing page lists $1.41/M input and $4.23/M output, converted from 0.01/0.03 yuan per 1K tokens at 7.10 CNY/USD. Baichuan also notes an additional 0.03 yuan per-call medical-search charge for this model. Pulled directly from platform.baichuan-ai.com daily.

Input - per 1M tokens
$1.41/M
Source Baichuan same as M3
Output - per 1M tokens
$4.23/M
Medical tuned row same as M3
Cached input - no separate discount
$1.41/M
Cache not listed 0%
Effective - agentic blend
$1.63/M
92/8 split - no cache discount
§ 01 / TERMINAL

Run the numbers.

Live calculator pre-loaded with current Baichuan-M2-Plus token rates. Use it for token math, then add Baichuan's per-call medical-search fee separately if your workflow triggers it.

$ /mo
Workload split
Prompt cache hit rate
Tokens you can process
Words equivalent (English)
Effective rate
§ 02 / SCENARIOS

Real-world presets.

§ 03 / TAPE

Price history.

Baichuan-M2-Plus still matches Baichuan-M3 at $1.41/M input and $4.23/M output in our live snapshots.

Input · $1.4/M
Output · $4.2/M
Cached · $1.4/M
MAY 18 First AI//COST verified snapshot stored $1.41/M input and $4.23/M outputMAY 23 Live verification kept $1.41/M input and $4.23/M output
§ 04 / TOKENIZER

Paste text. See tokens. See cost.

Estimate · baichuan-tokenizer-estimate · ≈3.85 chars/token Auto-counts as you type

This is a chars-per-token approximation, not a real tokenizer. Actual tokens vary by language, code density, and tool-call overhead — counts are typically ±10–20% off for English prose, more for code or non-Latin scripts. For exact billing, use the vendor's official tokenizer.

Characters
Words
Tokens (estimated)
Cost as input · uncached
Cost as output · uncached
Cost as cached input
§ 05 / SHELF

Up against the shelf.

All models →
Model Input /M Output /M Effective blended Context Best for
Baichuan-M2-Plus Current $1.41 cache $1.41 $4.22 $1.63 agentic 92/8 32K Legacy medical copilots
Baichuan-M3 $1.41 cache $1.41 $4.22 $1.63 same list price 32K Higher-depth medical reasoning
Baichuan-M2 $0.28 cache $0.28 $2.82 $0.48 cheaper 32K Budget Chinese text workloads
Baichuan3-Turbo $1.69 cache $1.69 $1.69 $1.69 slightly pricier 32K Balanced legacy production traffic
Baichuan4 Air $0.14 cache $0.14 $0.14 $0.14 cheaper 32K Lowest-cost Baichuan API traffic
GLM-5 $1.00 cache $0.20 $3.20 $0.57 cheaper 200K Chinese coding and agent tasks
Gemini 2.5 Flash $0.30 cache $0.03 $2.50 $0.27 cheaper 1M Global multimodal budget workloads

Frequently asked.

Practical pricing questions for Baichuan-M2-Plus, especially around the pricing parity with M3 and the extra medical-search charge.

Q · 01 What is Baichuan-M2-Plus priced at? +
Baichuan's official pricing page lists Baichuan-M2-Plus at about $1.41/M input and $4.23/M output. Those USD figures come from 0.01/0.03 yuan per 1K tokens converted at 7.10 CNY/USD.
Q · 02 Why does Baichuan-M2-Plus cost the same as Baichuan-M3? +
On the current public table, both rows carry the same token prices: $1.41/M input and $4.23/M output. That means M2-Plus no longer offers a cheaper on-ramp relative to M3, so the decision is about model behavior and integration history rather than list price.
Q · 03 What is the medical-search surcharge? +
Baichuan notes that Baichuan-M2-Plus can automatically trigger a medical-search service billed separately at 0.03 yuan per call. At 7.10 CNY/USD, that is roughly $0.0042 per triggered call and sits outside the token quote board.
Q · 04 Does Baichuan-M2-Plus have prompt-cache pricing? +
No separate cache-hit discount is listed for Baichuan-M2-Plus on the public pricing page. AI//COST therefore treats cached input as the same rate as normal input instead of inventing another billing tier.
Q · 05 How does it compare with Baichuan-M2? +
Baichuan-M2 is much cheaper at about $0.282/M input and $2.817/M output. On the standard 92/8 blend, Baichuan-M2-Plus lands around $1.63/M versus about $0.48/M for Baichuan-M2.
Q · 06 How accurate is the tokenizer estimate? +
The browser widget uses a baichuan-tokenizer-estimate chars-per-token approximation for English planning. Real billing comes from Baichuan's API usage counters and can differ for Chinese, code, or mixed-language prompts.