Kimi API Pricing

Kimi K2 (0711 Preview)

Kimi K2 (0711 Preview) archive row.

131K ctx · text-only →

Kimi K2 Turbo Preview

Kimi K2 Turbo Preview archive row.

Kimi K2 Thinking

Kimi K2 Thinking archive row.

Kimi K2 Thinking Turbo

Kimi K2 Thinking Turbo archive row.

§ 02 / SHELF

All side-by-side.

Methodology →

Model	Input /M	Output /M	Cached	Context	Max output	Vision	Tools	Tier
Kimi K2.7 Code FLAGSHIP	$0.95	$4.00	$0.19−80%	262K	—	✓	✓	Active
Kimi K2.6	$0.95	$4.00	$0.16−83%	262K	—	✓	✓	Frontier
Kimi K2.5	$0.6	$3.00	$0.1−83%	262K	—	✓	✓	Mid
Moonshot V1 128K Vision Preview PREVIEW	$2.00	$5.00	—	131K	—	✓	✓	Preview
Moonshot V1 32K Vision Preview PREVIEW	$1.00	$3.00	—	32K	—	✓	✓	Preview
Moonshot V1 8K Vision Preview PREVIEW	$0.2	$2.00	—	8K	—	✓	✓	Preview
Moonshot V1 (128K) LEGACY	$2.00	$5.00	—	131K	—	✗	✓	Legacy
Moonshot V1 (32K) LEGACY	$1.00	$3.00	—	32K	—	✗	✓	Legacy
Moonshot V1 (8K) LEGACY	$0.2	$2.00	—	8K	—	✗	✓	Legacy
Kimi K2 (0905 Preview) RETIRED	—	—	—	Retired May 25, 2026	→ kimi-k2-5	✗	✗	Retired
Kimi K2 (0711 Preview) RETIRED	—	—	—	Retired May 25, 2026	→ kimi-k2-5	✗	✗	Retired
Kimi K2 Turbo Preview RETIRED	—	—	—	Retired May 25, 2026	→ kimi-k2-6	✗	✗	Retired
Kimi K2 Thinking RETIRED	—	—	—	Retired May 25, 2026	→ kimi-k2-5	✗	✗	Retired
Kimi K2 Thinking Turbo RETIRED	—	—	—	Retired May 25, 2026	→ kimi-k2-6	✗	✗	Retired

§ 03 / PRICE CURVE

Pricing across the lineup.

How Moonshot (Kimi) priced 3 models · JAN 26 → JUN 26.

oldest → newest →

Input · newest $0.95/M

Output · newest $4/M

Each point is a model at its listed $/M price.

The Kimi story so far.

Releases and funding from Moonshot's Kimi line. Sourced from Moonshot's GitHub/model cards, TechCrunch, and Wikipedia — verified at publication.

JUN 14 - 2026

Kimi K2.7 Code pricing verified - coding-specialized Kimi model at $0.95/$4.00 with $0.19 cache-hit input

PRICING

MAY 07 · 2026

$2B raised at a $20B valuation — led by Meituan's Long-Z, China's top-funded LLM startup

CORPORATE

APR 20 · 2026

Kimi K2.6 released — 1T-param open-weight MoE, native multimodal, four variants up to 300-agent swarms

RELEASE

MAY 25 · 2026

K2 preview series retires — dated K2 0711/0905, Turbo, and Thinking previews fold into K2.5 / K2.6

PRICING

JAN 27 · 2026

Kimi K2.5 released — multimodal upgrade adding a MoonViT vision encoder

RELEASE

JUL · 2025

Kimi K2 open-weighted — 1T-param MoE (32B active) trained with the MuonClip optimizer, released on Hugging Face

RELEASE

FEB · 2024

Alibaba-led $1B round — early backing at a $2.5B valuation

CORPORATE

OCT · 2023

Kimi assistant launched — famous for processing ~200,000 Chinese characters per conversation

RELEASE

§ 04 / ACCESS

Where to get it.

Methodology →

PRIMARY · DIRECT

Kimi Platform

platform.kimi.ai — direct API with an OpenAI-compatible endpoint and automatic context caching (cache-hit input is a fraction of the base rate). Billed per token.

Console + API →

OPEN-WEIGHT · SELF-HOST

Hugging Face

Kimi K2 weights — a 1-trillion-parameter MoE — are published on Hugging Face under a Modified MIT license, fully self-hostable with no per-token fee.

1T-param MoE →

CONSUMER · APP

Kimi

The Kimi assistant (kimi.com) — a free long-context chat app that made its name handling very long documents. For end-user usage; no API on this surface.

Free web + mobile →

§ 04 / BEST FOR

Which Moonshot (Kimi) for what.

More scenarios →

If you need a frontier agentic multimodal model…

Kimi K2.6

If you want cheaper multimodal at near-flagship quality…

Kimi K2.5

If you must self-host an open-weight 1T MoE…

Kimi K2 (HF)

Hugging Face →

If you process very long documents…

Kimi K2.6 (262K)

If you need a cheap short-context legacy tier…

Moonshot V1 8K