Last verified 2026-06-08

RETIRED MAY 25 2026262K CONTEXTTEXT ONLYPROMPT CACHINGTURBO K2

Kimi K2 Turbo Preview API Pricing

Q: Is Kimi K2 Turbo Preview still supported?

No for new planning. Kimi's pricing page says the kimi-k2 series will be discontinued on May 25, 2026 and recommends moving to newer Kimi models. This page keeps the price visible for archive and invoice checks.

Q: What is Kimi K2 Turbo Preview priced at?

Kimi K2 Turbo Preview is listed at $1.15/M input, $8/M output, and $0.15/M cache-hit input. Prices exclude applicable taxes and are stored here in USD per one million tokens.

Kimi K2 Turbo Preview is a deprecated low-latency K2 preview with a premium output rate. The live Kimi pricing surface lists $1.15/M input and $8/M output, with cache hits at $0.15/M; Kimi says the whole kimi-k2 series retired on May 25, 2026. Pulled directly from platform.kimi.ai daily.

Input - per 1M tokens

$1.15/M

Source Kimi API retired

Output - per 1M tokens

$8.00/M

60-100 tokens/sec target retired

Cached input - per 1M tokens

$0.15/M

Cache automatic -87%

Effective - agentic blend

$0.94/M

92/8 split - 82% cache

§ 01 / TERMINAL

Run the numbers.

Live calculator pre-loaded with Kimi K2 Turbo Preview archive rates. Use it for invoice checks and migration sizing, then compare against Kimi K2.6 before new production work.

Spend

$ /mo

Workload split

Prompt cache hit rate

Tokens you can process

—

Words equivalent (English)

—

Effective rate

—

Open full calculator (all models · share URL · CSV) →

§ 02 / SCENARIOS

Real-world presets.

CODING AGENT

Legacy codebase patch

$0.064/task

28,000 in - 4,000 out~1,557 units/$100

DEEP RESEARCH

Tool-heavy brief

$0.127/brief

55,000 in - 8,000 out~785 units/$100

RAG

Knowledge-base lookup

$0.020/query

9,000 in - 1,200 out~5,025 units/$100

CHATBOT

Support turn

$0.010/turn

3,500 in - 700 out~10,416 units/$100

§ 03 / TOKENIZER

Paste text. See tokens. See cost.

Your text · live count

Estimate · moonshot-tokenizer-estimate · ≈3.85 chars/token Auto-counts as you type

This is a chars-per-token approximation, not a real tokenizer. Actual tokens vary by language, code density, and tool-call overhead — counts are typically ±10–20% off for English prose, more for code or non-Latin scripts. For exact billing, use the vendor's official tokenizer.

Characters 405

Words 68

Tokens (estimated) 105 tokens

Cost as input · uncached $0.000121 USD

Cost as output · uncached $0.000840 USD

Cost as cached input $0.000016 USD

§ 04 / SHELF

Up against the shelf.

All models →

Model	Input /M	Output /M	Effective blended	Context	Best for
Kimi K2 Turbo Preview Current	$1.15 cache $0.15	$8.00	$0.94 agentic 92/8	262K	Retired K2 archive page
Kimi K2.5	$0.60 cache $0.10	$3.00	$0.41 cheaper	262K	Supported K2 migration target
Kimi K2.6	$0.95 cache $0.16	$4.00	$0.60 cheaper	262K	Current Kimi flagship agents
Gemini 2.5 Flash	$0.30 cache $0.03	$2.50	$0.27 cheaper	1M	Multimodal budget work
Doubao Seed 2.0 Pro	$0.45 cache $0.09	$2.25	$0.32 cheaper	256K	Chinese multimodal agents
GPT-5.4 mini	$0.75 cache $0.07	$4.50	$0.54 cheaper	400K	OpenAI low-latency coding

Frequently asked.

Archive pricing questions for teams still seeing this slug in logs, invoices, or migration plans.

Q · 01 Is Kimi K2 Turbo Preview still supported? +

No for new planning. Kimi's pricing page says the kimi-k2 series will be discontinued on May 25, 2026 and recommends moving to newer Kimi models. This page keeps the price visible for archive and invoice checks.

Q · 02 What is Kimi K2 Turbo Preview priced at? +

Kimi K2 Turbo Preview is listed at $1.15/M input, $8/M output, and $0.15/M cache-hit input. Prices exclude applicable taxes and are stored here in USD per one million tokens.

Q · 03 Which model should replace it? +

Use Kimi K2.6 for new Kimi workloads. It is the recommended migration target for this archive slug and remains on the current Kimi pricing surface.

Q · 04 How does prompt caching affect the cost? +

Kimi supports automatic context caching for K2 models. Cache-hit input is billed at $0.15/M instead of the fresh-input $1.15/M rate, and AI//COST's default effective blend assumes an 82% cache-hit rate.

Q · 05 How is the effective price calculated? +

AI//COST uses the same 92/8 agentic blend everywhere. With an 82% cache-hit rate, Kimi K2 Turbo Preview's effective blended cost is $0.94/M.

Q · 06 Does this K2 variant support vision? +

No. Kimi documents the K2 preview series as text-only and explicitly says it does not support vision functionality. Use Kimi K2.5 or Kimi K2.6 when image or video input is part of the workload.

Q · 07 Are taxes or regional surcharges included? +

No. The Kimi pricing page states that listed prices exclude applicable taxes and checkout applies tax based on jurisdiction. AI//COST stores the vendor token price before taxes.

Reviewed by Yaroslav Vikhariev Founder - AI//COST - Pricing pulled daily from platform.kimi.ai - Last verified May 22, 2026

Methodology Report a correction More by Y.V.