Kimi K2 Turbo Preview API Pricing
Kimi K2 Turbo Preview is a deprecated low-latency K2 preview with a premium output rate. The live Kimi pricing surface lists $1.15/M input and $8/M output, with cache hits at $0.15/M; Kimi says the whole kimi-k2 series retires on May 25, 2026. Pulled directly from platform.kimi.ai daily.
Run the numbers.
Live calculator pre-loaded with Kimi K2 Turbo Preview archive rates. Use it for invoice checks and migration sizing, then compare against Kimi K2.6 before new production work.
Real-world presets.
Legacy codebase patch
Tool-heavy brief
Knowledge-base lookup
Support turn
Price history.
Paste text. See tokens. See cost.
This is a chars-per-token approximation, not a real tokenizer. Actual tokens vary by language, code density, and tool-call overhead — counts are typically ±10–20% off for English prose, more for code or non-Latin scripts. For exact billing, use the vendor's official tokenizer.
| Model | Input /M | Output /M | Effective blended | Context | Best for |
|---|---|---|---|---|---|
| Kimi K2 Turbo Preview Current | $1.15 cache $0.15 | $8.00 | $0.94 agentic 92/8 | 262K | Deprecated K2 archive page |
| Kimi K2.5 | $0.60 cache $0.10 | $3.00 | $0.41 cheaper | 262K | Supported K2 migration target |
| Kimi K2.6 | $0.95 cache $0.16 | $4.00 | $0.60 cheaper | 262K | Current Kimi flagship agents |
| Gemini 2.5 Flash | $0.30 cache $0.03 | $2.50 | $0.27 cheaper | 1M | Multimodal budget work |
| Doubao Seed 2.0 Pro | $0.45 cache $0.09 | $2.25 | $0.32 cheaper | 256K | Chinese multimodal agents |
| GPT-5.4 mini | $0.75 cache $0.07 | $4.50 | $0.54 cheaper | 400K | OpenAI low-latency coding |
Frequently asked.
Archive pricing questions for teams still seeing this slug in logs, invoices, or migration plans.
Q · 01 Is Kimi K2 Turbo Preview still supported? +
kimi-k2 series will be discontinued on May 25, 2026 and recommends moving to newer Kimi models. This page keeps the price visible for archive and invoice checks.Q · 02 What is Kimi K2 Turbo Preview priced at? +
$1.15/M input, $8/M output, and $0.15/M cache-hit input. Prices exclude applicable taxes and are stored here in USD per one million tokens.Q · 03 Which model should replace it? +
Q · 04 How does prompt caching affect the cost? +
$0.15/M instead of the fresh-input $1.15/M rate, and AI//COST's default effective blend assumes an 82% cache-hit rate.Q · 05 How is the effective price calculated? +
$0.94/M.