Kimi K2.6 API Pricing
Kimi K2.6 is Moonshot Kimi's current flagship for long-horizon coding, multimodal input, and agent tasks. The live Kimi pricing surface lists $0.95/M input and $4/M output, with cache hits at $0.16/M. Pulled directly from platform.kimi.ai daily.
Run the numbers.
Live calculator pre-loaded with current Kimi K2.6 rates. Tweak spend, output mix, or cache assumptions and share the URL to share the calculation.
Real-world presets.
Long-horizon coding
Screenshot to implementation
Research brief
Support assistant
Price history.
Paste text. See tokens. See cost.
This is a chars-per-token approximation, not a real tokenizer. Actual tokens vary by language, code density, and tool-call overhead — counts are typically ±10–20% off for English prose, more for code or non-Latin scripts. For exact billing, use the vendor's official tokenizer.
| Model | Input /M | Output /M | Effective blended | Context | Best for |
|---|---|---|---|---|---|
| Kimi K2.6 Current | $0.95 cache $0.16 | $4.00 | $0.60 agentic 92/8 | 262K | Frontier Kimi agents |
| Kimi K2.5 | $0.60 cache $0.10 | $3.00 | $0.41 cheaper | 262K | Multimodal Kimi production |
| Kimi K2 (0905 Preview) | $0.60 cache $0.15 | $2.50 | $0.41 cheaper | 262K | Legacy K2 migration checks |
| Kimi K2 Turbo Preview | $1.15 cache $0.15 | $8.00 | $0.94 pricier | 262K | Legacy fast K2 traffic |
| GPT-5.4 mini | $0.75 cache $0.07 | $4.50 | $0.54 cheaper | 400K | OpenAI coding agents |
| Gemini 2.5 Flash | $0.30 cache $0.03 | $2.50 | $0.27 cheaper | 1M | Multimodal budget work |
| Doubao Seed 2.0 Pro | $0.45 cache $0.09 | $2.25 | $0.32 cheaper | 256K | Chinese multimodal agents |
| DeepSeek V4 Flash | $0.14 cache $0.00 | $0.28 | $0.05 cheaper | 1M | Budget reasoning and coding |
Frequently asked.
Practical pricing questions, separated from calculator assumptions and regional taxes.
Q · 01 What is Kimi K2.6 priced at? +
$0.95/M input, $4/M output, and $0.16/M cache-hit input. The Kimi pricing docs state that prices exclude applicable taxes.Q · 02 How does prompt caching work? +
$0.16/M instead of the fresh-input $0.95/M rate, and the calculator's default agentic blend assumes an 82% cache-hit rate.Q · 03 How is the effective price calculated? +
$0.6/M.Q · 04 Does this include image and video input? +
Q · 05 Is there a batch discount? +
Q · 06 Are taxes or regional surcharges included? +
Q · 07 How accurate is the tokenizer estimate? +
moonshot-tokenizer-estimate chars-per-token estimate for English text. Actual billing comes from Kimi API usage fields and can differ for Chinese, code, images, video, or mixed-language prompts.