Gemini 2.5 Pro API Pricing
Google's stable Gemini 2.5 Pro tier for deep reasoning, coding, and large multimodal context: $1.25/M input, $10/M output, and $0.125/M cached input. Pulled directly from ai.google.dev.
Run the numbers.
Live calculator pre-loaded with current Gemini 2.5 Pro standard rates. Tier 1 applies to prompts up to 200K tokens; above 200K, Google lists $2.50/M input, $0.25/M cached input, and $15/M output.
Real-world presets.
Coding agent iteration
Document pack analysis
Video + docs briefing
Research agent loop
Large RAG synthesis
Price history.
Paste text. See tokens. See cost.
This is a chars-per-token approximation, not a real tokenizer. Actual tokens vary by language, code density, and tool-call overhead — counts are typically ±10–20% off for English prose, more for code or non-Latin scripts. For exact billing, use the vendor's official tokenizer.
| Model | Input /M | Output /M | Effective blended | Context | Best for |
|---|---|---|---|---|---|
| Gemini 3.1 Pro Preview | $2.00 cache $0.20 | $12.00 | $1.44 frontier preview | 1M | Google frontier preview |
| Gemini 3 Flash Preview | $0.50 cache $0.05 | $3.00 | $0.36 preview flash | 1M | Cheaper Gemini 3 preview |
| Gemini 3.1 Flash-Lite | $0.25 cache $0.03 | $1.50 | $0.18 light tier | 1M | High-volume Gemini 3.1 |
| Gemini 2.5 Pro Current | $1.25 cache $0.13 | $10.00 | $1.10 agentic 92/8 | 2M | Stable Gemini 2.5 Pro |
| Gemini 2.5 Flash | $0.30 cache $0.03 | $2.50 | $0.27 stable flash | 1M | Best price-performance Gemini 2.5 |
| Gemini 2.5 Flash-Lite | $0.10 cache $0.01 | $0.40 | $0.06 budget tier | 1M | Cheapest stable Gemini 2.5 |
| GPT-5.4 | $2.50 cache $0.25 | $15.00 | $1.80 OpenAI competitor | 1.05M | Affordable OpenAI frontier work |
| GPT-5.4 mini | $0.75 cache $0.07 | $4.50 | $0.54 OpenAI mini | 400K | Subagents and lightweight coding |
Frequently asked.
Practical Gemini 2.5 Pro pricing questions, with Google token rates separated from workload assumptions.
Q · 01 What is Gemini 2.5 Pro's standard API price? +
gemini-2.5-pro at $1.25/M input, $0.125/M cached input, and $10/M output for text, image, and video workloads. For prompts above 200K, the table lists $2.50/M input, $0.25/M cached input, and $15/M output. Audio input is listed separately at $1.25/M.Q · 02 Does output pricing include thinking tokens? +
Output price (including thinking tokens). This page treats generated reasoning and answer tokens as part of the published output rate.Q · 03 How much do Batch and Flex cost? +
Q · 04 Is this a preview model? +
Q · 05 What about Google Search grounding costs? +
Q · 06 How accurate is the tokenizer estimate? +
4.0 characters per token as a Gemini planning estimate. Exact billing can differ by language, media inputs, tool calls, and how Google tokenizes multimodal content.