Gemini 3.5 Flash API Pricing
Gemini 3.5 Flash is Google's GA model for sustained frontier performance on agentic and coding tasks: $1.50/M input, $9.00/M output, and $0.15/M cached input. Pulled directly from ai.google.dev.
Run the numbers.
Live calculator pre-loaded with Gemini 3.5 Flash standard rates. Google lists standard input at $1.50/M, output including thinking tokens at $9.00/M, and cached input at $0.15/M.
Real-world presets.
Repository iteration loop
Video + docs briefing
Search-grounded answer
Batch classification
Price history.
Paste text. See tokens. See cost.
This is a chars-per-token approximation, not a real tokenizer. Actual tokens vary by language, code density, and tool-call overhead — counts are typically ±10–20% off for English prose, more for code or non-Latin scripts. For exact billing, use the vendor's official tokenizer.
| Model | Input /M | Output /M | Effective blended | Context | Best for |
|---|---|---|---|---|---|
| Gemini 3.5 Flash Current | $1.50 cache $0.15 | $9.00 | $1.08 agentic 92/8 | 1M | Newest GA Gemini agent model |
| Gemini 3.1 Pro Preview | $2.00 cache $0.20 | $12.00 | $1.44 Pro preview | 1M | Higher-ceiling preview reasoning |
| Gemini 3.1 Flash-Lite | $0.25 cache $0.03 | $1.50 | $0.18 same token price | 1M | Low-latency high-volume tasks |
| Gemini 3 Flash Preview | $0.50 cache $0.05 | $3.00 | $0.36 older preview | 1M | Gemini 3 preview workloads |
Frequently asked.
Practical Gemini 3.5 Flash pricing questions, with standard, batch, flex, and priority tiers separated.
Q · 01 What is Gemini 3.5 Flash priced at? +
gemini-3.5-flash at $1.50/M input, $9.00/M output, and $0.15/M cached input. These are USD prices per 1M tokens on the paid Gemini API tier.Q · 02 Does output pricing include thinking tokens? +
Output price (including thinking tokens), so this page treats generated thinking and answer tokens as output.Q · 03 How much do Batch and Flex cost? +
$0.75/M input, $4.50/M output, and $0.075/M cached input. Flex uses the same input/output rates, with cached input listed at $0.08/M.Q · 04 What about Priority pricing? +
$2.70/M input, $16.20/M output, and $0.27/M cached input.Q · 05 What context window does it support? +
1,048,576 and an output token limit of 65,536. AI//COST rounds that to a 1M context label.Q · 06 Is this a preview model? +
gemini-3.5-flash as the GA version on May 19, 2026. The older preview row remains separate as gemini-3-flash-preview.