GEMINI 3.5 GA · 1M CONTEXT · MULTIMODAL · AGENTIC CODING · BATCH + FLEX

Gemini 3.5 Flash API Pricing

Gemini 3.5 Flash is Google's GA model for sustained frontier performance on agentic and coding tasks. It is priced at $1.50/M input, $9.00/M output, and $0.15/M cached input, with figures pulled directly from ai.google.dev.

Input (per 1M tokens): $1.50/M · base token price, standard tier
Output (per 1M tokens): $9.00/M · output tokens, standard tier
Cached input: $0.15/M · 90% below standard input, plus a cache storage fee
Effective (agentic blend): $1.08/M · 92/8 input/output split, 82% cache hit rate
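
The $1.08/M effective figure can be reproduced from the listed rates. The sketch below assumes that "92/8" means 92% input and 8% output tokens, and that "82% cache" is the share of input tokens billed at the cached rate. The function name `blended_rate` is illustrative, and cache storage fees are ignored.

```python
def blended_rate(input_price, cached_price, output_price,
                 input_share=0.92, cache_hit_rate=0.82):
    """Return an effective USD price per 1M tokens for a mixed workload.

    Assumptions (not spelled out on the page): input_share is the fraction
    of all tokens that are input, and cache_hit_rate is the fraction of
    input tokens billed at the cached rate. Storage fees for cached
    content are not modeled.
    """
    effective_input = (cache_hit_rate * cached_price
                       + (1 - cache_hit_rate) * input_price)
    return input_share * effective_input + (1 - input_share) * output_price


# Gemini 3.5 Flash standard rates from the page, in USD per 1M tokens.
rate = blended_rate(input_price=1.50, cached_price=0.15, output_price=9.00)
print(f"${rate:.2f}/M")  # → $1.08/M
```

Under the same assumptions, lowering the cache hit rate or raising the output share moves the blended figure toward the $9.00/M output price, so the $1.08/M label only holds for cache-heavy, input-dominated agent loops.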
§ 01 / TERMINAL

Run the numbers.

Live calculator pre-loaded with Gemini 3.5 Flash standard rates. Google lists standard input at $1.50/M, output including thinking tokens at $9.00/M, and cached input at $0.15/M.

Inputs: monthly budget ($/mo) · workload split · prompt cache hit rate
Outputs: tokens you can process · words equivalent (English) · effective rate
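
The calculator fields above reduce to a single division once an effective rate is known. The following sketch is a rough approximation, not the page's actual calculator: it takes the effective rate as given (defaulting to the $1.08/M agentic blend) and converts tokens to English words using the ratio from this page's own tokenizer sample (64 words for 105 estimated tokens), which is an assumption and not a Google figure. `tokens_for_budget` is a hypothetical name.

```python
def tokens_for_budget(monthly_budget_usd, effective_rate_per_m=1.08,
                      words_per_token=64 / 105):
    """Estimate how many tokens a monthly budget covers.

    effective_rate_per_m: blended USD price per 1M tokens (default is the
    page's agentic blend for Gemini 3.5 Flash).
    words_per_token: assumed ratio taken from the page's tokenizer sample.
    """
    tokens = monthly_budget_usd / effective_rate_per_m * 1_000_000
    return {"tokens": int(tokens), "words": int(tokens * words_per_token)}


estimate = tokens_for_budget(100)  # a hypothetical $100/month budget
print(f"{estimate['tokens']:,} tokens, about {estimate['words']:,} words")
```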
§ 03 / TAPE

Price history.

Gemini 3.5 Flash is currently listed at $1.50/M input, $9.00/M output, and $0.15/M cached input.

Input · $1.50/M
Output · $9.00/M
Cached · $0.15/M
MAY 19 · Gemini 3.5 Flash released as GA model
JUN 08 · Verified current $1.50/M input, $0.15/M cached input, and $9.00/M output on Google's pricing page
§ 04 / TOKENIZER

Paste text. See tokens. See cost.

Estimate · gemini-tokenizer-estimate · ≈3.85 chars/token

This is a chars-per-token approximation, not a real tokenizer. Actual token counts vary by language, code density, and tool-call overhead. Estimates are typically within ±10–20% for English prose and further off for code or non-Latin scripts. For exact billing, use the vendor's official tokenizer.

Characters 406
Words 64
Tokens (estimated) 105 tokens
Cost as input · uncached $0.000158 USD
Cost as output · uncached $0.000945 USD
Cost as cached input $0.000016 USD
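
The sample figures above follow from the ≈3.85 chars/token heuristic and the standard rates. Here is a minimal sketch of that arithmetic, assuming the estimate rounds to the nearest whole token and costs are rounded half-up to six decimals; the page does not state either rounding rule. The function names are illustrative, and this is still a heuristic, not a real tokenizer.

```python
from decimal import Decimal, ROUND_HALF_UP

CHARS_PER_TOKEN = 3.85  # heuristic quoted by the page, not a real tokenizer

# Standard Gemini 3.5 Flash rates from the page, in USD per 1M tokens.
RATES_PER_M = {
    "input": Decimal("1.50"),
    "output": Decimal("9.00"),
    "cached_input": Decimal("0.15"),
}


def estimate_tokens(char_count):
    """Approximate a token count from a character count (assumed: round to nearest)."""
    return round(char_count / CHARS_PER_TOKEN)


def estimate_costs(tokens):
    """Return the USD cost of `tokens` at each rate, rounded to six decimals."""
    micro = Decimal("0.000001")
    return {
        name: (Decimal(tokens) * rate / Decimal(1_000_000)).quantize(
            micro, rounding=ROUND_HALF_UP)
        for name, rate in RATES_PER_M.items()
    }


tokens = estimate_tokens(406)  # the 406-character sample shown above
print(tokens)                  # → 105
for name, cost in estimate_costs(tokens).items():
    print(f"{name}: ${cost} USD")
```

`Decimal` is used here so that half-cent-style ties such as 0.0001575 round the same way every time, which binary floats do not guarantee.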
§ 05 / SHELF

Up against the shelf.

Model | Input /M | Cached input /M | Output /M | Effective blended /M | Context | Best for
Gemini 3.5 Flash (current) | $1.50 | $0.15 | $9.00 | $1.08 (agentic 92/8) | 1M | Newest GA Gemini agent model
Gemini 3.1 Pro Preview | $2.00 | $0.20 | $12.00 | $1.44 (Pro preview) | 1M | Higher-ceiling preview reasoning
Gemini 3.1 Flash-Lite | $0.25 | $0.03 | $1.50 | $0.18 (same token price) | 1M | Low-latency high-volume tasks
Gemini 3 Flash Preview | $0.50 | $0.05 | $3.00 | $0.36 (older preview) | 1M | Gemini 3 preview workloads

Frequently asked.

Practical Gemini 3.5 Flash pricing questions, with standard, batch, flex, and priority tiers separated.

Q · 01 What is Gemini 3.5 Flash priced at?
Google lists gemini-3.5-flash at $1.50/M input, $9.00/M output, and $0.15/M cached input. These are USD prices per 1M tokens on the paid Gemini API tier.
Q · 02 Does output pricing include thinking tokens?
Yes. Google's pricing table labels the output row as "Output price (including thinking tokens)", so this page treats both generated thinking tokens and answer tokens as output.
Q · 03 How much do Batch and Flex cost?
Google lists Batch at $0.75/M input, $4.50/M output, and $0.075/M cached input. Flex uses the same input and output rates as Batch, with cached input listed at $0.08/M.
Q · 04 What about Priority pricing?
Priority is higher-priced: $2.70/M input, $16.20/M output, and $0.27/M cached input.
Q · 05 What context window does it support?
Google's model page lists an input token limit of 1,048,576 and an output token limit of 65,536. AI//COST rounds that to a 1M context label.
Q · 06 Is this a preview model?
No. Google released gemini-3.5-flash as the GA version on May 19, 2026. The older preview row remains separate as gemini-3-flash-preview.
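
The tier prices from Q · 03 and Q · 04 are easier to compare on a concrete workload. The sketch below is illustrative only: the token volumes are hypothetical, Flex input and output are assumed to match Batch (as the Q · 03 answer implies), and cache storage fees are left out.

```python
# USD per 1M tokens, as listed in the FAQ answers above.
# Flex input/output are assumed to match Batch.
TIERS = {
    "standard": {"input": 1.50, "cached": 0.15, "output": 9.00},
    "batch":    {"input": 0.75, "cached": 0.075, "output": 4.50},
    "flex":     {"input": 0.75, "cached": 0.08, "output": 4.50},
    "priority": {"input": 2.70, "cached": 0.27, "output": 16.20},
}


def workload_cost(tier, input_tokens, cached_tokens, output_tokens):
    """Return the USD cost of one workload on the given tier.

    input_tokens counts only uncached input; cached_tokens are billed at
    the cached rate. Cache storage fees are not modeled.
    """
    p = TIERS[tier]
    return (input_tokens * p["input"]
            + cached_tokens * p["cached"]
            + output_tokens * p["output"]) / 1_000_000


# Hypothetical workload: 2M uncached input, 8M cached input, 1M output tokens.
for tier in TIERS:
    print(f"{tier:<9} ${workload_cost(tier, 2_000_000, 8_000_000, 1_000_000):.2f}")
```

With these assumed volumes, Batch comes out at half the standard cost and Priority at 1.8 times the standard cost, which matches the ratios between the listed per-token prices.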