Last verified 2026-06-08

GEMINI 3.5 GA · 1M CONTEXT · MULTIMODAL · AGENTIC CODING · BATCH + FLEX

Gemini 3.5 Flash API Pricing

Q: What is Gemini 3.5 Flash priced at?

Google lists gemini-3.5-flash at $1.50/M input, $9.00/M output, and $0.15/M cached input. These are USD prices per 1M tokens on the paid Gemini API tier.

Q: Does output pricing include thinking tokens?

Yes. Google's pricing table labels the output row as Output price (including thinking tokens), so this page treats generated thinking and answer tokens as output.
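As a rough illustration of what that means for a bill, the sketch below adds thinking and answer tokens together before applying the standard output rate. The function name and the example token counts are hypothetical; only the $9.00/M rate comes from this page.

```python
# Standard-tier output rate quoted on this page, in USD per 1M tokens.
OUTPUT_USD_PER_M = 9.00


def output_cost_usd(thinking_tokens: int, answer_tokens: int) -> float:
    """Bill thinking and answer tokens together at the output rate."""
    billed_tokens = thinking_tokens + answer_tokens
    return billed_tokens * OUTPUT_USD_PER_M / 1_000_000


# Hypothetical response: 3,000 thinking tokens plus 1,000 answer tokens.
cost = output_cost_usd(3_000, 1_000)
print(f"${cost:.4f}")  # $0.0360
```

In this example three quarters of the output charge comes from tokens the user never sees, which is why thinking-heavy workloads can cost more than their visible answer length suggests.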

Q: How much do Batch and Flex cost?

Google lists Batch at $0.75/M input, $4.50/M output, and $0.075/M cached input. Flex uses the same input/output rates, with cached input listed at $0.08/M.

Q: What about Priority pricing?

Priority is higher-priced: $2.70/M input, $16.20/M output, and $0.27/M cached input.
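To compare the four tiers side by side, a minimal sketch like the following can help. The rates are copied from the answers above; the `request_cost_usd` helper and the example token counts are illustrative assumptions, and cache storage fees are not modeled.

```python
# USD per 1M tokens, copied from the tiers quoted on this page.
RATES = {
    "standard": {"input": 1.50, "output": 9.00, "cached": 0.15},
    "batch": {"input": 0.75, "output": 4.50, "cached": 0.075},
    "flex": {"input": 0.75, "output": 4.50, "cached": 0.08},
    "priority": {"input": 2.70, "output": 16.20, "cached": 0.27},
}


def request_cost_usd(tier: str, uncached_in: int, cached_in: int, out: int) -> float:
    """Cost of one request given its token counts (hypothetical helper)."""
    r = RATES[tier]
    total = (
        uncached_in * r["input"]
        + cached_in * r["cached"]
        + out * r["output"]
    )
    return total / 1_000_000


# Hypothetical request: 20k uncached input, 80k cached input, 5k output.
for tier in RATES:
    print(f"{tier:>8}: ${request_cost_usd(tier, 20_000, 80_000, 5_000):.4f}")
```

For this example request the standard tier comes to $0.0870, Batch to $0.0435, Flex to $0.0439, and Priority to $0.1566, so Flex differs from Batch only through its slightly higher cached-input rate.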

Q: What context window does it support?

Google's model page lists an input token limit of 1,048,576 and an output token limit of 65,536. AI//COST rounds that to a 1M context label.
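A pre-flight check against those limits might look like the sketch below. It treats the input and output limits as independent caps, which is an assumption; this page does not say how the two interact. The `fits_limits` name is hypothetical.

```python
# Limits quoted above from Google's model page.
INPUT_TOKEN_LIMIT = 1_048_576   # equals 2**20
OUTPUT_TOKEN_LIMIT = 65_536     # equals 2**16


def fits_limits(prompt_tokens: int, max_output_tokens: int) -> bool:
    """Return True if both counts are within the listed limits.

    Assumption: the two limits are checked independently.
    """
    return (
        prompt_tokens <= INPUT_TOKEN_LIMIT
        and max_output_tokens <= OUTPUT_TOKEN_LIMIT
    )


print(fits_limits(1_000_000, 8_192))   # True
print(fits_limits(1_048_577, 8_192))   # False: one token over the input limit
```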

Q: Is this a preview model?

No. Google released gemini-3.5-flash as the GA version on May 19, 2026. The older preview row remains separate as gemini-3-flash-preview.

Gemini 3.5 Flash is Google's GA model for sustained frontier performance on agentic and coding tasks: $1.50/M input, $9.00/M output, and $0.15/M cached input. Pulled directly from ai.google.dev.

Input - per 1M tokens: $1.50/M (base token price, standard tier)

Output - per 1M tokens: $9.00/M (output tokens, standard tier)

Cached input: $0.15/M (cache rate plus storage fee, 90% below standard input)

Effective - agentic blend: $1.08/M (92/8 input/output split, 82% cache hit rate)
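The $1.08/M agentic blend can be reproduced from the standard rates. The formula below is a reconstruction, not AI//COST's published method: it assumes 92% of tokens are input, 8% are output, and 82% of input tokens are served from cache, and it ignores cache storage fees.

```python
def effective_rate(
    input_rate: float,
    output_rate: float,
    cached_rate: float,
    input_share: float = 0.92,  # assumed share of tokens that are input
    cache_hit: float = 0.82,    # assumed share of input served from cache
) -> float:
    """Blended USD per 1M tokens for a mixed input/output workload."""
    # Average price of an input token once cache hits are accounted for.
    blended_input = cache_hit * cached_rate + (1 - cache_hit) * input_rate
    # Weight input and output by their share of total tokens.
    return input_share * blended_input + (1 - input_share) * output_rate


rate = effective_rate(input_rate=1.50, output_rate=9.00, cached_rate=0.15)
print(f"${rate:.2f}/M")  # $1.08/M
```

Under these assumptions output accounts for $0.72 of the $1.08 even though it is only 8% of the tokens, so the blend is far more sensitive to output volume than to the cache hit rate.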

§ 01 / TERMINAL

Run the numbers.

Live calculator pre-loaded with Gemini 3.5 Flash standard rates. Google lists standard input at $1.50/M, output including thinking tokens at $9.00/M, and cached input at $0.15/M.


§ 02 / SCENARIOS

Real-world presets.

CODING AGENT

Repository iteration loop

$0.137/task

120k cache-heavy in / 10k out · ~728 units/$100

MULTIMODAL

Video + docs briefing

$0.060/brief

60k mixed in / 4k out · ~1,677 units/$100

RAG

Search-grounded answer

$0.045/answer

45k in / 3k out · ~2,237 units/$100

BULK

Batch classification

$0.0074/item

15k batch in / 1k out · ~13,513 units/$100
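The per-unit prices in these presets can be approximated with a short sketch. It assumes every preset uses an 82% cache hit rate on input, which is not stated for each card but reproduces the listed per-unit prices at the precision shown. The preset names and the `unit_cost` helper are illustrative, and cache storage fees are ignored.

```python
# USD per 1M tokens, from the standard and Batch tiers on this page.
STANDARD = {"input": 1.50, "output": 9.00, "cached": 0.15}
BATCH = {"input": 0.75, "output": 4.50, "cached": 0.075}

CACHE_HIT = 0.82  # assumption: same cache hit rate for every preset


def unit_cost(rates: dict, in_tokens: int, out_tokens: int,
              cache_hit: float = CACHE_HIT) -> float:
    """USD cost of one unit of work (task, brief, answer, or item)."""
    input_rate = cache_hit * rates["cached"] + (1 - cache_hit) * rates["input"]
    return (in_tokens * input_rate + out_tokens * rates["output"]) / 1_000_000


presets = {
    "coding agent": (STANDARD, 120_000, 10_000),
    "multimodal": (STANDARD, 60_000, 4_000),
    "rag": (STANDARD, 45_000, 3_000),
    "bulk (batch)": (BATCH, 15_000, 1_000),
}

for name, (rates, tokens_in, tokens_out) in presets.items():
    print(f"{name}: ${unit_cost(rates, tokens_in, tokens_out):.4f}")
```

Units per $100 is then roughly 100 divided by the per-unit cost. How the page rounds that figure is not specified, so it is not reproduced here.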

§ 03 / TOKENIZER

Paste text. See tokens. See cost.

Estimate · gemini-tokenizer-estimate · ≈3.85 chars/token

This is a chars-per-token approximation, not a real tokenizer. Actual tokens vary by language, code density, and tool-call overhead; counts are typically off by 10–20% for English prose, and by more for code or non-Latin scripts. For exact billing, use the vendor's official tokenizer.

Characters 406

Words 64

Tokens (estimated) 105 tokens

Cost as input · uncached $0.00016 USD

Cost as output · uncached $0.00094 USD

Cost as cached input $0.00002 USD
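The sample figures above follow from the 3.85 chars/token heuristic. The sketch below is an approximation of that arithmetic, not the page's actual implementation: rounding to the nearest whole token is an assumption, and the function name is hypothetical.

```python
CHARS_PER_TOKEN = 3.85  # heuristic quoted on this page

# Standard-tier rates in USD per 1M tokens.
INPUT_RATE = 1.50
OUTPUT_RATE = 9.00
CACHED_RATE = 0.15


def estimate_tokens(char_count: int) -> int:
    """Approximate token count; rounding to nearest is an assumption."""
    return round(char_count / CHARS_PER_TOKEN)


tokens = estimate_tokens(406)
print(tokens)  # 105

print(f"input:  ${tokens * INPUT_RATE / 1_000_000:.5f}")   # $0.00016
print(f"output: ${tokens * OUTPUT_RATE / 1_000_000:.5f}")
print(f"cached: ${tokens * CACHED_RATE / 1_000_000:.5f}")  # $0.00002
```

The output cost works out to about $0.000945, which sits on a rounding boundary at five decimals, so the last displayed digit depends on the rounding rule used.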

§ 04 / SHELF

Up against the shelf.


Model	Input /M	Cached input /M	Output /M	Effective blended /M	Context	Best for
Gemini 3.5 Flash (current)	$1.50	$0.15	$9.00	$1.08 (agentic 92/8)	1M	Newest GA Gemini agent model
Gemini 3.1 Pro Preview	$2.00	$0.20	$12.00	$1.44 (Pro preview)	1M	Higher-ceiling preview reasoning
Gemini 3.1 Flash-Lite	$0.25	$0.03	$1.50	$0.18 (same token price)	1M	Low-latency high-volume tasks
Gemini 3 Flash Preview	$0.50	$0.05	$3.00	$0.36 (older preview)	1M	Gemini 3 preview workloads


Reviewed by Yaroslav Vikhariev, Founder, AI//COST · Pricing pulled daily from ai.google.dev · Last verified Jun 08, 2026
