Last verified 2026-05-18

GEMINI 3 PREVIEW1M CONTEXTMULTIMODALCONTEXT CACHINGBATCH + FLEX -50%

Gemini 3.1 Pro Preview API Pricing

Q: What is Gemini 3.1 Pro Preview's standard API price?

Google lists gemini-3.1-pro-preview at $2/M input, $0.20/M cached input, and $12/M output for prompts up to 200K tokens. For prompts above 200K, the table lists $4/M input, $0.40/M cached input, and $18/M output.

Q: Does output pricing include thinking tokens?

Yes. Google's pricing page labels output as Output price (including thinking tokens). This page therefore treats all generated reasoning and answer tokens as part of the published output rate.

Q: How much do Batch and Flex cost?

Google lists Batch and Flex at $1/M input and $6/M output for prompts up to 200K, with $2/M input and $9/M output above 200K. Context caching in those sections is shown at the same cache rate as Standard.

Q: Is this a preview model?

Yes. Google's models page labels Gemini 3.1 Pro as Preview. Preview models may have more restrictive limits and can change before stable release.

Q: What about Google Search grounding costs?

For Gemini 3 models, Google lists 5,000 grounded prompts per month free, shared across Gemini 3, then $14 / 1,000 search queries. Tool charges are separate from token prices.

Q: How accurate is the tokenizer estimate?

The widget uses 4.0 characters per token as a Gemini planning estimate. Exact billing can differ by language, media inputs, tool calls, and how Google tokenizes multimodal content.

Google's current Gemini 3.1 Pro preview tier for multimodal reasoning and agentic work: $2/M input, $12/M output, and $0.20/M cached input for prompts up to 200K tokens. Pulled directly from ai.google.dev.

Input - per 1M tokens

$2.00/M

Tier 1 <=200K prompt standard

Output - per 1M tokens

$12.00/M

Tier 2 output is $18/M standard

Cached input - 90% off

$0.20/M

Cache plus storage fee -90%

Effective - agentic blend

$1.44/M

92/8 split - 82% cache

§ 01 / TERMINAL

Run the numbers.

Live calculator pre-loaded with current Gemini 3.1 Pro Preview tier-1 rates. For prompts above 200K tokens, Google lists higher tier-2 input, cache, and output prices.

Spend

$ /mo

Workload split

Prompt cache hit rate

Tokens you can process

—

Words equivalent (English)

—

Effective rate

—

Open full calculator (all models · share URL · CSV) →

§ 02 / SCENARIOS

Real-world presets.

CODING

Vibe-coding feature

$0.256/task

80k in / 8k out~390 tasks/$100

LONG CONTEXT

Document pack analysis

$0.460/pack

200k in / 5k out~217 packs/$100

MULTIMODAL

Video + docs briefing

$0.096/brief

30k in / 3k out~1,041 briefs/$100

AGENT

Research agent loop

$0.360/loop

120k in / 10k out~277 loops/$100

RAG

Large RAG synthesis

$0.444/answer

150k in / 12k out~225 answers/$100

§ 03 / TOKENIZER

Paste text. See tokens. See cost.

Your text · live count

Estimate · gemini-tokenizer-estimate · ≈3.85 chars/token Auto-counts as you type

This is a chars-per-token approximation, not a real tokenizer. Actual tokens vary by language, code density, and tool-call overhead — counts are typically ±10–20% off for English prose, more for code or non-Latin scripts. For exact billing, use the vendor's official tokenizer.

Characters 483

Words 74

Tokens (estimated) 125 tokens

Cost as input · uncached $0.000250 USD

Cost as output · uncached $0.001500 USD

Cost as cached input $0.000025 USD

§ 04 / SHELF

Up against the shelf.

All models →

Model	Input /M	Output /M	Effective blended	Context	Best for
Gemini 3.1 Pro Preview Current	$2.00 cache $0.20	$12.00	$1.44 agentic 92/8	1M	Google frontier preview
Gemini 3 Flash Preview	$0.50 cache $0.05	$3.00	$0.36 mid-tier preview	1M	Cheaper Gemini 3 preview
Gemini 3.1 Flash-Lite	$0.25 cache $0.03	$1.50	$0.18 light tier	1M	High-volume Gemini 3.1
Gemini 2.5 Pro	$1.25 cache $0.13	$10.00	$1.10 previous stable pro	2M	Stable Gemini 2.5 Pro
Gemini 2.5 Flash	$0.30 cache $0.03	$2.50	$0.27 stable flash	1M	Best price-performance Gemini 2.5
GPT-5.4	$2.50 cache $0.25	$15.00	$1.80 OpenAI competitor	1.05M	Affordable OpenAI frontier work
GPT-5.4 mini	$0.75 cache $0.07	$4.50	$0.54 OpenAI mini	400K	Subagents and lightweight coding

Frequently asked.

Practical Gemini 3.1 Pro Preview pricing questions, with Google's tiered prompt prices separated from workload assumptions.

Q · 01 What is Gemini 3.1 Pro Preview's standard API price? +

Google lists gemini-3.1-pro-preview at $2/M input, $0.20/M cached input, and $12/M output for prompts up to 200K tokens. For prompts above 200K, the table lists $4/M input, $0.40/M cached input, and $18/M output.

Q · 02 Does output pricing include thinking tokens? +

Yes. Google's pricing page labels output as Output price (including thinking tokens). This page therefore treats all generated reasoning and answer tokens as part of the published output rate.

Q · 03 How much do Batch and Flex cost? +

Google lists Batch and Flex at $1/M input and $6/M output for prompts up to 200K, with $2/M input and $9/M output above 200K. Context caching in those sections is shown at the same cache rate as Standard.

Q · 04 Is this a preview model? +

Yes. Google's models page labels Gemini 3.1 Pro as Preview. Preview models may have more restrictive limits and can change before stable release.

Q · 05 What about Google Search grounding costs? +

For Gemini 3 models, Google lists 5,000 grounded prompts per month free, shared across Gemini 3, then $14 / 1,000 search queries. Tool charges are separate from token prices.

Q · 06 How accurate is the tokenizer estimate? +

The widget uses 4.0 characters per token as a Gemini planning estimate. Exact billing can differ by language, media inputs, tool calls, and how Google tokenizes multimodal content.

Reviewed by Yaroslav Vikhariev Founder - AI//COST - Pricing pulled daily from ai.google.dev - Last verified May 18, 2026

Methodology Report a correction More by Y.V.