Last verified
GEMINI 3 PREVIEW1M CONTEXTMULTIMODALCONTEXT CACHINGBATCH + FLEX -50%

Gemini 3.1 Pro Preview API Pricing

Google's current Gemini 3.1 Pro preview tier for multimodal reasoning and agentic work: $2/M input, $12/M output, and $0.20/M cached input for prompts up to 200K tokens. Pulled directly from ai.google.dev.

Input - per 1M tokens
$2.00/M
Tier 1 <=200K prompt standard
Output - per 1M tokens
$12.00/M
Tier 2 output is $18/M standard
Cached input - 90% off
$0.20/M
Cache plus storage fee -90%
Effective - agentic blend
$1.44/M
92/8 split - 82% cache
§ 01 / TERMINAL

Run the numbers.

Live calculator pre-loaded with current Gemini 3.1 Pro Preview tier-1 rates. For prompts above 200K tokens, Google lists higher tier-2 input, cache, and output prices.

$ /mo
Workload split
Prompt cache hit rate
Tokens you can process
Words equivalent (English)
Effective rate
§ 02 / SCENARIOS

Real-world presets.

§ 03 / TAPE

Price history.

Gemini 3.1 Pro Preview is listed at $2/M input and $12/M output for tier-1 prompts.

Input · $2/M
Output · $12/M
Cached · $0.20/M
MAY 07 Listed on Google's models page update as Gemini 3.1 Pro PreviewMAY 18 Verified unchanged on Google Gemini API pricing page
§ 04 / TOKENIZER

Paste text. See tokens. See cost.

Estimate · gemini-tokenizer-estimate · ≈3.85 chars/token Auto-counts as you type

This is a chars-per-token approximation, not a real tokenizer. Actual tokens vary by language, code density, and tool-call overhead — counts are typically ±10–20% off for English prose, more for code or non-Latin scripts. For exact billing, use the vendor's official tokenizer.

Characters
Words
Tokens (estimated)
Cost as input · uncached
Cost as output · uncached
Cost as cached input
§ 05 / SHELF

Up against the shelf.

All models →
Model Input /M Output /M Effective blended Context Best for
Gemini 3.1 Pro Preview Current $2.00 cache $0.20 $12.00 $1.44 agentic 92/8 1M Google frontier preview
Gemini 3 Flash Preview $0.50 cache $0.05 $3.00 $0.36 mid-tier preview 1M Cheaper Gemini 3 preview
Gemini 3.1 Flash-Lite $0.25 cache $0.03 $1.50 $0.18 light tier 1M High-volume Gemini 3.1
Gemini 2.5 Pro $1.25 cache $0.13 $10.00 $1.10 previous stable pro 2M Stable Gemini 2.5 Pro
Gemini 2.5 Flash $0.30 cache $0.03 $2.50 $0.27 stable flash 1M Best price-performance Gemini 2.5
GPT-5.4 $2.50 cache $0.25 $15.00 $1.80 OpenAI competitor 1.05M Affordable OpenAI frontier work
GPT-5.4 mini $0.75 cache $0.07 $4.50 $0.54 OpenAI mini 400K Subagents and lightweight coding

Frequently asked.

Practical Gemini 3.1 Pro Preview pricing questions, with Google's tiered prompt prices separated from workload assumptions.

Q · 01 What is Gemini 3.1 Pro Preview's standard API price? +
Google lists gemini-3.1-pro-preview at $2/M input, $0.20/M cached input, and $12/M output for prompts up to 200K tokens. For prompts above 200K, the table lists $4/M input, $0.40/M cached input, and $18/M output.
Q · 02 Does output pricing include thinking tokens? +
Yes. Google's pricing page labels output as Output price (including thinking tokens). This page therefore treats all generated reasoning and answer tokens as part of the published output rate.
Q · 03 How much do Batch and Flex cost? +
Google lists Batch and Flex at $1/M input and $6/M output for prompts up to 200K, with $2/M input and $9/M output above 200K. Context caching in those sections is shown at the same cache rate as Standard.
Q · 04 Is this a preview model? +
Yes. Google's models page labels Gemini 3.1 Pro as Preview. Preview models may have more restrictive limits and can change before stable release.
Q · 05 What about Google Search grounding costs? +
For Gemini 3 models, Google lists 5,000 grounded prompts per month free, shared across Gemini 3, then $14 / 1,000 search queries. Tool charges are separate from token prices.
Q · 06 How accurate is the tokenizer estimate? +
The widget uses 4.0 characters per token as a Gemini planning estimate. Exact billing can differ by language, media inputs, tool calls, and how Google tokenizes multimodal content.