Last verified
FRONTIER LAB · ALPHABET SUBSIDIARY · GOOGLE DEEPMIND · MULTIMODAL NATIVE · 2M CONTEXT

Gemini API Pricing

Google DeepMind is the AI division of Alphabet, formed in April 2023 by merging the original DeepMind (founded in London in 2010, acquired by Google in 2014) with Google Brain. Its CEO is Demis Hassabis. It ships the Gemini family, the frontier lineup with the broadest native modality coverage (text, image, audio, and video in one model), through both the Gemini API and the cloud platform now branded Gemini Enterprise Agent Platform (formerly Vertex AI, renamed April 2026).

Production models: 9 (4 GA · 5 preview)
Founded (parent): 1998 (DeepMind merged Apr 2023)
SWE-bench leader: 78% (Gemini 3 Flash Preview)
Cheapest tier: $0.10/M (2.5 Flash-Lite input)
Max context: 2M tokens (Gemini 2.5 Pro)
Last price move: 12d ago (3.1 Pro Preview launch)
§ 01 / LINEUP

The full roster.

FAST · LIGHTWEIGHT

Gemini 3.1 Flash-Lite LAUNCHED MAY 2026

Standard text, image, and video input is $0.25/M; audio input is $0.50/M; output includes thinking tokens.

Input
$0.25/M
Output
$1.50/M
1M ctx · vision
FRONTIER · REASONING

Gemini 2.5 Pro LAUNCHED JUN 2025

Tiered pricing: $1.25/M input, $0.125/M cached input, and $10/M output for prompts <= 200K tokens; $2.50/$0.25/$15 above 200K.

Input
$1.25/M
Output
$10/M
2M ctx · vision
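
The tiered rates on the Gemini 2.5 Pro card can be turned into a per-request estimate. The sketch below is a minimal illustration, not Google's billing logic. The names `Tier` and `gemini_25_pro_cost` are hypothetical, and it assumes the whole request is billed at the tier selected by total prompt size (cached plus uncached). Only the dollar rates and the 200K threshold come from the card.

```python
from dataclasses import dataclass

# Prompts at or below this size use the lower tier (from the card above).
TIER_THRESHOLD_TOKENS = 200_000


@dataclass(frozen=True)
class Tier:
    """USD rates per million tokens for one pricing tier."""
    input_per_m: float
    cached_per_m: float
    output_per_m: float


LOW_TIER = Tier(input_per_m=1.25, cached_per_m=0.125, output_per_m=10.0)
HIGH_TIER = Tier(input_per_m=2.50, cached_per_m=0.25, output_per_m=15.0)


def gemini_25_pro_cost(prompt_tokens: int, output_tokens: int,
                       cached_tokens: int = 0) -> float:
    """Estimate the USD cost of one request.

    Assumption: the tier is chosen from the total prompt size and then
    applied to every token in the request, including output tokens.
    """
    if not 0 <= cached_tokens <= prompt_tokens:
        raise ValueError("cached_tokens must be between 0 and prompt_tokens")
    tier = LOW_TIER if prompt_tokens <= TIER_THRESHOLD_TOKENS else HIGH_TIER
    fresh_tokens = prompt_tokens - cached_tokens
    total = (fresh_tokens * tier.input_per_m
             + cached_tokens * tier.cached_per_m
             + output_tokens * tier.output_per_m)
    return total / 1_000_000
```

Under these assumptions, a 100K-token prompt with 2K output tokens lands in the lower tier, while a 400K-token prompt with the same output is billed entirely at the higher rates.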
BALANCED · MID-TIER

Gemini 2.5 Flash LAUNCHED JUN 2025

Standard text, image, and video input is $0.30/M; audio input is $1/M; output includes thinking tokens.

Input
$0.30/M
Output
$2.50/M
1M ctx · vision
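
Because audio input is priced separately from text, image, and video, a mixed-modality prompt needs a per-modality sum. The following is a rough sketch using the Gemini 2.5 Flash input rates from the card. The function name `flash_input_cost` and the idea of passing token counts keyed by modality are illustrative assumptions, not part of any Google SDK.

```python
# USD per million input tokens for Gemini 2.5 Flash, copied from the card above.
INPUT_RATE_PER_M = {
    "text": 0.30,
    "image": 0.30,
    "video": 0.30,
    "audio": 1.00,
}


def flash_input_cost(tokens_by_modality: dict) -> float:
    """Sum the input cost in USD across modalities.

    `tokens_by_modality` maps a modality name to its input token count,
    for example {"text": 50_000, "audio": 10_000}.
    """
    total = 0.0
    for modality, tokens in tokens_by_modality.items():
        if modality not in INPUT_RATE_PER_M:
            raise KeyError(f"unknown modality: {modality}")
        if tokens < 0:
            raise ValueError("token counts must be non-negative")
        total += tokens * INPUT_RATE_PER_M[modality]
    return total / 1_000_000
```

Output tokens are left out on purpose: the card gives a single output rate regardless of input modality.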
FAST · LIGHTWEIGHT

Gemini 2.5 Flash-Lite LAUNCHED JUL 2025

Standard text, image, and video input is $0.10/M; audio input is $0.30/M; output includes thinking tokens.

Input
$0.10/M
Output
$0.40/M
1M ctx · vision
PREVIEW · EARLY ACCESS

Gemini 3.1 Pro Preview LAUNCHED MAY 2026

Preview-stage Gemini 3.1 Pro.

Input
$2/M
Output
$12/M
1M ctx · vision
PREVIEW · EARLY ACCESS

Gemini 3.1 Flash-Lite Preview

Preview variant of Flash-Lite 3.1.

Input
$0.25/M
Output
$1.50/M
1M ctx · vision
PREVIEW · EARLY ACCESS

Gemini 3 Flash Preview LAUNCHED DEC 2025

Standard text, image, and video input is $0.50/M; audio input is $1/M; output includes thinking tokens.

Input
$0.50/M
Output
$3/M
1M ctx · vision
PREVIEW · EARLY ACCESS

Gemini 2.5 Flash-Lite Preview (09-2025)

Dated snapshot of 2.5 Flash-Lite from the September 2025 preview drop.

Input
$0.10/M
Output
$0.40/M
1M ctx · vision
PREVIEW · EARLY ACCESS

Gemini 2.5 Computer Use Preview

Specialised preview for computer-use / browser-agent loops.

Input
$1.25/M
Output
$10/M
documented elsewhere ctx · vision
DEPRECATED · 2026-02-18

Gemini 2.0 Flash LAUNCHED FEB 2025

Google lists Gemini 2.0 Flash at $0.10/M input for text, image, and video, $0.70/M audio input, $0.40/M output, and $0.025/M cached text/image/video input.

Input
$0.10/M
Output
$0.40/M
1M ctx · vision
DEPRECATED · 2026-02-18

Gemini 2.0 Flash-Lite LAUNCHED FEB 2025

Google lists Gemini 2.0 Flash-Lite at $0.075/M input and $0.30/M output.

Input
$0.075/M
Output
$0.30/M
1M ctx · vision
RETIRED · MAR 2026

Gemini 3 Pro Preview LAUNCHED NOV 2025

Archive price mirrors the Gemini 3 Pro Preview list rate preserved in the project snapshot: $2/M input, $0.20/M cached input, and $12/M output for prompts up to 200K tokens; tier 2 above 200K was $…

Input
$2/M
Output
$12/M
1M ctx · vision
RETIRED · SEP 2025

Gemini 1.5 Flash-8B LAUNCHED OCT 2024

Half-price variant of Gemini 1.5 Flash. At $0.0375/M input, this was the cheapest production-grade model on any first-party API in late 2024.

Input
$0.0375/M
Output
$0.15/M
1M ctx · vision
RETIRED · SEP 2025

Gemini 1.5 Pro LAUNCHED MAY 2024

Archive price uses the last-published project snapshot: $1.25/M input and $5/M output, with a higher >128K prompt tier at $2.50/M input and $10/M output.

Input
$1.25/M
Output
$5/M
2M ctx · vision
RETIRED · SEP 2025

Gemini 1.5 Flash LAUNCHED MAY 2024

Archive price uses the last-published project snapshot: $0.075/M input and $0.30/M output, with a higher >128K prompt tier at $0.15/M input and $0.60/M output.

Input
$0.075/M
Output
$0.30/M
1M ctx · vision
RETIRED · FEB 2025

Gemini 1.0 Pro LAUNCHED DEC 2023

Google's first general-availability Gemini API model — released December 13, 2023 as part of the original Gemini 1.0 family (alongside Ultra and Nano).

Input
$0.50/M
Output
$2/M
32K ctx · text-only
§ 02 / SHELF

All side-by-side.

Model | Input /M | Output /M | Cached | Context | Max output | Vision | Tools | Tier
Gemini 3.1 Flash-Lite | $0.25 | $1.50 | $0.025 (−90%) | 1M | | | | Light
Gemini 2.5 Pro | $1.25 | $10.00 | $0.125 (−90%) | 2M | | | | Frontier
Gemini 2.5 Flash | $0.30 | $2.50 | $0.03 (−90%) | 1M | | | | Mid
Gemini 2.5 Flash-Lite | $0.10 | $0.40 | $0.01 (−90%) | 1M | | | | Light
Gemini 3.1 Pro Preview FLAGSHIP | $2.00 | $12.00 | $0.20 (−90%) | 1M | | | | Preview
Gemini 3.1 Flash-Lite Preview PREVIEW | $0.25 | $1.50 | $0.025 (−90%) | 1M | | | | Preview
Gemini 3 Flash Preview PREVIEW | $0.50 | $3.00 | $0.05 (−90%) | 1M | | | | Preview
Gemini 2.5 Flash-Lite Preview (09-2025) PREVIEW | $0.10 | $0.40 | $0.01 (−90%) | 1M | | | | Preview
Gemini 2.5 Computer Use Preview PREVIEW | $1.25 | $10.00 | | documented elsewhere | | | | Preview
Gemini 2.0 Flash DEPRECATED | $0.10 | $0.40 | $0.025 (−75%) | 1M | | | | Deprecated
Gemini 2.0 Flash-Lite DEPRECATED | $0.075 | $0.30 | | 1M | | | | Deprecated
Gemini 3 Pro Preview RETIRED | Retired Mar 9, 2026 → gemini-3-1-pro-preview | | | | | | | Retired
Gemini 1.5 Flash-8B RETIRED | Retired Sep 29, 2025 → gemini-2-5-flash-lite | | | | | | | Retired
Gemini 1.5 Pro RETIRED | Retired Sep 29, 2025 → gemini-2-5-pro | | | | | | | Retired
Gemini 1.5 Flash RETIRED | Retired Sep 29, 2025 → gemini-2-5-flash | | | | | | | Retired
Gemini 1.0 Pro RETIRED | Retired Feb 18, 2025 → gemini-2-5-flash | | | | | | | Retired
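
The retired rows in the table double as a migration map. As a minimal sketch, the lookup below returns the listed successor for a retired model. The helper name `replacement_for` is hypothetical. The slugs are copied as they appear on this page; whether they match the model IDs the Gemini API accepts is an assumption to verify before use.

```python
from datetime import date
from typing import Optional

# Retired model -> (retirement date, successor slug), copied from the table above.
# Assumption: the slugs are this page's identifiers, not verified API model IDs.
RETIRED_MODELS = {
    "Gemini 3 Pro Preview": (date(2026, 3, 9), "gemini-3-1-pro-preview"),
    "Gemini 1.5 Flash-8B": (date(2025, 9, 29), "gemini-2-5-flash-lite"),
    "Gemini 1.5 Pro": (date(2025, 9, 29), "gemini-2-5-pro"),
    "Gemini 1.5 Flash": (date(2025, 9, 29), "gemini-2-5-flash"),
    "Gemini 1.0 Pro": (date(2025, 2, 18), "gemini-2-5-flash"),
}


def replacement_for(model_name: str, today: date) -> Optional[str]:
    """Return the successor slug if `model_name` is retired as of `today`.

    Returns None for models that are not in the retired list or whose
    retirement date has not yet been reached.
    """
    entry = RETIRED_MODELS.get(model_name)
    if entry is None:
        return None
    retired_on, successor = entry
    return successor if today >= retired_on else None
```

A caller could run this check at startup and log a warning, or fail fast, when a configured model has a successor listed.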

The last 12 months at Google DeepMind.

Major releases, deprecations, and corporate moves. Sourced from ai.google.dev changelog, Google DeepMind blog, and recent press.

MAY 07 · 2026
Gemini 3.1 Pro Preview + Gemini 3.1 Flash-Lite released — new preview surface for the 3.x line
RELEASE
APR 23 · 2026
Vertex AI rebranded as Gemini Enterprise Agent Platform — full agent stack, RAG, and security controls under one product
CORPORATE
MAR 09 · 2026
Gemini 3 Pro Preview retired — superseded by Gemini 3.1 Pro Preview ($2/$12 stays the same)
PRICING
MAR · 2026
Lyria 3 Pro released — extended text-to-music track creation
RELEASE
FEB 18 · 2026
Gemini 2.0 Flash + Flash-Lite deprecated — full retirement scheduled for Jun 1, 2026
PRICING
FEB · 2026
Lyria 3 launched — first text-to-music model in the family
RELEASE
DEC 17 · 2025
Gemini 3 Flash Preview released — 78% on SWE-bench Verified, beats Gemini 3 Pro on agentic coding at a quarter the price
RELEASE
NOV 18 · 2025
Gemini 3 Pro Preview released — first fully multimodal reasoning Pro tier
RELEASE
SEP 29 · 2025
Gemini 1.5 family retired — 1.5 Pro, 1.5 Flash, 1.5 Flash-8B shut down
PRICING
SEP · 2025
Gemini Robotics 1.5 released — embodied AI improvements for physical interaction
RELEASE
JUL 22 · 2025
Gemini 2.5 Flash-Lite GA — cheapest GA tier on the Gemini API at $0.10/$0.40
RELEASE
JUN 17 · 2025
Gemini 2.5 Pro + Flash GA — production-grade reasoning with 2M context on Pro
RELEASE
MAY · 2025
Veo 3 launched — video generation with synchronised audio
RELEASE
§ 04 / ACCESS

Where to get it.

§ 05 / BEST FOR

Which Google for what.

If you need the longest context window on a frontier model: Gemini 2.5 Pro (2M)
If you want frontier reasoning + multimodal in preview: Gemini 3.1 Pro Preview
If you need SWE-bench-class agentic coding at mid-tier cost: Gemini 3 Flash Preview
If you want a stable daily driver with thinking tokens: Gemini 2.5 Flash
If you're doing high-volume classification on the cheap: Gemini 2.5 Flash-Lite
If you're building a browser-agent / computer-use loop: Gemini 2.5 Computer Use Preview
If you need EU data residency for compliance: Gemini 2.5 Pro on Vertex europe-west4
§ 06 / BACKGROUND

The company behind it.

deepmind.google →

Google DeepMind is the AI research and product division of Alphabet Inc. (the holding company created in 2015 as Google's parent). It was formed in April 2023 by merging the original DeepMind with the previously separate Google Brain team inside Google. DeepMind was founded in London in November 2010 by Demis Hassabis, Shane Legg, and Mustafa Suleyman, and Google acquired it in January 2014 for a reported $400–650M. CEO: Demis Hassabis.

The merger ended a decade of internal duality. Pre-merger, Google ran two distinct AI orgs: Google Brain (Mountain View, headed by Jeff Dean) shipped TensorFlow, the original Transformer paper, and most of Google's product AI; DeepMind (London, headed by Hassabis) produced AlphaGo, AlphaFold, and most of the lab's public scientific breakthroughs. The merge consolidated leadership and unblocked the shipping cadence that produced the Gemini family.

Major technical contributions: AlphaGo / AlphaZero / MuZero (game-playing systems that beat world champions at Go, chess, shogi); AlphaFold (protein structure prediction — 2024 Nobel Prize in Chemistry for Hassabis and John Jumper); the original Transformer architecture (Vaswani et al., 2017, Google Brain); Gemini (multimodal LLM family, first released Dec 6, 2023); Imagen (image gen), Veo (video gen), and Lyria (music gen).

As an Alphabet subsidiary, Google DeepMind benefits from Alphabet's full revenue base (~$350B annual, 2025) and Google Cloud's compute footprint. It does not raise external venture funding the way Anthropic and OpenAI do — capital and TPU access come from inside the org. The catch: AI products compete for runway against Google Search, Google Cloud, and YouTube within the same P&L; product velocity has historically been slower than smaller pure-play labs.

Compared to OpenAI and Anthropic: the deepest multimodal stack (text + image + audio + video natively in one model), the longest context window on a flagship (2M on Gemini 2.5 Pro), and the broadest distribution via Workspace + Gemini app + Android. Weaker on coding-agent benchmarks (Anthropic still leads SWE-bench Verified; Gemini 3 Flash hits 78% vs Claude Opus 4.7's 87.6%) and on consumer brand recognition for ChatGPT-style chat.

§ 07 / COMPETITORS

Other frontier labs.


Frequently asked.

Practical questions about Google DeepMind, the Gemini lineup, and Vertex / Gemini Enterprise Agent Platform.

Q · 01 Which Gemini model should I start with?
For most production use cases: Gemini 2.5 Pro. It's GA-stable, has the longest context window in the lineup (2M), and at $1.25/$10 is cheaper than the 3.1 preview ($2/$12). Move up to Gemini 3.1 Pro Preview for the newest reasoning on the 3.x preview surface. Move down to Gemini 2.5 Flash for daily traffic and to 2.5 Flash-Lite for classification. For coding agents, Gemini 3 Flash Preview hits 78% on SWE-bench Verified at a quarter the cost of Gemini 3.1 Pro Preview ($0.50/$3 vs $2/$12).
Q · 02 How does Gemini compare to Claude and GPT?
Strongest at long context (2M on 2.5 Pro vs 1M / 200K on rivals), native multimodal (text + image + audio + video all in one model), and productivity-suite integration via Workspace. Weaker than Anthropic on coding-agent benchmarks (78% vs 87.6% on SWE-bench Verified). Less polished consumer chat product than OpenAI's ChatGPT despite the Gemini app's Workspace integration.
Q · 03 Does Google offer EU data residency?
Yes, with a caveat. Vertex AI / Gemini Enterprise Agent Platform exposes EU multi-region data residency at-rest, and Gemini 2.5 Pro + 2.0 Flash are available in europe-west4 (Netherlands) for GDPR-compliant workloads. Gemini 3.x is not yet in EU regions; if you need the newest models, you're still pinned to US / global endpoints, which the Vertex docs warn do not guarantee in-region processing.
Q · 04 What's the data retention policy?
On the paid Gemini API tier: prompts and responses are not used to improve models and are retained for a short period for abuse monitoring (Google does not publish a fixed number for the paid tier; check the latest Gemini API additional terms). On the free tier: prompts and responses are used to improve products. On Vertex / Gemini Enterprise: customer-controlled, no training on inputs, contractual data-residency commitments.
Q · 05 When will Gemini 3 hit general availability?
Most of the 3.x family is still in preview as of May 19, 2026; only Gemini 3.1 Flash-Lite has reached GA (May 7, 2026). Cadence so far: Gemini 3 Pro Preview shipped Nov 2025 and was retired Mar 2026 in favour of 3.1 Pro Preview; Gemini 3 Flash Preview shipped Dec 2025. GA for 3.1 Pro is expected later in 2026 once the preview surface stabilises, but Google has not announced a date.
Q · 06 Can I get a discount?
Three stackable discounts on the Gemini API: Batch −50% (24-hour async jobs), Flex −50% (best-effort latency), and context caching −90% on cached input tokens. The free tier on AI Studio is the most generous in the industry for development — large request quotas before paid tier kicks in. Vertex side adds committed-use discounts for enterprise contracts.
Q · 07 What about Gemini 1.5 / 2.0?
The Gemini 1.5 family (1.5 Pro, 1.5 Flash, 1.5 Flash-8B) was retired on Sep 29, 2025. Gemini 2.0 Flash and 2.0 Flash-Lite are deprecated as of Feb 18, 2026, with full shutdown scheduled for Jun 1, 2026; migrate to 2.5 Flash / 2.5 Flash-Lite. Both 2.0 slugs still take traffic until the shutdown date but are not recommended for new builds.
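
The discounts described under Q · 06 can be sketched as a small calculator. This is a hedged illustration rather than Google's billing formula. The function name `discounted_cost` and the `mode` parameter are hypothetical. It assumes Batch and Flex are alternative processing modes for a request and that the mode discount composes multiplicatively with the caching discount, which the word "stackable" suggests but the text does not spell out.

```python
# Discount fractions quoted in the FAQ answer.
BATCH_DISCOUNT = 0.50   # 24-hour async jobs
FLEX_DISCOUNT = 0.50    # best-effort latency
CACHE_DISCOUNT = 0.90   # applies to cached input tokens only

# Assumption: a request runs in exactly one of these modes.
MODE_FACTOR = {
    "standard": 1.0,
    "batch": 1.0 - BATCH_DISCOUNT,
    "flex": 1.0 - FLEX_DISCOUNT,
}


def discounted_cost(input_tokens: int, output_tokens: int,
                    input_per_m: float, output_per_m: float,
                    cached_tokens: int = 0, mode: str = "standard") -> float:
    """Estimate the USD cost of a request after caching and mode discounts.

    Assumption: the mode discount multiplies the whole bill, including
    the already-discounted cached input tokens.
    """
    if mode not in MODE_FACTOR:
        raise ValueError(f"unknown mode: {mode}")
    if not 0 <= cached_tokens <= input_tokens:
        raise ValueError("cached_tokens must be between 0 and input_tokens")
    fresh_tokens = input_tokens - cached_tokens
    cached_rate = input_per_m * (1.0 - CACHE_DISCOUNT)
    subtotal = (fresh_tokens * input_per_m
                + cached_tokens * cached_rate
                + output_tokens * output_per_m) / 1_000_000
    return subtotal * MODE_FACTOR[mode]
```

With the Gemini 2.5 Flash-Lite rates from the lineup ($0.10 input, $0.40 output), one million tokens each way costs $0.50 at list price under this model and half that in batch mode.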