VERIFIED PRICE · DATED PREVIEW · 1M · TEXT + VISION · PROMPT CACHING

Gemini 2.5 Flash-Lite Preview (09-2025) API Pricing

Gemini 2.5 Flash-Lite Preview 09-2025 is a dated, archived preview of Google's low-cost multimodal tier, priced at $0.10/M input, $0.40/M output, and $0.01/M cached input. It matches the GA Gemini 2.5 Flash-Lite rates and is best treated as a preview snapshot.

Input - per 1M tokens
$0.10/M
Source Google preview
Output - per 1M tokens
$0.40/M
Use for cost checks preview
Cached input
$0.01/M
Cache hit price discounted
Effective - agentic blend
$0.06/M
92/8 input/output split · 82% cache hit rate
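
As a rough sketch of how the blended figure can be reproduced, the snippet below assumes the "92/8 split" means 92% input tokens to 8% output tokens and that "82% cache" is the share of input tokens billed at the cached rate. The function name and parameters are illustrative, not taken from the page.

```python
def effective_rate(input_price, output_price, cached_price,
                   input_share=0.92, cache_hit_rate=0.82):
    """Blended $/M tokens across cached input, uncached input, and output.

    Assumed reading of the page's labels: input_share is the fraction of
    all tokens that are input, cache_hit_rate is the fraction of input
    tokens billed at the cached price.
    """
    output_share = 1.0 - input_share
    # Input tokens cost the cached price on a hit and the list price on a miss.
    blended_input = (cache_hit_rate * cached_price
                     + (1.0 - cache_hit_rate) * input_price)
    return input_share * blended_input + output_share * output_price


# Rates listed on this page, in dollars per 1M tokens.
rate = effective_rate(input_price=0.10, output_price=0.40, cached_price=0.01)
print(f"${rate:.2f}/M")  # prints "$0.06/M"
```

Under those assumptions the result rounds to the $0.06/M shown above. Lowering the cache hit rate or raising the output share moves the blend toward the list prices.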
§ 01 / TERMINAL

Run the numbers.

Calculator pre-loaded with Gemini 2.5 Flash-Lite Preview (09-2025) rates. Tweak spend, output mix, or cache hit rate to compare this model with current alternatives.

Inputs: monthly spend ($/mo), workload split, and prompt cache hit rate.
Outputs: tokens you can process, words equivalent (English), and effective rate.
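
The calculator's core arithmetic can be approximated as follows. This is a minimal sketch, not the page's actual implementation: `tokens_for_budget` is a hypothetical helper, and the 0.75 words-per-token ratio is an assumed rule of thumb for English rather than a figure from this page.

```python
WORDS_PER_TOKEN = 0.75  # assumption: common English rule of thumb, not from the page


def tokens_for_budget(monthly_budget_usd, effective_rate_per_m):
    """Tokens a monthly budget buys at a blended rate given in $ per 1M tokens."""
    if effective_rate_per_m <= 0:
        raise ValueError("effective rate must be positive")
    return monthly_budget_usd / effective_rate_per_m * 1_000_000


# Example: a $100/mo budget at the page's $0.06/M blended rate.
tokens = tokens_for_budget(100.0, 0.06)
words = tokens * WORDS_PER_TOKEN
print(f"{tokens:,.0f} tokens, roughly {words:,.0f} English words")
```

Because the blended rate already folds in the workload split and cache hit rate, changing either one only changes the rate passed in here.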
§ 02 / SCENARIOS

Real-world presets.

§ 03 / TAPE

Price history.

Gemini 2.5 Flash-Lite Preview (09-2025) is a preview model, archived at $0.10/M input and $0.40/M output.

Input · $0.10/M
Output · $0.40/M
Cached · $0.01/M
SEP 01 · September 2025 preview price at $0.10/M input and $0.40/M output
MAY 18 · Preview snapshot pricing verified on Google AI pricing
§ 04 / TOKENIZER

Paste text. See tokens. See cost.

Estimate · gemini-tokenizer-estimate · ≈4.2 chars/token

This is a chars-per-token approximation, not a real tokenizer. Actual tokens vary by language, code density, and tool-call overhead; counts are typically off by 10–20% for English prose, and by more for code or non-Latin scripts. For exact billing, use the vendor's official tokenizer.

Characters
Words
Tokens (estimated)
Cost as input · uncached
Cost as output · uncached
Cost as cached input
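
The estimate described above can be sketched in a few lines. The 4.2 chars/token ratio and the three prices come from this page; the `estimate` function, its return shape, and the choice to round to the nearest token are assumptions made for illustration.

```python
CHARS_PER_TOKEN = 4.2  # the page's stated approximation, not a real tokenizer

# Dollars per 1M tokens, as listed on this page.
PRICES_PER_M = {"input": 0.10, "output": 0.40, "cached": 0.01}


def estimate(text):
    """Approximate token count and cost for a piece of text."""
    chars = len(text)
    words = len(text.split())
    # Rounding to the nearest token is an assumption; the page does not say.
    tokens = round(chars / CHARS_PER_TOKEN)
    costs = {kind: tokens * price / 1_000_000
             for kind, price in PRICES_PER_M.items()}
    return {"characters": chars, "words": words, "tokens": tokens, "costs": costs}


result = estimate("Paste text. See tokens. See cost.")
print(result["tokens"], result["costs"])
```

As the caveat above notes, a character ratio like this drifts for code and non-Latin scripts, so it suits rough budgeting rather than billing reconciliation.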
§ 05 / SHELF

Up against the shelf.

| Model | Input /M | Cached input /M | Output /M | Effective blended /M | Context | Best for |
| --- | --- | --- | --- | --- | --- | --- |
| Gemini 2.5 Flash-Lite Preview (09-2025) | $0.10 | $0.01 | $0.40 | $0.06 (current page) | 1M | Gemini multimodal workloads |
| Gemini 2.5 Pro | $1.25 | $0.13 | $10.00 | $1.10 (pricier) | 2M | Gemini multimodal workloads |
| Gemini 2.5 Flash | $0.30 | $0.03 | $2.50 | $0.27 (pricier) | 1M | Gemini multimodal workloads |
| Gemini 2.5 Flash-Lite | $0.10 | $0.01 | $0.40 | $0.06 (same blend) | 1M | Gemini multimodal workloads |
| GPT-5.4 | $2.50 | $0.25 | $15.00 | $1.80 (pricier) | 1.05M | Current OpenAI replacement |
| Claude Sonnet 4.6 | $3.00 | $0.30 | $15.00 | $1.92 (pricier) | 1M | Current Claude agents |
| DeepSeek V4 Pro | $0.43 | $0.00 | $0.87 | $0.14 (pricier) | 1M | Low-cost reasoning and coding |

Frequently asked.

Short answers for teams checking Gemini 2.5 Flash-Lite Preview (09-2025) pricing, status, and migration choices.

Q · 01 Is Gemini 2.5 Flash-Lite Preview (09-2025) still available?
Gemini 2.5 Flash-Lite Preview (09-2025) is a preview model. Use it when you intentionally need this preview surface; compare stable rows before production routing.
Q · 02 How much does Gemini 2.5 Flash-Lite Preview (09-2025) cost?
Gemini 2.5 Flash-Lite Preview (09-2025) is listed at $0.10/M input and $0.40/M output.
Q · 03 Is cached-input pricing included?
Yes. Cached input is shown at $0.01/M.
Q · 04 What should teams compare it against?
Use the shelf to compare current replacements and nearby models. Snapshot replacement: current same-provider models.
Q · 05 Why keep this page?
These pages preserve price history, preview economics, and migration context without pretending every older or preview model is the best current default.