Last verified
FRONTIER DAILY DRIVER1.05M CONTEXTTEXT + VISIONPROMPT CACHINGPRIORITY AVAILABLE

GPT-5.4 API Pricing

OpenAI's more affordable frontier model for coding, professional work, and agentic workflows: $2.50/M input, $15/M output, and $0.25/M cached input in the standard short-context tier. OpenAI lists a separate long-context tier above 272K input tokens.

Input - per 1M tokens
$2.50/M
Short context up to 272K input standard
Output - per 1M tokens
$15.00/M
Long context output is $22.50/M standard
Cached input - 90% off
$0.25/M
Prompt cache hit price -90%
Effective - agentic blend
$1.80/M
92/8 split - 82% cache
§ 01 / TERMINAL

Run the numbers.

Live calculator pre-loaded with current GPT-5.4 standard short-context rates. For prompts above 272K input tokens, use the long-context tier shown in the FAQ.

$ /mo
Workload split
Prompt cache hit rate
Tokens you can process
Words equivalent (English)
Effective rate
§ 02 / SCENARIOS

Real-world presets.

§ 03 / TAPE

Price history.

API list price is still $2.50/M input and $15/M output since GPT-5.4 launch.

Input · $2.5/M
Output · $15/M
Cached · $0.25/M
MAR 05 API availability at $2.50/M input and $15/M outputMAY 18 Verified unchanged on OpenAI pricing docs
§ 04 / TOKENIZER

Paste text. See tokens. See cost.

Estimate · tiktoken-o200k_base · ≈3.85 chars/token Auto-counts as you type

This is a chars-per-token approximation, not a real tokenizer. Actual tokens vary by language, code density, and tool-call overhead — counts are typically ±10–20% off for English prose, more for code or non-Latin scripts. For exact billing, use the vendor's official tokenizer.

Characters
Words
Tokens (estimated)
Cost as input · uncached
Cost as output · uncached
Cost as cached input
§ 05 / SHELF

Up against the shelf.

All models →
Model Input /M Output /M Effective blended Context Best for
GPT-5.4 Current $2.50 cache $0.25 $15.00 $1.80 agentic 92/8 1.05M Affordable OpenAI frontier work
GPT-5.5 $5.00 cache $0.50 $30.00 $3.61 higher flagship tier 1M Frontier coding and professional agents
GPT-5.4 Pro $30.00 $180 $42.00 no cache discount 1.05M Hardest GPT-5.4 reasoning
GPT-5.4 mini $0.75 cache $0.07 $4.50 $0.54 same blend 400K Lower-latency subagents
Claude Opus 4.7 $5.00 cache $0.50 $25.00 $3.21 Anthropic frontier 1M Anthropic frontier reasoning
Gemini 2.5 Pro $1.25 cache $0.13 $10.00 $1.10 tier 1 pricing 2M Large multimodal context
DeepSeek V4 Pro $0.43 cache $0.00 $0.87 $0.14 promo price 1M Low-cost reasoning workloads
DeepSeek V4 Flash $0.14 cache $0.00 $0.28 $0.05 same blend 1M Bulk low-cost traffic

Frequently asked.

Practical GPT-5.4 pricing questions, with OpenAI's published rates separated from workload assumptions.

Q · 01 What is the standard GPT-5.4 API price? +
OpenAI lists gpt-5.4 at $2.50/M input tokens, $0.25/M cached input tokens, and $15/M output tokens in the standard short-context tier. This page stores the public list price in USD and marks the source as OpenAI's pricing docs.
Q · 02 When does GPT-5.4 long-context pricing apply? +
OpenAI separates short-context and long-context pricing for GPT-5.4. For prompts above 272K input tokens, the long-context tier is listed at $5/M input, $0.50/M cached input, and $22.50/M output. The calculator on this page uses the standard short-context tier unless you open a long-context variant.
Q · 03 How much cheaper are Batch and Flex? +
OpenAI lists Batch and Flex for gpt-5.4 at half the standard short-context rate: $1.25/M input, $0.13/M cached input, and $7.50/M output. Long-context Batch/Flex is listed at $2.50/M input, $0.25/M cached input, and $11.25/M output.
Q · 04 What does Priority processing cost? +
The Priority table lists gpt-5.4 at $5/M input, $0.50/M cached input, and $30/M output. That is 2x the standard short-context list price, so reserve it for latency-sensitive production traffic rather than batch analysis.
Q · 05 Does GPT-5.4 have regional pricing? +
Yes. OpenAI states that regional processing/data-residency endpoints carry a 10% uplift for gpt-5.4 and related GPT-5.4/5.5 models. Standard global routing uses the public standard rate.
Q · 06 Is GPT-5.4 cheaper than GPT-5.5? +
Yes. GPT-5.4 is half the GPT-5.5 list price at $2.50/M input, $0.25/M cached input, and $15/M output. In the shared 92/8 agentic blend with 82% cache hits, GPT-5.4 lands near $1.80/M while GPT-5.5 lands near $3.61/M.
Q · 07 What release date is used here? +
OpenAI published the GPT-5.4 release on March 5, 2026 and said GPT-5.4 was available in the API as gpt-5.4. This page uses 2026-03-05 as the API release date.
Q · 08 How accurate is the tokenizer estimate? +
The live widget uses an estimated 4.875 characters per token for English text and labels the tokenizer as tiktoken-o200k_base. It is good for budget planning, but exact billing can differ by language, whitespace, code, tool calls, and image inputs.