Question 1

What is the standard GPT-5.4 API price?

Accepted Answer

OpenAI lists gpt-5.4 at $2.50/M input tokens, $0.25/M cached input tokens, and $15/M output tokens in the standard short-context tier. This page stores the public list price in USD and marks the source as OpenAI's pricing docs.

Question 2

When does GPT-5.4 long-context pricing apply?

Accepted Answer

OpenAI separates short-context and long-context pricing for GPT-5.4. For prompts above 272K input tokens, the long-context tier is listed at $5/M input, $0.50/M cached input, and $22.50/M output. The calculator on this page uses the standard short-context tier unless you open a long-context variant.

Question 3

How much cheaper are Batch and Flex?

Accepted Answer

OpenAI lists Batch and Flex for gpt-5.4 at half the standard short-context rate: $1.25/M input, $0.13/M cached input, and $7.50/M output. Long-context Batch/Flex is listed at $2.50/M input, $0.25/M cached input, and $11.25/M output.

Question 4

What does Priority processing cost?

Accepted Answer

The Priority table lists gpt-5.4 at $5/M input, $0.50/M cached input, and $30/M output. That is 2x the standard short-context list price, so reserve it for latency-sensitive production traffic rather than batch analysis.

Question 5

Does GPT-5.4 have regional pricing?

Accepted Answer

Yes. OpenAI states that regional processing/data-residency endpoints carry a 10% uplift for gpt-5.4 and related GPT-5.4/5.5 models. Standard global routing uses the public standard rate.

Question 6

Is GPT-5.4 cheaper than GPT-5.5?

Accepted Answer

Yes. GPT-5.4 is half the GPT-5.5 list price at $2.50/M input, $0.25/M cached input, and $15/M output. In the shared 92/8 agentic blend with 82% cache hits, GPT-5.4 lands near $1.80/M while GPT-5.5 lands near $3.61/M.

Question 7

What release date is used here?

Accepted Answer

OpenAI published the GPT-5.4 release on March 5, 2026 and said GPT-5.4 was available in the API as gpt-5.4. This page uses 2026-03-05 as the API release date.

Question 8

How accurate is the tokenizer estimate?

Accepted Answer

The live widget uses an estimated 4.875 characters per token for English text and labels the tokenizer as tiktoken-o200k_base. It is good for budget planning, but exact billing can differ by language, whitespace, code, tool calls, and image inputs.

Model	Input /M	Output /M	Effective blended	Context	Best for
GPT-5.4 Current	$2.50 cache $0.25	$15.00	$1.80 agentic 92/8	1.05M	Affordable OpenAI frontier work
GPT-5.5	$5.00 cache $0.50	$30.00	$3.61 higher flagship tier	1M	Frontier coding and professional agents
GPT-5.4 Pro	$30.00	$180	$42.00 no cache discount	1.05M	Hardest GPT-5.4 reasoning
GPT-5.4 mini	$0.75 cache $0.07	$4.50	$0.54 same blend	400K	Lower-latency subagents
Claude Opus 4.7	$5.00 cache $0.50	$25.00	$3.21 Anthropic frontier	1M	Anthropic frontier reasoning
Gemini 2.5 Pro	$1.25 cache $0.13	$10.00	$1.10 tier 1 pricing	2M	Large multimodal context
DeepSeek V4 Pro	$0.43 cache $0.00	$0.87	$0.14 promo price	1M	Low-cost reasoning workloads
DeepSeek V4 Flash	$0.14 cache $0.00	$0.28	$0.05 same blend	1M	Bulk low-cost traffic

GPT-5.4 API Pricing

Run the numbers.

Real-world presets.

Professional workflow assistant

Repo-wide feature build

Reading 150-page contracts

Multi-step market brief

Premium support ticket

Paste text. See tokens. See cost.

Up against the shelf.

Frequently asked.