Question 1

What is the standard GPT-5.4 mini API price?

Accepted Answer

OpenAI lists gpt-5.4-mini at $0.75/M input tokens, $0.075/M cached input tokens, and $4.50/M output tokens. This page stores the public list price in USD and marks the source as OpenAI's pricing docs.

Question 2

How much cheaper is GPT-5.4 mini than GPT-5.4?

Accepted Answer

GPT-5.4 mini is 3.3x cheaper than GPT-5.4 on the shared agentic blend: about $0.54/M versus $1.80/M. The tradeoff is model capability and context: mini has a 400K context window rather than GPT-5.4's larger 1.05M window.

Question 3

How much cheaper are Batch and Flex?

Accepted Answer

OpenAI lists Batch and Flex for gpt-5.4-mini at half the standard rate: $0.375/M input, $0.0375/M cached input, and $2.25/M output. That makes latency-tolerant subagent and support workloads materially cheaper.

Question 4

What does Priority processing cost?

Accepted Answer

The Priority table lists gpt-5.4-mini at $1.50/M input, $0.15/M cached input, and $9/M output. That is 2x the standard price, so use it only when latency matters.

Question 5

Does GPT-5.4 mini have regional pricing?

Accepted Answer

Yes. OpenAI states that regional processing/data-residency endpoints carry a 10% uplift for gpt-5.4-mini and related GPT-5.4/5.5 models. Standard global routing uses the public standard rate.

Question 6

Does GPT-5.4 mini support computer use?

Accepted Answer

Yes. OpenAI's model page lists computer use among the tools supported by gpt-5.4-mini in the Responses API. GPT-5.4 nano's model page does not list computer use support, which is one practical reason to pick mini for browser or desktop automation.

Question 7

What release date is used here?

Accepted Answer

OpenAI's model page lists the first GPT-5.4 mini snapshot as gpt-5.4-mini-2026-03-17. This page uses 2026-03-17 as the model release date for pricing-history purposes.

Question 8

How accurate is the tokenizer estimate?

Accepted Answer

The live widget uses an estimated 4.875 characters per token for English text and labels the tokenizer as tiktoken-o200k_base. It is good for budget planning, but exact billing can differ by language, whitespace, code, tool calls, and image inputs.

Model	Input /M	Output /M	Effective blended	Context	Best for
GPT-5.4 mini Current	$0.75 cache $0.07	$4.50	$0.54 agentic 92/8	400K	Subagents and computer-use tasks
GPT-5.4	$2.50 cache $0.25	$15.00	$1.80 larger model	1.05M	Affordable OpenAI frontier work
GPT-5.4 nano	$0.20 cache $0.02	$1.25	$0.15 same blend	400K	Classification and extraction
GPT-5.3-Codex	$1.75 cache $0.17	$14.00	$1.54 coding specialist	400K	Agentic coding tasks
Gemini 2.5 Pro	$1.25 cache $0.13	$10.00	$1.10 tier 1 pricing	2M	Large multimodal context
DeepSeek V4 Pro	$0.43 cache $0.00	$0.87	$0.14 promo price	1M	Low-cost reasoning workloads
DeepSeek V4 Flash	$0.14 cache $0.00	$0.28	$0.05 same blend	1M	Bulk low-cost traffic

GPT-5.4 mini API Pricing

Run the numbers.

Real-world presets.

Product support assistant

Coding helper subtask

Knowledge-base support

Invoice data extraction

Browser workflow automation

Paste text. See tokens. See cost.

Up against the shelf.

Frequently asked.