GPT-5.4 API Pricing
OpenAI's more affordable frontier model for coding, professional work, and agentic workflows: $2.50/M input, $15/M output, and $0.25/M cached input in the standard short-context tier. OpenAI lists a separate long-context tier above 272K input tokens.
Run the numbers.
Live calculator pre-loaded with current GPT-5.4 standard short-context rates. For prompts above 272K input tokens, use the long-context tier shown in the FAQ.
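As a rough illustration of what the calculator computes, here is a minimal Python sketch using the short-context rates above and the long-context rates quoted in the FAQ. The names `Rates` and `request_cost` are hypothetical. The sketch also assumes that a request whose total input exceeds 272K tokens is billed entirely at the long-context rate, and that cached tokens count toward that threshold. Neither assumption is confirmed on this page.

```python
from dataclasses import dataclass

# Input-token threshold above which the long-context tier applies (from this page).
LONG_CONTEXT_THRESHOLD = 272_000


@dataclass(frozen=True)
class Rates:
    """List prices in USD per million tokens."""
    input: float
    cached_input: float
    output: float


SHORT_CONTEXT = Rates(input=2.50, cached_input=0.25, output=15.00)
LONG_CONTEXT = Rates(input=5.00, cached_input=0.50, output=22.50)


def request_cost(input_tokens: int, output_tokens: int, cached_tokens: int = 0) -> float:
    """Estimate the USD cost of one request.

    `cached_tokens` is the part of `input_tokens` served from the prompt cache.
    Assumption: the whole request moves to the long-context rates once
    `input_tokens` exceeds the threshold.
    """
    if not 0 <= cached_tokens <= input_tokens:
        raise ValueError("cached_tokens must be between 0 and input_tokens")
    if output_tokens < 0:
        raise ValueError("output_tokens must be non-negative")

    rates = LONG_CONTEXT if input_tokens > LONG_CONTEXT_THRESHOLD else SHORT_CONTEXT
    fresh_tokens = input_tokens - cached_tokens
    total = (
        fresh_tokens * rates.input
        + cached_tokens * rates.cached_input
        + output_tokens * rates.output
    )
    return total / 1_000_000


if __name__ == "__main__":
    # 100K input tokens (80K of them cached) and 5K output tokens, short-context tier.
    print(f"${request_cost(100_000, 5_000, cached_tokens=80_000):.3f}")
    # 300K input tokens and 10K output tokens, long-context tier.
    print(f"${request_cost(300_000, 10_000):.3f}")
```

Keeping the two tiers as separate `Rates` values rather than a multiplier reflects that the long-context output rate ($22.50/M) is 1.5x the short-context rate while the input rates are 2x.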
Real-world presets.
Professional workflow assistant
Repo-wide feature build
Reading 150-page contracts
Multi-step market brief
Premium support ticket
Paste text. See tokens. See cost.
This is a chars-per-token approximation, not a real tokenizer. Actual tokens vary by language, code density, and tool-call overhead — counts are typically ±10–20% off for English prose, more for code or non-Latin scripts. For exact billing, use the vendor's official tokenizer.
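A chars-per-token estimate of this kind can be sketched in a few lines of Python. The 4.875 characters-per-token ratio is the English-text figure this page reports using. The function names and the choice to round up are illustrative assumptions, not the page's actual implementation.

```python
import math

# English-text ratio this page reports for its estimator.
CHARS_PER_TOKEN = 4.875

# Standard short-context input list price in USD per million tokens.
INPUT_RATE_PER_M = 2.50


def estimate_tokens(text: str, chars_per_token: float = CHARS_PER_TOKEN) -> int:
    """Approximate the token count from character length (not a real tokenizer)."""
    if chars_per_token <= 0:
        raise ValueError("chars_per_token must be positive")
    # Rounding up is an assumption; it keeps any non-empty text at >= 1 token.
    return math.ceil(len(text) / chars_per_token)


def estimate_input_cost(text: str, rate_per_million: float = INPUT_RATE_PER_M) -> float:
    """Approximate the USD input cost of sending `text` once, uncached."""
    return estimate_tokens(text) * rate_per_million / 1_000_000


if __name__ == "__main__":
    sample = "Summarize the attached contract and list every termination clause."
    print(estimate_tokens(sample), "tokens (approx.)")
    print(f"${estimate_input_cost(sample):.6f} input cost (approx.)")
```

For code or non-Latin scripts, passing a smaller `chars_per_token` gives a more conservative (higher) token count.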
| Model | Input /M | Cached input /M | Output /M | Effective blended /M | Blend note | Context (tokens) | Best for |
|---|---|---|---|---|---|---|---|
| GPT-5.4 (current) | $2.50 | $0.25 | $15.00 | $1.80 | agentic 92/8 | 1.05M | Affordable OpenAI frontier work |
| GPT-5.5 | $5.00 | $0.50 | $30.00 | $3.61 | higher flagship tier | 1M | Frontier coding and professional agents |
| GPT-5.4 Pro | $30.00 | n/a | $180.00 | $42.00 | no cache discount | 1.05M | Hardest GPT-5.4 reasoning |
| GPT-5.4 mini | $0.75 | $0.07 | $4.50 | $0.54 | same blend | 400K | Lower-latency subagents |
| Claude Opus 4.7 | $5.00 | $0.50 | $25.00 | $3.21 | Anthropic frontier | 1M | Anthropic frontier reasoning |
| Gemini 2.5 Pro | $1.25 | $0.13 | $10.00 | $1.10 | tier 1 pricing | 2M | Large multimodal context |
| DeepSeek V4 Pro | $0.43 | $0.00 | $0.87 | $0.14 | promo price | 1M | Low-cost reasoning workloads |
| DeepSeek V4 Flash | $0.14 | $0.00 | $0.28 | $0.05 | same blend | 1M | Bulk low-cost traffic |
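The "Effective blended" figures appear consistent with a simple weighted average: 92% of tokens are input and 8% are output, and 82% of input tokens hit the cache (the cache-hit share is the one quoted in the FAQ). The sketch below is a reconstruction under that reading, not a formula published on this page, and `blended_rate` is a hypothetical name.

```python
from typing import Optional


def blended_rate(
    input_rate: float,
    output_rate: float,
    cached_rate: Optional[float] = None,
    input_share: float = 0.92,
    cache_hit: float = 0.82,
) -> float:
    """Blended USD price per million tokens for a mixed input/output workload.

    input_share: fraction of all tokens that are input (the "92" in 92/8).
    cache_hit:   fraction of input tokens billed at the cached rate.
    cached_rate: pass None for models with no cache discount.
    """
    if cached_rate is None:
        cached_rate = input_rate  # no discount: cached tokens cost the full rate
    effective_input = cache_hit * cached_rate + (1 - cache_hit) * input_rate
    return input_share * effective_input + (1 - input_share) * output_rate


if __name__ == "__main__":
    print(f"GPT-5.4:     ${blended_rate(2.50, 15.00, 0.25):.2f}/M")
    print(f"GPT-5.5:     ${blended_rate(5.00, 30.00, 0.50):.2f}/M")
    print(f"GPT-5.4 Pro: ${blended_rate(30.00, 180.00):.2f}/M")
```

Under these assumptions the function returns about $1.80/M for GPT-5.4, $3.61/M for GPT-5.5, and $42.00/M for GPT-5.4 Pro, matching the table. A workload with fewer cache hits or a larger output share will land noticeably higher, so adjust `input_share` and `cache_hit` to your own traffic before budgeting.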
Frequently asked.
Practical GPT-5.4 pricing questions, with OpenAI's published rates separated from workload assumptions.
Q · 01 What is the standard GPT-5.4 API price?
OpenAI lists gpt-5.4 at $2.50/M input tokens, $0.25/M cached input tokens, and $15/M output tokens in the standard short-context tier. This page stores the public list price in USD and marks the source as OpenAI's pricing docs.
Q · 02 When does GPT-5.4 long-context pricing apply?
Above 272K input tokens, the long-context tier is listed at $5/M input, $0.50/M cached input, and $22.50/M output. The calculator on this page uses the standard short-context tier unless you open a long-context variant.
Q · 03 How much cheaper are Batch and Flex?
Batch and Flex are listed for gpt-5.4 at half the standard short-context rate: $1.25/M input, $0.13/M cached input, and $7.50/M output. Long-context Batch/Flex is listed at $2.50/M input, $0.25/M cached input, and $11.25/M output.
Q · 04 What does Priority processing cost?
Priority processing is listed for gpt-5.4 at $5/M input, $0.50/M cached input, and $30/M output. That is 2x the standard short-context list price, so reserve it for latency-sensitive production traffic rather than batch analysis.
Q · 05 Does GPT-5.4 have regional pricing?
Regional pricing is listed as a 10% uplift for gpt-5.4 and related GPT-5.4/5.5 models. Standard global routing uses the public standard rate.
Q · 06 Is GPT-5.4 cheaper than GPT-5.5?
Yes, at list price. GPT-5.4 is listed at $2.50/M input, $0.25/M cached input, and $15/M output, half of GPT-5.5's $5/M, $0.50/M, and $30/M. In the shared 92/8 agentic blend with 82% cache hits, GPT-5.4 lands near $1.80/M while GPT-5.5 lands near $3.61/M.
Q · 07 What release date is used here?
This page uses 2026-03-05 as the API release date for gpt-5.4.
Q · 08 How accurate is the tokenizer estimate?
The estimator on this page assumes 4.875 characters per token for English text and labels the tokenizer as tiktoken-o200k_base. It is good for budget planning, but exact billing can differ by language, whitespace, code, tool calls, and image inputs.
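To compare the service tiers quoted in the answers above on a single workload, the listed rates can be collected into a lookup table. This is a minimal sketch: the names `RATES` and `workload_cost` are hypothetical, token counts are meant as aggregates over many requests, and applying the 10% regional uplift as a flat multiplier on the total is an assumption. No long-context Priority rate appears on this page, so that combination is deliberately left out.

```python
# USD per million tokens as (input, cached input, output), taken from the FAQ.
RATES = {
    ("standard", "short"): (2.50, 0.25, 15.00),
    ("standard", "long"): (5.00, 0.50, 22.50),
    ("batch_flex", "short"): (1.25, 0.13, 7.50),
    ("batch_flex", "long"): (2.50, 0.25, 11.25),
    ("priority", "short"): (5.00, 0.50, 30.00),
}

# Assumption: the regional uplift multiplies the whole bill by 1.10.
REGIONAL_UPLIFT = 1.10


def workload_cost(
    service: str,
    context: str,
    fresh_input: int,
    cached_input: int,
    output: int,
    regional: bool = False,
) -> float:
    """Estimate USD cost for aggregate token counts under one service tier."""
    try:
        in_rate, cached_rate, out_rate = RATES[(service, context)]
    except KeyError:
        raise ValueError(f"no listed rate for {service!r} / {context!r}") from None
    usd = (fresh_input * in_rate + cached_input * cached_rate + output * out_rate) / 1_000_000
    return usd * REGIONAL_UPLIFT if regional else usd


if __name__ == "__main__":
    # 200K uncached input, 800K cached input, 100K output tokens in total.
    for service in ("batch_flex", "standard", "priority"):
        usd = workload_cost(service, "short", 200_000, 800_000, 100_000)
        print(f"{service:<10} ${usd:.2f}")
```

Storing the listed prices directly, rather than deriving them from multipliers, keeps rounded entries such as the $0.13/M Batch cached rate exactly as published.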