GPT-5.5 API Pricing
OpenAI's flagship API model for complex reasoning and coding: $5/M input, $30/M output, and $0.50/M cached input in the standard short-context tier. OpenAI lists a separate long-context tier above 272K input tokens.
Run the numbers.
Live calculator pre-loaded with current GPT-5.5 standard short-context rates. For prompts above 272K input tokens, use the long-context tier shown in the FAQ.
Real-world presets.
Executive workflow assistant
Repo-wide feature build
Reading 150-page contracts
Multi-step market brief
Premium support ticket
Price history.
Paste text. See tokens. See cost.
This is a chars-per-token approximation, not a real tokenizer. Actual tokens vary by language, code density, and tool-call overhead — counts are typically ±10–20% off for English prose, more for code or non-Latin scripts. For exact billing, use the vendor's official tokenizer.
| Model | Input /M | Output /M | Effective blended | Context | Best for |
|---|---|---|---|---|---|
| GPT-5.5 Current | $5.00 cache $0.50 | $30.00 | $3.61 agentic 92/8 | 1M | Frontier coding and professional agents |
| GPT-5.4 | $2.50 cache $0.25 | $15.00 | $1.80 same blend | 1M | Cheaper GPT daily-driver work |
| GPT-5.4 mini | $0.75 cache $0.07 | $4.50 | $0.54 same blend | 400K | Lower-latency subagents |
| Claude Opus 4.7 | $5.00 cache $0.50 | $25.00 | $3.21 same input, cheaper output | 1M | Anthropic frontier reasoning |
| Gemini 2.5 Pro | $1.25 cache $0.13 | $10.00 | $1.10 tier 1 pricing | 2M | Large multimodal context |
| DeepSeek V4 Pro | $0.43 cache $0.00 | $0.87 | $0.14 promo price | 1M | Low-cost reasoning workloads |
| DeepSeek V4 Flash | $0.14 cache $0.00 | $0.28 | $0.05 same blend | 1M | Bulk low-cost traffic |
Frequently asked.
Practical GPT-5.5 pricing questions, with OpenAI's published rates separated from workload assumptions.
Q · 01 What is the standard GPT-5.5 API price? +
gpt-5.5 at $5/M input tokens, $0.50/M cached input tokens, and $30/M output tokens in the standard short-context tier. This page stores the public list price in USD and marks the source as OpenAI's pricing docs.Q · 02 When does GPT-5.5 long-context pricing apply? +
gpt-5.5, the long-context tier applies above 272K input tokens and is listed at $10/M input, $1/M cached input, and $45/M output. The calculator on this page uses the standard short-context tier; use the long-context calculator link for requests above that threshold.Q · 03 How much cheaper are Batch and Flex? +
gpt-5.5 at half the standard short-context rate: $2.50/M input, $0.25/M cached input, and $15/M output. That makes offline jobs and latency-tolerant agent runs materially cheaper than standard synchronous calls.Q · 04 What does Priority processing cost? +
gpt-5.5 at $12.50/M input, $1.25/M cached input, and $75/M output. That is 2.5x the standard short-context list price, so reserve it for latency-sensitive production traffic rather than batch analysis.Q · 05 Does GPT-5.5 have regional pricing? +
10% uplift for gpt-5.5 and related GPT-5.4/5.5 models. Standard global routing uses the public standard rate.Q · 06 Is GPT-5.5 cheaper than GPT-5.4? +
$2.50/M input, $0.25/M cached input, and $15/M output. In the shared 92/8 agentic blend with 82% cache hits, GPT-5.5 lands near $3.61/M while GPT-5.4 lands near $1.80/M.Q · 07 What release date is used here? +
2026-04-24 as the API release date.Q · 08 How accurate is the tokenizer estimate? +
4.875 characters per token for English text and labels the tokenizer as tiktoken-o200k_base. It is good for budget planning, but exact billing can differ by language, whitespace, code, tool calls, and image inputs.