Last verified
FRONTIER REASONING1M CONTEXTTEXT + VISIONPROMPT CACHINGBATCH -50%

GPT-5.5 API Pricing

OpenAI's flagship API model for complex reasoning and coding: $5/M input, $30/M output, and $0.50/M cached input in the standard short-context tier. OpenAI lists a separate long-context tier above 272K input tokens.

Input - per 1M tokens
$5.00/M
Short context up to 272K input standard
Output - per 1M tokens
$30.00/M
Long context output is $45/M standard
Cached input - 90% off
$0.50/M
Prompt cache hit price -90%
Effective - agentic blend
$3.61/M
92/8 split - 82% cache
§ 01 / TERMINAL

Run the numbers.

Live calculator pre-loaded with current GPT-5.5 standard short-context rates. For prompts above 272K input tokens, use the long-context tier shown in the FAQ.

$ /mo
Workload split
Prompt cache hit rate
Tokens you can process
Words equivalent (English)
Effective rate
§ 02 / SCENARIOS

Real-world presets.

§ 03 / TAPE

Price history.

API list price is still $5/M input and $30/M output since GPT-5.5 API availability.

Input · $5/M
Output · $30/M
Cached · $0.50/M
APR 24 API availability at $5/M input and $30/M outputMAY 17 Verified unchanged on OpenAI pricing docs
§ 04 / TOKENIZER

Paste text. See tokens. See cost.

Estimate · tiktoken-o200k_base · ≈3.85 chars/token Auto-counts as you type

This is a chars-per-token approximation, not a real tokenizer. Actual tokens vary by language, code density, and tool-call overhead — counts are typically ±10–20% off for English prose, more for code or non-Latin scripts. For exact billing, use the vendor's official tokenizer.

Characters
Words
Tokens (estimated)
Cost as input · uncached
Cost as output · uncached
Cost as cached input
§ 05 / SHELF

Up against the shelf.

All models →
Model Input /M Output /M Effective blended Context Best for
GPT-5.5 Current $5.00 cache $0.50 $30.00 $3.61 agentic 92/8 1M Frontier coding and professional agents
GPT-5.4 $2.50 cache $0.25 $15.00 $1.80 same blend 1M Cheaper GPT daily-driver work
GPT-5.4 mini $0.75 cache $0.07 $4.50 $0.54 same blend 400K Lower-latency subagents
Claude Opus 4.7 $5.00 cache $0.50 $25.00 $3.21 same input, cheaper output 1M Anthropic frontier reasoning
Gemini 2.5 Pro $1.25 cache $0.13 $10.00 $1.10 tier 1 pricing 2M Large multimodal context
DeepSeek V4 Pro $0.43 cache $0.00 $0.87 $0.14 promo price 1M Low-cost reasoning workloads
DeepSeek V4 Flash $0.14 cache $0.00 $0.28 $0.05 same blend 1M Bulk low-cost traffic

Frequently asked.

Practical GPT-5.5 pricing questions, with OpenAI's published rates separated from workload assumptions.

Q · 01 What is the standard GPT-5.5 API price? +
OpenAI lists gpt-5.5 at $5/M input tokens, $0.50/M cached input tokens, and $30/M output tokens in the standard short-context tier. This page stores the public list price in USD and marks the source as OpenAI's pricing docs.
Q · 02 When does GPT-5.5 long-context pricing apply? +
OpenAI separates short-context and long-context pricing. For gpt-5.5, the long-context tier applies above 272K input tokens and is listed at $10/M input, $1/M cached input, and $45/M output. The calculator on this page uses the standard short-context tier; use the long-context calculator link for requests above that threshold.
Q · 03 How much cheaper are Batch and Flex? +
OpenAI lists Batch and Flex for gpt-5.5 at half the standard short-context rate: $2.50/M input, $0.25/M cached input, and $15/M output. That makes offline jobs and latency-tolerant agent runs materially cheaper than standard synchronous calls.
Q · 04 What does Priority processing cost? +
The Priority table lists gpt-5.5 at $12.50/M input, $1.25/M cached input, and $75/M output. That is 2.5x the standard short-context list price, so reserve it for latency-sensitive production traffic rather than batch analysis.
Q · 05 Does GPT-5.5 have regional pricing? +
Yes. OpenAI states that regional processing/data-residency endpoints carry a 10% uplift for gpt-5.5 and related GPT-5.4/5.5 models. Standard global routing uses the public standard rate.
Q · 06 Is GPT-5.5 cheaper than GPT-5.4? +
No. GPT-5.4 is half the GPT-5.5 list price at $2.50/M input, $0.25/M cached input, and $15/M output. In the shared 92/8 agentic blend with 82% cache hits, GPT-5.5 lands near $3.61/M while GPT-5.4 lands near $1.80/M.
Q · 07 What release date is used here? +
OpenAI announced GPT-5.5 on April 23, 2026 and then updated the release note on April 24, 2026 to say GPT-5.5 and GPT-5.5 Pro were available in the API. This page uses 2026-04-24 as the API release date.
Q · 08 How accurate is the tokenizer estimate? +
The live widget uses an estimated 4.875 characters per token for English text and labels the tokenizer as tiktoken-o200k_base. It is good for budget planning, but exact billing can differ by language, whitespace, code, tool calls, and image inputs.