Question 1

What is the standard chat-latest API price?

Accepted Answer

OpenAI lists chat-latest at $5/M input tokens, $0.5/M cached input tokens, and $30/M output tokens. This page stores the public list price in USD and marks OpenAI's pricing docs as the source.

Question 2

How is the effective blended cost calculated?

Accepted Answer

The effective tile uses the fixed AI//COST 92/8 agentic blend with 82% cache hits. For chat-latest, that works out to $3.61/M using $5/M input, $0.5/M cached input, and $30/M output.

Question 3

Is Batch API pricing listed?

Accepted Answer

OpenAI's model docs expose Batch as an endpoint, and the pricing table separately lists discounted Batch rows where available. If a model-specific Batch row is absent, this page does not invent one.

Question 4

Does prompt caching change the input price?

Accepted Answer

Yes. A cache hit is billed at $0.5/M instead of $5/M. The calculator lets you set the hit rate to 82%, 50%, or off.

Question 5

Are regional processing prices different?

Accepted Answer

OpenAI states that regional processing/data-residency endpoints can carry a 10% uplift for selected GPT-5.4/5.5 family models. This page uses the global public API list price unless the vendor row explicitly lists a regional surcharge.

Question 6

Is chat-latest a pinned production model?

Accepted Answer

No. OpenAI says chat-latest points to the latest Instant model used in ChatGPT and that the underlying snapshot will be regularly updated. OpenAI recommends GPT-5.5 for production API usage when you need a stable model choice.

Question 7

What release date is used here?

Accepted Answer

OpenAI's GPT-5.5 Instant post is dated May 5, 2026 and says the model is available in the API as chat-latest. This page uses 2026-05-05 for the first observed pricing-history point.

Question 8

How accurate is the tokenizer estimate?

Accepted Answer

The live widget uses an estimated 4.875 characters per token for English text and labels the tokenizer as tiktoken-o200k_base. Exact billing can differ by language, code, whitespace, tool calls, and image inputs.

Model	Input /M	Output /M	Effective blended	Context	Best for
chat-latest Current	$5.00 cache $0.50	$30.00	$3.61 agentic 92/8	400K	Latest ChatGPT Instant alias
GPT-5.5	$5.00 cache $0.50	$30.00	$3.61 same input/output	1M	Production flagship API model
GPT-5.4	$2.50 cache $0.25	$15.00	$1.80 cheaper frontier	1.05M	Affordable OpenAI frontier work
GPT-5.4 mini	$0.75 cache $0.07	$4.50	$0.54 smaller tier	400K	Subagents and computer-use tasks
GPT-5.4 nano	$0.20 cache $0.02	$1.25	$0.15 lowest GPT tier	400K	Classification and extraction
Claude Opus 4.7	$5.00 cache $0.50	$25.00	$3.21 Anthropic frontier	1M	Frontier reasoning and writing
Gemini 3.1 Pro Preview	$2.00 cache $0.20	$12.00	$1.44 Google preview	1M	Large multimodal reasoning

chat-latest API Pricing

Run the numbers.

Real-world presets.

Everyday ChatGPT-style assistant

Support answer drafting

Image-heavy question

Creative brief iteration

Knowledge-base answer

Paste text. See tokens. See cost.

Up against the shelf.

Frequently asked.