Last verified
VERIFIED PRICERETIRED MODEL32KTEXT MODELNO CACHE SKU

GPT-4 32K API Pricing

GPT-4 32K is retired; this archive preserves OpenAI's most expensive public API tier. $60/M input, $120/M output. It charged a 2x premium over original GPT-4 for extended context.

Input - per 1M tokens
$60.00/M
Source OpenAI retired
Output - per 1M tokens
$120.00/M
Use for cost checks retired
Cached input
$60.00/M
Cache no separate SKU not listed
Effective - agentic blend
$64.80/M
92/8 split - 82% cache
§ 01 / TERMINAL

Run the numbers.

Calculator pre-loaded with GPT-4 32K rates. Tweak spend, output mix, or cache hit rate to compare this model with current alternatives.

$ /mo
Workload split
Prompt cache hit rate
Tokens you can process
Words equivalent (English)
Effective rate
§ 02 / SCENARIOS

Real-world presets.

§ 03 / TAPE

Price history.

GPT-4 32K is retired; archived at $60/$120 per M.

Input · $60/M
Output · $120/M
Cached · $60/M
MAR 14 Launch-era 32K price at $60/M input and $120/M outputMAY 18 Retired on June 6, 2025; replacement is GPT-4o
§ 04 / TOKENIZER

Paste text. See tokens. See cost.

Estimate · tiktoken-cl100k_base · ≈4 chars/token Auto-counts as you type

This is a chars-per-token approximation, not a real tokenizer. Actual tokens vary by language, code density, and tool-call overhead — counts are typically ±10–20% off for English prose, more for code or non-Latin scripts. For exact billing, use the vendor's official tokenizer.

Characters
Words
Tokens (estimated)
Cost as input · uncached
Cost as output · uncached
Cost as cached input
§ 05 / SHELF

Up against the shelf.

All models →
Model Input /M Output /M Effective blended Context Best for
GPT-4 32K $60.00 cache $60.00 $120 $64.80 current page 32K Archive comparison
GPT-5.5 $5.00 cache $0.50 $30.00 $3.60 cheaper 1M Current OpenAI replacement
GPT-5.4 $2.50 cache $0.25 $15.00 $1.80 cheaper 1.05M Current OpenAI replacement
GPT-5.4 mini $0.75 cache $0.07 $4.50 $0.54 cheaper 400K Current OpenAI replacement
Claude Sonnet 4.6 $3.00 cache $0.30 $15.00 $1.92 cheaper 1M Current Claude agents
Gemini 2.5 Pro $1.25 cache $0.13 $10.00 $1.10 cheaper 2M Gemini multimodal workloads
DeepSeek V4 Pro $0.43 cache $0.00 $0.87 $0.14 cheaper 1M Low-cost reasoning and coding

Frequently asked.

Short answers for teams checking GPT-4 32K pricing, status, and migration choices.

Q · 01 Is GPT-4 32K still available? +
GPT-4 32K is retired. Keep this page for pricing history, old invoices, and migration planning.
Q · 02 How much does GPT-4 32K cost? +
GPT-4 32K is listed at $60/M input and $120/M output.
Q · 03 Is cached-input pricing included? +
No separate cached-input SKU is listed for this archive entry, so the calculator treats cached input as undiscounted.
Q · 04 What should teams compare it against? +
Use the shelf to compare current replacements and nearby models. Snapshot replacement: gpt-4o.
Q · 05 Why keep this page? +
These pages preserve price history, preview economics, and migration context without pretending every older or preview model is the best current default.