Last verified 2026-05-18

VERIFIED PRICERETIRED MODEL32KTEXT MODELNO CACHE SKU

GPT-4 32K API Pricing

Q: How much does GPT-4 32K cost?

GPT-4 32K is listed at $60/M input and $120/M output.

GPT-4 32K is retired; this archive preserves OpenAI's most expensive public API tier. $60/M input, $120/M output. It charged a 2x premium over original GPT-4 for extended context.

Input - per 1M tokens

$60.00/M

Source OpenAI retired

Output - per 1M tokens

$120.00/M

Use for cost checks retired

Cached input

$60.00/M

Cache no separate SKU not listed

Effective - agentic blend

$64.80/M

92/8 split - 82% cache

§ 01 / TERMINAL

Run the numbers.

Calculator pre-loaded with GPT-4 32K rates. Tweak spend, output mix, or cache hit rate to compare this model with current alternatives.

Spend

$ /mo

Workload split

Prompt cache hit rate

Tokens you can process

—

Words equivalent (English)

—

Effective rate

—

Open full calculator (all models · share URL · CSV) →

§ 02 / SCENARIOS

Real-world presets.

AGENT LOOP

Repo-wide bug fix

$5.70/task

81k in - 7k out~17 units/$100

LONG DOC

Reading 100-page contracts

$11.46/doc

175k in - 8k out~8 units/$100

RAG SUPPORT

Support agent ticket triage

$0.360/ticket

4k in - 1k out~277 units/$100

ASSISTANT TURN

Research planning turn

$0.960/turn

12k in - 2k out~104 units/$100

§ 03 / TOKENIZER

Paste text. See tokens. See cost.

Your text · live count

Estimate · tiktoken-cl100k_base · ≈4 chars/token Auto-counts as you type

This is a chars-per-token approximation, not a real tokenizer. Actual tokens vary by language, code density, and tool-call overhead — counts are typically ±10–20% off for English prose, more for code or non-Latin scripts. For exact billing, use the vendor's official tokenizer.

Characters 256

Words 39

Tokens (estimated) 64 tokens

Cost as input · uncached $0.003840 USD

Cost as output · uncached $0.007680 USD

Cost as cached input $0.003840 USD

§ 04 / SHELF

Up against the shelf.

All models →

Model	Input /M	Output /M	Effective blended	Context	Best for
GPT-4 32K	$60.00 cache $60.00	$120	$64.80 current page	32K	Archive comparison
GPT-5.5	$5.00 cache $0.50	$30.00	$3.60 cheaper	1M	Current OpenAI replacement
GPT-5.4	$2.50 cache $0.25	$15.00	$1.80 cheaper	1.05M	Current OpenAI replacement
GPT-5.4 mini	$0.75 cache $0.07	$4.50	$0.54 cheaper	400K	Current OpenAI replacement
Claude Sonnet 4.6	$3.00 cache $0.30	$15.00	$1.92 cheaper	1M	Current Claude agents
Gemini 2.5 Pro	$1.25 cache $0.13	$10.00	$1.10 cheaper	2M	Gemini multimodal workloads
DeepSeek V4 Pro	$0.43 cache $0.00	$0.87	$0.14 cheaper	1M	Low-cost reasoning and coding

§ 05 / DEEP LINKS

Specific scenarios.

All calculators →

By comparison

Compare current OpenAI → Compare current Gemini → Compare current Claude →

Frequently asked.

Short answers for teams checking GPT-4 32K pricing, status, and migration choices.

Q · 01 Is GPT-4 32K still available? +

GPT-4 32K is retired. Keep this page for pricing history, old invoices, and migration planning.

Q · 02 How much does GPT-4 32K cost? +

GPT-4 32K is listed at $60/M input and $120/M output.

Q · 03 Is cached-input pricing included? +

No separate cached-input SKU is listed for this archive entry, so the calculator treats cached input as undiscounted.

Q · 04 What should teams compare it against? +

Use the shelf to compare current replacements and nearby models. Snapshot replacement: gpt-4o.

Q · 05 Why keep this page? +

These pages preserve price history, preview economics, and migration context without pretending every older or preview model is the best current default.

Reviewed by Yaroslav Vikhariev Founder - AI//COST - Last verified 2026-05-18 - Vendor pricing

Methodology Report a correction More by Y.V.