Last verified 2026-05-18

ARCHIVE PRICEDEPRECATED REASONING200KTEXT MODELPROMPT CACHING

o1 API Pricing

Q: How much did o1 cost?

o1 is archived at $15/M input and $60/M output.

Q: Is cached-input pricing included?

Yes. Cached input is shown at $7.5/M.

Q: What replaced it?

The snapshot replacement is gpt-5-5; use the shelf to compare current options.

o1 is deprecated; this archive preserves the first production OpenAI reasoning price. $15/M input, $60/M output, and $7.5/M cached input. Output includes reasoning tokens, so the $60/M output line matters for old agent invoices.

Archived input - per 1M tokens

$15.00/M

Source OpenAI deprecated

Archived output - per 1M tokens

$60.00/M

Use for invoice checks deprecated

Cached input

$7.50/M

Cache hit price discounted

Effective - agentic blend

$12.94/M

92/8 split - 82% cache

§ 01 / TERMINAL

Run the numbers.

Archive calculator pre-loaded with o1 rates. Tweak spend, output mix, or cache hit rate to compare legacy bills with current replacements.

Spend

$ /mo

Workload split

Prompt cache hit rate

Tokens you can process

—

Words equivalent (English)

—

Effective rate

—

§ 02 / SCENARIOS

Real-world presets.

CODING AGENT ARCHIVE

Repo-wide bug fix

$1.14/task

81k in - 7k out~87 units/$100

LONG DOC ANALYSIS

Reading 100-page contracts

$2.03/doc

175k in - 8k out~49 units/$100

RAG SUPPORT

Support agent ticket triage

$0.095/ticket

4k in - 1k out~1,052 units/$100

ASSISTANT TURN

Research planning turn

$0.226/turn

12k in - 2k out~442 units/$100

§ 03 / TAPE

Price history.

o1 is deprecated; archived at $15/$60 per M.

Input · $15/M

Output · $60/M

Cached · $7.5/M

DEC 17 Launch at $15/M input, $7.50/M cached input, and $60/M outputMAY 18 Deprecated; shutdown listed for October 23, 2026 with GPT-5.5 as replacement

§ 04 / TOKENIZER

Paste text. See tokens. See cost.

Your text · live count

Estimate · tiktoken-cl100k_base · ≈4 chars/token Auto-counts as you type

This is a chars-per-token approximation, not a real tokenizer. Actual tokens vary by language, code density, and tool-call overhead — counts are typically ±10–20% off for English prose, more for code or non-Latin scripts. For exact billing, use the vendor's official tokenizer.

Characters —

Words —

Tokens (estimated) —

Cost as input · uncached —

Cost as output · uncached —

Cost as cached input —

§ 05 / SHELF

Up against the shelf.

All models →

Model	Input /M	Output /M	Effective blended	Context	Best for
o1	$15.00 cache $7.50	$60.00	$12.94 current page	200K	Legacy reasoning invoices
GPT-5.5	$5.00 cache $0.50	$30.00	$3.60 cheaper	1M	Current OpenAI flagship
GPT-5.4	$2.50 cache $0.25	$15.00	$1.80 cheaper	1.05M	Current OpenAI daily driver
GPT-5.4 mini	$0.75 cache $0.07	$4.50	$0.54 cheaper	400K	Current cheap OpenAI workloads
Claude Sonnet 4.6	$3.00 cache $0.30	$15.00	$1.92 cheaper	1M	Current Claude agents
Gemini 2.5 Pro	$1.25 cache $0.13	$10.00	$1.10 cheaper	2M	Gemini long-context work
DeepSeek V4 Pro	$0.43 cache $0.00	$0.87	$0.14 cheaper	1M	Low-cost reasoning and coding

Frequently asked.

Short answers for teams checking o1 pricing, status, and migration choices.

Q · 01 Is o1 still available? +

o1 is deprecated. OpenAI's deprecations page lists the relevant shutdown and replacement path, so use this page for old workloads and migration checks.

Q · 02 How much did o1 cost? +

o1 is archived at $15/M input and $60/M output.

Q · 03 Is cached-input pricing included? +

Yes. Cached input is shown at $7.5/M.

Q · 04 What replaced it? +

The snapshot replacement is gpt-5-5; use the shelf to compare current options.

Q · 05 Why keep this archive page? +

Archive pages preserve old price baselines, help explain invoice history, and capture migration search demand without treating old models as current recommendations.

Reviewed by Yaroslav Vikhariev Founder - AI//COST - Last verified 2026-05-18 - OpenAI archive pricing

Methodology Report a correction More by Y.V.