ARCHIVE PRICE · DEPRECATED · REASONING · 200K · TEXT MODEL · PROMPT CACHING
o1 API Pricing
o1 is deprecated; this archive preserves the first production OpenAI reasoning price: $15/M input, $60/M output, and $7.50/M cached input. Output includes reasoning tokens, so the $60/M output line matters for old agent invoices.
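To see why the output line dominates old invoices, here is a minimal sketch of a per-request cost check at the archived rates. The function name and its parameters are hypothetical, and it assumes reasoning tokens are billed at the output rate alongside the visible output, as the summary above describes.

```python
# Archived o1 rates from this page, in USD per 1M tokens.
O1_INPUT = 15.00
O1_CACHED_INPUT = 7.50
O1_OUTPUT = 60.00


def o1_request_cost(input_tokens, cached_input_tokens,
                    visible_output_tokens, reasoning_tokens):
    """Return the cost in USD of one request (hypothetical helper).

    cached_input_tokens is the portion of input_tokens served from cache.
    Reasoning tokens are assumed to be billed as output tokens.
    """
    uncached_input = input_tokens - cached_input_tokens
    billed_output = visible_output_tokens + reasoning_tokens
    total = (uncached_input * O1_INPUT
             + cached_input_tokens * O1_CACHED_INPUT
             + billed_output * O1_OUTPUT)
    return total / 1_000_000


# 10,000 uncached input tokens, 500 visible output tokens, 4,500 reasoning tokens.
print(o1_request_cost(10_000, 0, 500, 4_500))  # 0.45
```

In this made-up request the reasoning tokens account for $0.27 of the $0.45 total, even though they never appear in the response text.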
Archived input - per 1M tokens
$15.00/M
Source OpenAI deprecated
Archived output - per 1M tokens
$60.00/M
Use for invoice checks deprecated
Cached input
$7.50/M
Cache hit price discounted
Effective - agentic blend
$12.94/M
92/8 input/output split - 82% cache hit rate
§ 01 / TERMINAL
Run the numbers.
Archive calculator pre-loaded with o1 rates. Tweak spend, output mix, or cache hit rate to compare legacy bills with current replacements.
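As a rough sketch of what such a calculator computes, the snippet below converts a monthly budget into a token count at a given effective rate. The function names are hypothetical, and the 0.75 words-per-token figure is a common rule of thumb for English, not a number from this page.

```python
WORDS_PER_TOKEN = 0.75  # assumed rule of thumb for English prose


def tokens_for_budget(monthly_usd, effective_rate_per_million):
    """Tokens a monthly budget buys at a blended USD-per-1M-token rate."""
    return monthly_usd / effective_rate_per_million * 1_000_000


def words_equivalent(tokens):
    """Approximate English word count for a token count."""
    return tokens * WORDS_PER_TOKEN


# $1,000 per month at the archived o1 blended rate of $12.94/M.
tokens = tokens_for_budget(1_000, 12.94)
print(f"{tokens / 1e6:.1f}M tokens, about {words_equivalent(tokens) / 1e6:.1f}M words")
```

Swapping in a replacement model's blended rate gives a like-for-like comparison of how far the same budget goes.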
§ 02 / SCENARIOS
Real-world presets.
CODING AGENT ARCHIVE
Repo-wide bug fix
$1.14/task
LONG DOC ANALYSIS
Reading 100-page contracts
$2.03/doc
RAG SUPPORT
Support agent ticket triage
$0.095/ticket
ASSISTANT TURN
Research planning turn
$0.226/turn
§ 03 / TAPE
Price history.
Input · $15/M
Output · $60/M
Cached · $7.5/M
DEC 17 Launch at $15/M input, $7.50/M cached input, and $60/M output
MAY 18 Deprecated; shutdown listed for October 23, 2026 with GPT-5.5 as replacement
§ 04 / TOKENIZER
Paste text. See tokens. See cost.
Estimate · tiktoken-cl100k_base · ≈4 chars/token
This is a chars-per-token approximation, not a real tokenizer. Actual tokens vary by language, code density, and tool-call overhead — counts are typically ±10–20% off for English prose, more for code or non-Latin scripts. For exact billing, use the vendor's official tokenizer.
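The approximation described above can be sketched in a few lines. The helper names are hypothetical, rounding up is an assumption, and the prices are the archived o1 rates from this page. It inherits the same error margins as any chars-per-token estimate.

```python
import math

CHARS_PER_TOKEN = 4  # rough average for English prose

# Archived o1 rates from this page, in USD per 1M tokens.
RATES = {"input": 15.00, "output": 60.00, "cached_input": 7.50}


def estimate_tokens(text):
    """Estimate the token count from character length (not a real tokenizer)."""
    return math.ceil(len(text) / CHARS_PER_TOKEN)


def estimate_costs(text):
    """Estimated USD cost of the text under each archived o1 rate."""
    tokens = estimate_tokens(text)
    return {name: tokens * price / 1_000_000 for name, price in RATES.items()}


sample = "Summarize the attached contract and list every termination clause."
print(estimate_tokens(sample), estimate_costs(sample))
```

For billing reconciliation, replace `estimate_tokens` with counts from the vendor's official tokenizer or from the usage figures returned with each API response.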
| Model | Input /M (cached) | Output /M | Effective blended | Context | Best for |
|---|---|---|---|---|---|
| o1 | $15.00 (cache $7.50) | $60.00 | $12.94 (current page) | 200K | Legacy reasoning invoices |
| GPT-5.5 | $5.00 (cache $0.50) | $30.00 | $3.60 (cheaper) | 1M | Current OpenAI flagship |
| GPT-5.4 | $2.50 (cache $0.25) | $15.00 | $1.80 (cheaper) | 1.05M | Current OpenAI daily driver |
| GPT-5.4 mini | $0.75 (cache $0.07) | $4.50 | $0.54 (cheaper) | 400K | Current cheap OpenAI workloads |
| Claude Sonnet 4.6 | $3.00 (cache $0.30) | $15.00 | $1.92 (cheaper) | 1M | Current Claude agents |
| Gemini 2.5 Pro | $1.25 (cache $0.13) | $10.00 | $1.10 (cheaper) | 2M | Gemini long-context work |
| DeepSeek V4 Pro | $0.43 (cache $0.00) | $0.87 | $0.14 (cheaper) | 1M | Low-cost reasoning and coding |
Frequently asked.
Short answers for teams checking o1 pricing, status, and migration choices.
Q · 01 Is o1 still available?
o1 is deprecated. The shutdown is listed for October 23, 2026, with GPT-5.5 as the replacement; check OpenAI's deprecations page for the current schedule, and use this page for old workloads and migration checks.
Q · 02 How much did o1 cost?
o1 is archived at $15/M input and $60/M output.
Q · 03 Is cached-input pricing included?
Yes. Cached input is shown at $7.5/M.
Q · 04 What replaced it?
The snapshot replacement is gpt-5-5; use the shelf to compare current options.
Q · 05 Why keep this archive page?
Archive pages preserve old price baselines, help explain invoice history, and capture migration search demand without treating old models as current recommendations.