RETIRED · 256K CONTEXT · ARCHIVE PRICE · REDIRECTS TO GROK-4-3
Grok 4 API Pricing
Grok 4 is now an archive row: historical pricing was $3/M input, $0.75/M cached input, and $15/M output. xAI's May 15, 2026 retirement guide says retired slugs redirect to Grok 4.3 and are billed at current Grok 4.3 prices.
Input - per 1M tokens
$3.00/M
Historical Grok 4 archive
Output - per 1M tokens
$15.00/M
Archive launch price
Cached input
$0.75/M
Prompt cache discount
Effective - agentic blend
$2.26/M
92/8 split - 82% cache
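The $2.26/M blend can be reproduced from the listed rates, assuming "92/8 split" means 92% input and 8% output tokens and that 82% of input tokens are cache hits. A minimal sketch of that arithmetic (the function and parameter names are illustrative, not part of any xAI API):

```python
def effective_rate(input_rate, cached_rate, output_rate,
                   input_share=0.92, cache_hit=0.82):
    """Blended $/M tokens for a given input/output split and cache hit rate."""
    # Input tokens bill at the cached rate on a hit and the full rate on a miss.
    blended_input = cache_hit * cached_rate + (1 - cache_hit) * input_rate
    # Weight blended input and output by their share of total tokens.
    return input_share * blended_input + (1 - input_share) * output_rate

# Grok 4 archive rates: $3.00 input, $0.75 cached input, $15.00 output.
grok4_archive = effective_rate(3.00, 0.75, 15.00)
print(f"${grok4_archive:.2f}/M")  # prints $2.26/M
```

Under the same assumptions, setting `cache_hit=0` gives $3.96/M, which is the fresh-input case with no cache discount.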
§ 01 / TERMINAL
Run the numbers.
Calculator pre-loaded with Grok 4's archive rates. For live traffic after May 15, 2026, xAI says the retired slug redirects to Grok 4.3 and bills at Grok 4.3 pricing.
Inputs: monthly budget ($/mo), workload split, prompt cache hit rate.
Outputs: tokens you can process, words equivalent (English), effective rate.
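The calculator's budget-to-tokens conversion can be approximated as below. This is a sketch, not the page's actual code: the $500 budget is a made-up input, and the 0.75 words-per-token ratio is a common rule of thumb for English that the page does not state.

```python
def tokens_for_budget(budget_usd, rate_per_million):
    """Number of tokens a monthly budget buys at a blended $/M rate."""
    return budget_usd / rate_per_million * 1_000_000

# Hypothetical $500/mo budget at the page's $2.26/M archive blend.
tokens = tokens_for_budget(500, 2.26)

# Assumption: roughly 0.75 English words per token (rule of thumb only).
WORDS_PER_TOKEN = 0.75
words = tokens * WORDS_PER_TOKEN

print(f"{tokens:,.0f} tokens, roughly {words:,.0f} words")
```

Swapping in a different effective rate (for example, a blend computed with caching off) changes the token count proportionally.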
§ 02 / SCENARIOS
Real-world presets.
AGENT
Tool-calling agent
$0.212/task
LONG CONTEXT
Research pack synthesis
$0.381/pack
CHAT
Power-user chat
$0.165/turn
RAG
Knowledge base RAG
$0.353/answer
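The page does not list the token counts behind these presets, so the figures above cannot be re-derived here. As a rough sketch of how a per-request cost like these could be computed at the archive rates, with token counts that are purely hypothetical:

```python
def request_cost(input_tokens, output_tokens, cached_fraction=0.0,
                 input_rate=3.00, cached_rate=0.75, output_rate=15.00):
    """Dollar cost of one request; rates are $ per 1M tokens (Grok 4 archive defaults)."""
    cached = input_tokens * cached_fraction   # input tokens served from cache
    fresh = input_tokens - cached             # input tokens billed at the full rate
    total = fresh * input_rate + cached * cached_rate + output_tokens * output_rate
    return total / 1_000_000

# Hypothetical agent task: 60k input tokens (80% cached) and 2k output tokens.
# These counts are illustrative and are NOT the preset's real inputs.
print(f"${request_cost(60_000, 2_000, cached_fraction=0.8):.3f}/task")
```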
§ 03 / TAPE
Price history.
Input · $3/M
Output · $15/M
Cached · $0.75/M
JUL 09 Launched at $3/M input and $15/M output
MAY 18 Retirement verified; slug redirects to Grok 4.3 after May 15, 2026
§ 04 / TOKENIZER
Paste text. See tokens. See cost.
Estimate · grok-tokenizer-estimate · ≈3.85 chars/token · Auto-counts as you type
This is a chars-per-token approximation, not a real tokenizer. Actual tokens vary by language, code density, and tool-call overhead; counts are typically ±10–20% off for English prose, and further off for code or non-Latin scripts. For exact billing, use the vendor's official tokenizer.
Reported values: characters, words, tokens (estimated), cost as uncached input, cost as output, cost as cached input.
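A chars-per-token estimate of this kind can be sketched as follows. The 3.85 ratio is taken from the widget label above and is kept as an adjustable parameter; the function names are hypothetical, and the result is a planning figure, not a tokenizer count.

```python
CHARS_PER_TOKEN = 3.85  # ratio shown in the widget label; an approximation

# Grok 4 archive rates in $ per 1M tokens.
ARCHIVE_RATES = {"input": 3.00, "output": 15.00, "cached_input": 0.75}

def estimate_tokens(text, chars_per_token=CHARS_PER_TOKEN):
    """Rough token count from character length; not a real tokenizer."""
    return round(len(text) / chars_per_token)

def estimate_costs(text, rates=ARCHIVE_RATES):
    """Return (estimated tokens, dollar cost of the text at each rate)."""
    tokens = estimate_tokens(text)
    return tokens, {name: tokens * rate / 1_000_000 for name, rate in rates.items()}

sample = "Paste text. See tokens. See cost. " * 100
tokens, costs = estimate_costs(sample)
print(len(sample), "chars ->", tokens, "estimated tokens")
for name, usd in costs.items():
    print(f"  as {name}: ${usd:.6f}")
```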
| Model | Input /M | Output /M | Effective blended | Context | Best for |
|---|---|---|---|---|---|
| Grok 4 (this page) | $3.00 (cache $0.75) | $15.00 | $2.26 (archive, 92/8) | 256K | Archive Grok 4 invoices |
| Grok 4.3 | $1.25 | $2.50 | $1.35 (current replacement) | 1M | Current Grok default |
| Gemini 2.5 Pro | $1.25 (cache $0.13) | $10.00 | $1.10 (Gemini Pro) | 2M | Stable Google frontier |
| GPT-5.4 | $2.50 (cache $0.25) | $15.00 | $1.80 (OpenAI frontier) | 1.05M | Affordable OpenAI frontier |
| Claude Sonnet 4.6 | $3.00 (cache $0.30) | $15.00 | $1.92 (Anthropic agent) | 1M | Daily-driver Claude agents |
| DeepSeek V4 Pro | $0.43 (cache $0.00) | $0.87 | $0.14 (budget reasoning) | 1M | Low-cost reasoning alternative |
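For invoice reconciliation across the redirect, the Grok 4 and Grok 4.3 rows can be compared on a fixed workload. The sketch below uses the uncached input and output rates from the table; the 10M-input, 1M-output workload is a made-up example, and caching is ignored for simplicity.

```python
# $ per 1M tokens (uncached), taken from the table rows above.
RATES = {
    "grok-4 (archive)": {"input": 3.00, "output": 15.00},
    "grok-4.3": {"input": 1.25, "output": 2.50},
}

def workload_cost(model, input_tokens, output_tokens):
    """Dollar cost of a workload at a model's uncached per-million rates."""
    r = RATES[model]
    return (input_tokens * r["input"] + output_tokens * r["output"]) / 1_000_000

# Hypothetical monthly workload: 10M input tokens and 1M output tokens.
for model in RATES:
    print(f"{model}: ${workload_cost(model, 10_000_000, 1_000_000):.2f}")
```

With these inputs the archive rates give $45.00 and the Grok 4.3 rates give $15.00, so the same traffic bills at a third of the archive price after the redirect.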
Frequently asked.
Grok 4 pricing questions, with xAI token rates separated from workload assumptions.
Q · 01 What was Grok 4's API price?
The archive snapshot keeps Grok 4 at $3/M input, $0.75/M cached input, and $15/M output. Use these numbers for historical billing analysis, not new routing decisions.
Q · 02 Is Grok 4 still available?
xAI's May 15, 2026 retirement guide says grok-4-0709 and related retired slugs redirect to grok-4.3. After the redirect, requests are billed at Grok 4.3's $1.25/M input and $2.50/M output pricing.
Q · 03 Does prompt caching apply?
The archive row includes cached input at $0.75/M. The effective blend assumes 82% of input tokens hit cache; turn cache off in the calculator for fresh-input workloads.
Q · 04 Why keep an archive page?
Teams still need old price rows to reconcile invoices and explain model migrations. This page explicitly marks the endpoint as retired so it does not imply new production availability.
Q · 05 What should replace Grok 4?
Use grok-4.3. xAI states retired text slugs redirect to Grok 4.3, and the live pricing page lists Grok 4.3 at $1.25/M input, $0.20/M cached input, and $2.50/M output.
Q · 06 How accurate is the tokenizer estimate?
The widget uses 4.875 characters per token as a planning estimate. Exact billing can vary with language, reasoning effort, cached prompts, and tool calls.