Last verified
RETIRED256K CONTEXTARCHIVE PRICEREDIRECTS TO GROK-4-3

Grok 4 API Pricing

Grok 4 is now an archive row: historical pricing was $3/M input, $0.75/M cached input, and $15/M output. xAI's May 15 retirement guide says retired slugs redirect to Grok 4.3 and are billed at current Grok 4.3 prices.

Input - per 1M tokens
$3.00/M
Historical Grok 4 archive
Output - per 1M tokens
$15.00/M
Archive launch price archive
Cached input
$0.75/M
Prompt cache discount
Effective - agentic blend
$2.26/M
92/8 split - 82% cache
§ 01 / TERMINAL

Run the numbers.

Calculator pre-loaded with Grok 4's archive rates. For live traffic after May 15, 2026, xAI says the retired slug redirects to Grok 4.3 and bills at Grok 4.3 pricing.

$ /mo
Workload split
Prompt cache hit rate
Tokens you can process
Words equivalent (English)
Effective rate
§ 02 / SCENARIOS

Real-world presets.

§ 03 / TAPE

Price history.

Grok 4 retired on 2026-05-15; archive price remains $3/$15 per M.

Input · $3/M
Output · $15/M
Cached · $0.75/M
JUL 09 Launched at $3/M input and $15/M outputMAY 18 Retirement verified; slug redirects to Grok 4.3 after May 15, 2026
§ 04 / TOKENIZER

Paste text. See tokens. See cost.

Estimate · grok-tokenizer-estimate · ≈3.85 chars/token Auto-counts as you type

This is a chars-per-token approximation, not a real tokenizer. Actual tokens vary by language, code density, and tool-call overhead — counts are typically ±10–20% off for English prose, more for code or non-Latin scripts. For exact billing, use the vendor's official tokenizer.

Characters
Words
Tokens (estimated)
Cost as input · uncached
Cost as output · uncached
Cost as cached input
§ 05 / SHELF

Up against the shelf.

All models →
Model Input /M Output /M Effective blended Context Best for
Grok 4 Current $3.00 cache $0.75 $15.00 $2.26 archive 92/8 256K Archive Grok 4 invoices
Grok 4.3 $1.25 $2.50 $1.35 current replacement 1M Current Grok default
Gemini 2.5 Pro $1.25 cache $0.13 $10.00 $1.10 Gemini Pro 2M Stable Google frontier
GPT-5.4 $2.50 cache $0.25 $15.00 $1.80 OpenAI frontier 1.05M Affordable OpenAI frontier
Claude Sonnet 4.6 $3.00 cache $0.30 $15.00 $1.92 Anthropic agent 1M Daily-driver Claude agents
DeepSeek V4 Pro $0.43 cache $0.00 $0.87 $0.14 budget reasoning 1M Low-cost reasoning alternative

Frequently asked.

Grok 4 pricing questions, with xAI token rates separated from workload assumptions.

Q · 01 What was Grok 4's API price? +
The archive snapshot keeps Grok 4 at $3/M input, $0.75/M cached input, and $15/M output. Use these numbers for historical billing analysis, not new routing decisions.
Q · 02 Is Grok 4 still available? +
xAI's May 15, 2026 retirement guide says grok-4-0709 and related retired slugs redirect to grok-4.3. After the redirect, requests are billed at Grok 4.3's $1.25/M input and $2.50/M output pricing.
Q · 03 Does prompt caching apply? +
The archive row includes cached input at $0.75/M. The effective blend assumes 82% of input tokens hit cache; turn cache off in the calculator for fresh-input workloads.
Q · 04 Why keep an archive page? +
Teams still need old price rows to reconcile invoices and explain model migrations. This page explicitly marks the endpoint as retired so it does not imply new production availability.
Q · 05 What should replace Grok 4? +
Use grok-4.3. xAI states retired text slugs redirect to Grok 4.3, and the live pricing page lists Grok 4.3 at $1.25/M input, $0.20/M cached input, and $2.50/M output.
Q · 06 How accurate is the tokenizer estimate? +
The widget uses 4.875 characters per token as a planning estimate. Exact billing can vary with language, reasoning effort, cached prompts, and tool calls.