Last verified 2026-05-18

RETIRED256K CONTEXTARCHIVE PRICEREDIRECTS TO GROK-4-3

Grok 4 API Pricing

Q: What was Grok 4's API price?

The archive snapshot keeps Grok 4 at $3/M input, $0.75/M cached input, and $15/M output. Use these numbers for historical billing analysis, not new routing decisions.

Q: Is Grok 4 still available?

xAI's May 15, 2026 retirement guide says grok-4-0709 and related retired slugs redirect to grok-4.3. After the redirect, requests are billed at Grok 4.3's $1.25/M input and $2.50/M output pricing.

Q: Does prompt caching apply?

The archive row includes cached input at $0.75/M. The effective blend assumes 82% of input tokens hit cache; turn cache off in the calculator for fresh-input workloads.

Q: What should replace Grok 4?

Use grok-4.3. xAI states retired text slugs redirect to Grok 4.3, and the live pricing page lists Grok 4.3 at $1.25/M input, $0.20/M cached input, and $2.50/M output.

Q: How accurate is the tokenizer estimate?

The widget uses 4.875 characters per token as a planning estimate. Exact billing can vary with language, reasoning effort, cached prompts, and tool calls.

Grok 4 is now an archive row: historical pricing was $3/M input, $0.75/M cached input, and $15/M output. xAI's May 15 retirement guide says retired slugs redirect to Grok 4.3 and are billed at current Grok 4.3 prices.

Input - per 1M tokens

$3.00/M

Historical Grok 4 archive

Output - per 1M tokens

$15.00/M

Archive launch price archive

Cached input

$0.75/M

Prompt cache discount

Effective - agentic blend

$2.26/M

92/8 split - 82% cache

§ 01 / TERMINAL

Run the numbers.

Calculator pre-loaded with Grok 4's archive rates. For live traffic after May 15, 2026, xAI says the retired slug redirects to Grok 4.3 and bills at Grok 4.3 pricing.

Spend

$ /mo

Workload split

Prompt cache hit rate

Tokens you can process

—

Words equivalent (English)

—

Effective rate

—

Open full calculator (all models · share URL · CSV) →

§ 02 / SCENARIOS

Real-world presets.

AGENT

Tool-calling agent

$0.212/task

80k in / 8k out~470 tasks/$100

LONG CONTEXT

Research pack synthesis

$0.381/pack

200k in / 10k out~262 packs/$100

CHAT

Power-user chat

$0.165/turn

40k in / 6k out~606 turns/$100

RAG

Knowledge base RAG

$0.353/answer

150k in / 12k out~283 answers/$100

§ 03 / TOKENIZER

Paste text. See tokens. See cost.

Your text · live count

Estimate · grok-tokenizer-estimate · ≈3.85 chars/token Auto-counts as you type

This is a chars-per-token approximation, not a real tokenizer. Actual tokens vary by language, code density, and tool-call overhead — counts are typically ±10–20% off for English prose, more for code or non-Latin scripts. For exact billing, use the vendor's official tokenizer.

Characters 244

Words 40

Tokens (estimated) 63 tokens

Cost as input · uncached $0.000189 USD

Cost as output · uncached $0.000945 USD

Cost as cached input $0.000047 USD

§ 04 / SHELF

Up against the shelf.

All models →

Model	Input /M	Output /M	Effective blended	Context	Best for
Grok 4 Current	$3.00 cache $0.75	$15.00	$2.26 archive 92/8	256K	Archive Grok 4 invoices
Grok 4.3	$1.25	$2.50	$1.35 current replacement	1M	Current Grok default
Gemini 2.5 Pro	$1.25 cache $0.13	$10.00	$1.10 Gemini Pro	2M	Stable Google frontier
GPT-5.4	$2.50 cache $0.25	$15.00	$1.80 OpenAI frontier	1.05M	Affordable OpenAI frontier
Claude Sonnet 4.6	$3.00 cache $0.30	$15.00	$1.92 Anthropic agent	1M	Daily-driver Claude agents
DeepSeek V4 Pro	$0.43 cache $0.00	$0.87	$0.14 budget reasoning	1M	Low-cost reasoning alternative

Frequently asked.

Grok 4 pricing questions, with xAI token rates separated from workload assumptions.

Q · 01 What was Grok 4's API price? +

The archive snapshot keeps Grok 4 at $3/M input, $0.75/M cached input, and $15/M output. Use these numbers for historical billing analysis, not new routing decisions.

Q · 02 Is Grok 4 still available? +

xAI's May 15, 2026 retirement guide says grok-4-0709 and related retired slugs redirect to grok-4.3. After the redirect, requests are billed at Grok 4.3's $1.25/M input and $2.50/M output pricing.

Q · 03 Does prompt caching apply? +

The archive row includes cached input at $0.75/M. The effective blend assumes 82% of input tokens hit cache; turn cache off in the calculator for fresh-input workloads.

Q · 04 Why keep an archive page? +

Teams still need old price rows to reconcile invoices and explain model migrations. This page explicitly marks the endpoint as retired so it does not imply new production availability.

Q · 05 What should replace Grok 4? +

Use grok-4.3. xAI states retired text slugs redirect to Grok 4.3, and the live pricing page lists Grok 4.3 at $1.25/M input, $0.20/M cached input, and $2.50/M output.

Q · 06 How accurate is the tokenizer estimate? +

The widget uses 4.875 characters per token as a planning estimate. Exact billing can vary with language, reasoning effort, cached prompts, and tool calls.

Reviewed by Yaroslav Vikhariev Founder - AI//COST - Pricing pulled daily from docs.x.ai - Last verified May 18, 2026

Methodology Report a correction More by Y.V.