GLM-4.7 FlashX API Pricing
GLM-4.7 FlashX is Zhipu's ultra-cheap GLM-4.7 series model for cost-sensitive coding, RAG, and support traffic. The live vendor table lists $0.07/M input and $0.4/M output, with cached input at $0.01/M. Pulled directly from docs.z.ai daily.
Run the numbers.
Live calculator pre-loaded with current GLM-4.7 FlashX rates. Tweak spend, output mix, or cache assumptions and share the URL to share the calculation.
Real-world presets.
Small repo implementation
Pull request review
Knowledge base answer
Support assistant
Price history.
Paste text. See tokens. See cost.
This is a chars-per-token approximation, not a real tokenizer. Actual tokens vary by language, code density, and tool-call overhead — counts are typically ±10–20% off for English prose, more for code or non-Latin scripts. For exact billing, use the vendor's official tokenizer.
| Model | Input /M | Output /M | Effective blended | Context | Best for |
|---|---|---|---|---|---|
| GLM-4.7 FlashX Current | $0.07 cache $0.01 | $0.40 | $0.05 agentic 92/8 | 200K | Ultra-cheap GLM traffic |
| GLM-4.7 Flash | $0.00 cache $0.00 | $0.00 | $0.00 cheaper | 200K | Free registered-user tier |
| GLM-4.5 Flash | $0.00 cache $0.00 | $0.00 | $0.00 cheaper | 128K | Free GLM-4.5 fallback |
| GLM-4.5 Air | $0.20 cache $0.03 | $1.10 | $0.14 pricier | 128K | Budget GLM-4.5 production |
| GLM-4.7 | $0.60 cache $0.11 | $2.20 | $0.36 pricier | 200K | Mid-tier GLM agents |
| DeepSeek V4 Flash | $0.14 cache $0.00 | $0.28 | $0.05 same blend | 1M | Budget reasoning and coding |
| Qwen3 Coder Flash | $0.30 | $1.50 | $0.40 pricier | 1M | Cheap code-focused Qwen |
Frequently asked.
Practical pricing questions, separated from calculator assumptions and regional tiers.
Q · 01 What is GLM-4.7 FlashX priced at? +
$0.07/M input and $0.4/M output on the live Z.AI pricing table. Cached input is listed at $0.01/M. This page stores USD per-million-token pricing.Q · 02 How is the effective price calculated? +
$0.05/M.Q · 03 Is prompt caching priced separately? +
$0.01/M versus $0.07/M fresh input. Cached input storage is listed as limited-time free on the Z.AI page.Q · 04 Are regional prices different? +
Q · 05 Is there a batch discount? +
Q · 06 How accurate is the tokenizer estimate? +
zhipu-tokenizer-estimate chars-per-token estimate for English text. It is useful for rough planning, but actual billing comes from the vendor API usage fields and can differ for Chinese, code, or mixed-language prompts.