GLM-4.7 Flash API Pricing
GLM-4.7 Flash is Zhipu's free GLM-4.7 series model for registered-user experimentation and high-frequency light workloads. The live vendor table lists $0/M input and $0/M output; access is free on the public Z.AI pricing page. Pulled directly from docs.z.ai daily.
Run the numbers.
Live calculator pre-loaded with current GLM-4.7 Flash rates. Tweak spend, output mix, or cache assumptions and share the URL to share the calculation.
Real-world presets.
Prototype implementation
Pull request review
Knowledge base answer
Support assistant
Price history.
Paste text. See tokens. See cost.
This is a chars-per-token approximation, not a real tokenizer. Actual tokens vary by language, code density, and tool-call overhead — counts are typically ±10–20% off for English prose, more for code or non-Latin scripts. For exact billing, use the vendor's official tokenizer.
| Model | Input /M | Output /M | Effective blended | Context | Best for |
|---|---|---|---|---|---|
| GLM-4.7 Flash Current | $0.00 cache $0.00 | $0.00 | $0.00 agentic 92/8 | 200K | Free registered-user tier |
| GLM-4.5 Flash | $0.00 cache $0.00 | $0.00 | $0.00 same blend | 128K | Free GLM-4.5 fallback |
| GLM-4.7 FlashX | $0.07 cache $0.01 | $0.40 | $0.05 pricier | 200K | Ultra-cheap GLM traffic |
| GLM-4.5 Air | $0.20 cache $0.03 | $1.10 | $0.14 pricier | 128K | Budget GLM-4.5 production |
| GLM-4.7 | $0.60 cache $0.11 | $2.20 | $0.36 pricier | 200K | Mid-tier GLM agents |
| DeepSeek V4 Flash | $0.14 cache $0.00 | $0.28 | $0.05 pricier | 1M | Budget reasoning and coding |
| Qwen3 Coder Flash | $0.30 | $1.50 | $0.40 pricier | 1M | Cheap code-focused Qwen |
Frequently asked.
Practical pricing questions, separated from calculator assumptions and regional tiers.
Q · 01 What is GLM-4.7 Flash priced at? +
$0/M input and $0/M output on the live Z.AI pricing table. This page stores USD per-million-token pricing, so the calculator treats the token charge as zero.Q · 02 How is the effective price calculated? +
$0/M.Q · 03 Is prompt caching priced separately? +
Q · 04 Are regional prices different? +
Q · 05 Is there a batch discount? +
Q · 06 How accurate is the tokenizer estimate? +
zhipu-tokenizer-estimate chars-per-token estimate for English text. It is useful for rough planning, but actual billing comes from the vendor API usage fields and can differ for Chinese, code, or mixed-language prompts.