Last verified 2026-06-19

GLM ARCHITECTFOUNDED 2019TSINGHUA SPIN-OFFFREE TIERSOPEN WEIGHTS

GLM API Pricing

Q: Which GLM model should I start with?

For coding/agentic work: GLM-4.7 ($0.60/$2.20) — it plugs into Claude Code, Cline, and similar tools. For the frontier: GLM-5.2 ($1.4/$4.4). Prototyping for free? GLM-4.7 Flash is $0/M. See the use case picker above.

Q: Are there really free models?

Yes — GLM-4.7 Flash and GLM-4.5 Flash are free ($0/M) for registered users, rate-limited but with no per-token charge and up to 200K context. The cheapest paid tier is GLM-4.7 FlashX at $0.07/$0.40. Few labs offer free frontier-adjacent tiers like this.

Q: Are GLM models open-weight?

Several are. GLM-4.5, GLM-4.6, and the GLM-4 32B snapshot are published as open weights on Hugging Face — self-hostable and a clean way to avoid routing data through the hosted API. The newest GLM-5 tiers are served via the Z.ai / BigModel APIs.

Q: Is Zhipu under US sanctions?

Yes. In January 2025 the US Commerce Department added Zhipu to the Entity List over national-security concerns, restricting its access to US technology. For Western buyers this is a real procurement consideration. Zhipu has since said it trained a major model on Huawei chips in response to chip restrictions.

Q: Where is my data stored?

It depends on the endpoint. The Z.ai international API and the China-domestic BigModel (open.bigmodel.cn) platform store data under their respective jurisdictions — BigModel keeps data in mainland China. For full control, self-host the open GLM weights.

Q: Why is GLM popular for coding?

Zhipu tuned GLM-4.6/4.7 for agentic coding and made them drop-in compatible with Claude Code, Cline, Roo Code, and Kilo Code. Positioned as Claude-Sonnet-class on several coding benchmarks at $0.60/$2.20, GLM-4.7 is a popular low-cost backend for coding agents.

The Beijing lab behind the GLM models, also operating internationally as Z.ai. Spun out of Tsinghua University in 2019, Zhipu ships strong coding/agentic models, multiple free tiers, and open weights — and was the first of China's "Six Tigers" to pursue an IPO. It also sits on the US Entity List.

Open Z.ai docs →View all models Calculate cost

Production models

GLM-5.2 / 5.1 / 4.7 / 4.5

Founded

2019

Tsinghua spin-off

Free tiers

$0/M

GLM-4.7 + 4.5 Flash

Cheapest paid

$0.07/M

GLM-4.7 FlashX

Valuation

>$20B

2025 · first Tiger to IPO

Methodology →

Model	Input /M	Output /M	Cached	Context	Max output	Vision	Tools	Tier
GLM-5.2 FLAGSHIP	$1.40	$4.40	$0.26−81%	1M	—	✗	✓	Frontier
GLM-5.1	$1.40	$4.40	$0.26−81%	200K	—	✗	✓	Frontier
GLM-5	$1.00	$3.20	$0.2−80%	200K	—	✗	✓	Frontier
GLM-5 Turbo	$1.20	$4.00	$0.24−80%	200K	—	✗	✓	Frontier
GLM-4 32B (0414, 128K)	$0.1	$0.1	—	128K	—	✗	✓	Light
GLM-4.7	$0.6	$2.20	$0.11−82%	200K	—	✗	✓	Mid
GLM-4.7 FlashX	$0.07	$0.4	$0.01−86%	200K	—	✗	✓	Light
GLM-4.7 Flash	Free	Free	Free	200K	—	✗	✓	Light
GLM-4.6	$0.6	$2.20	$0.11−82%	200K	—	✗	✓	Mid
GLM-4.5 Air	$0.2	$1.10	$0.03−85%	128K	—	✗	✓	Light
GLM-4.5 Flash	Free	Free	Free	128K	—	✗	✓	Light
GLM-4.5 X	$2.20	$8.90	$0.45−80%	128K	—	✗	✓	Frontier
GLM-4.5 AirX	$1.10	$4.50	$0.22−80%	128K	—	✗	✓	Mid
GLM-4.5	$0.6	$2.20	$0.11−82%	128K	—	✗	✓	Active

§ 03 / PRICE CURVE

Pricing across the lineup.

How Zhipu (Z.ai / GLM) priced 3 models · JUL 25 → JUN 26.

oldest → newest →

Input · newest $1.4/M

Output · newest $4.4/M

Each point is a model at its listed $/M price.

The GLM story so far.

Releases and corporate moves from Zhipu / Z.ai. Sourced from Z.ai docs, Zhipu press, and Wikipedia — verified at publication.

JUN 16 - 2026

GLM-5.2 ships as the new 1M-context flagship for long-horizon coding and project-scale engineering.

release

APR 03 · 2026

GLM-5.1 released as the previous flagship at $1.4/$4.4, 200K context

RELEASE

EARLY · 2026

Zhipu completes its IPO — the first of China's "Six Tigers" to list publicly

CORPORATE

FEB · 2026

Zhipu says it trained a major model on Huawei chips — a workaround amid US chip restrictions

CORPORATE

SEP · 2025

GLM-4.6 released — open-weight, positioned on-par with Claude Sonnet 4 on several coding benchmarks

RELEASE

JUL · 2025

GLM-4.5 released as open weights — runs on eight Nvidia H20 chips

RELEASE

JAN · 2025

Added to the US Entity List by the Commerce Department over national-security concerns

CORPORATE

2019

Zhipu AI spun out of Tsinghua University (Tang Jie, Li Juanzi); GLM series follows

CORPORATE

§ 04 / ACCESS

Where to get it.

Methodology →

PRIMARY · INTERNATIONAL

Z.ai API

docs.z.ai — the international Z.ai brand and API for the full GLM catalog, including free Flash tiers. OpenAI-compatible; cache-hit input is a fraction of base.

Console + API →

CHINA · DOMESTIC

Zhipu BigModel

open.bigmodel.cn — Zhipu's China-domestic platform (BigModel / 智谱), keeping data inside mainland China for domestic compliance.

China data residency →

OPEN-WEIGHT · SELF-HOST

Hugging Face

GLM open weights — GLM-4.5, GLM-4.6, and the GLM-4 32B snapshot are published for self-hosting, fully outside any China-routing of your data.

GLM open weights →

§ 04 / BEST FOR

Which Zhipu (Z.ai / GLM) for what.

More scenarios →

If you need the current frontier flagship…

GLM-5.2

Open GLM-5.2 pricing

If you're doing coding / agentic work (Claude Code, Cline)…

GLM-4.7

Profile →

If you need the base GLM-4.5 MoE...

GLM-4.5

Profile →

If you want a cheap budget tier…

GLM-4.5 Air

Profile →

If you're prototyping for free…

GLM-4.7 Flash

Profile →

If you must self-host open weights…

GLM-4 32B

Profile →

If you need China data residency…

Zhipu BigModel

BigModel →

§ 06 / BACKGROUND

The company behind it.

z.ai →

Zhipu AI (智谱) was spun out of Tsinghua University in 2019 by professors Tang Jie and Li Juanzi, and is led by CEO Zhang Peng. It develops the GLM (General Language Model) series with Tsinghua's KEG lab, and operates internationally under the Z.ai brand. It is one of China's "Six Little Tigers" of AI.

Zhipu's lineup is unusually broad and price-aggressive: the current GLM-5.2 flagship, previous GLM-5.1, and GLM-5 base, the coding-focused GLM-4.7 (compatible with Claude Code, Cline, Roo Code), a premium GLM-4.5 X tier, and — distinctively — two free models (GLM-4.7 Flash and GLM-4.5 Flash) plus an ultra-cheap GLM-4.7 FlashX at $0.07/M. Several models, including GLM-4.5 and GLM-4.6, are released as open weights.

On funding, Zhipu raised from Alibaba, Tencent, Ant Group, and Saudi Arabia's Prosperity7, with a valuation pushed above $20 billion by 2025. It was the first of the "Six Tigers" to complete an IPO, listing in early 2026.

Two facts must be stated plainly. First, in January 2025 the US Commerce Department added Zhipu to the Entity List over national-security concerns — which restricts its access to US technology and matters for some Western buyers' procurement. Second, in February 2026 Zhipu said it had trained a major model on Huawei chips, a response to those very chip restrictions.

The trade-offs: GLM models are text/code-focused (no native vision in this lineup), the direct China platform stores data under Chinese rules, and the Entity-List status is a real procurement consideration. But Zhipu's open weights and free tiers let you sidestep the hosted API entirely. Versus DeepSeek and Moonshot, Zhipu competes on coding strength, free access, and breadth.

§ 07 / COMPETITORS

Other frontier labs.

All providers →

COST DISRUPTOR

DeepSeek

MIT open weights at the lowest cost-per-token. Comparable open-weight stance; Zhipu counters with free tiers and stronger agentic-coding tooling integration.

$0.14/M input →

CHINESE PEER

Alibaba (Qwen)

Broad open-weight catalog with 119-language coverage and multimodality. Qwen is wider and multimodal; Zhipu is leaner and coding-focused with free tiers.

Qwen3 family →

SIX-TIGERS PEER

Moonshot (Kimi)

1T-param open-weight MoE and long-context heritage. A fellow Beijing "Tiger"; Moonshot chases scale, Zhipu chases breadth + free access.

Kimi K2 family →

PRIMARY RIVAL

OpenAI

Frontier benchmarks + global ecosystem (GPT-5). Closed and far pricier; Zhipu's GLM-4.6 is pitched as Claude-Sonnet-class on coding at a fraction of the cost.

GPT-5 family →

FRONTIER LEADER

Anthropic

Leads coding + reasoning with Claude. Notably, GLM models are tuned to plug into Claude Code — Zhipu targets the same agentic-coding niche far cheaper.

Claude lineup →

CHINA CONSUMER LEADER

ByteDance (Doubao)

The Doubao model family from ByteDance — parent of TikTok and Douyin. Served through the Volcano Ark (火山方舟) platform on ByteDance's Volcano Engine cloud, Doubao powers China's most-used consumer AI app and undercuts most frontier labs on price.

14 models · from $0.02/M →

Frequently asked.

Practical questions about GLM pricing, free tiers, open weights, and sanctions.

Q · 01 Which GLM model should I start with? +

For coding/agentic work: GLM-4.7 ($0.60/$2.20) — it plugs into Claude Code, Cline, and similar tools. For the frontier: GLM-5.2 ($1.4/$4.4). Prototyping for free? GLM-4.7 Flash is $0/M. See the use case picker above.

Q · 02 Are there really free models? +

Yes — GLM-4.7 Flash and GLM-4.5 Flash are free ($0/M) for registered users, rate-limited but with no per-token charge and up to 200K context. The cheapest paid tier is GLM-4.7 FlashX at $0.07/$0.40. Few labs offer free frontier-adjacent tiers like this.

Q · 03 Are GLM models open-weight? +

Several are. GLM-4.5, GLM-4.6, and the GLM-4 32B snapshot are published as open weights on Hugging Face — self-hostable and a clean way to avoid routing data through the hosted API. The newest GLM-5 tiers are served via the Z.ai / BigModel APIs.

Q · 04 Is Zhipu under US sanctions? +

Yes. In January 2025 the US Commerce Department added Zhipu to the Entity List over national-security concerns, restricting its access to US technology. For Western buyers this is a real procurement consideration. Zhipu has since said it trained a major model on Huawei chips in response to chip restrictions.

Q · 05 Where is my data stored? +

It depends on the endpoint. The Z.ai international API and the China-domestic BigModel (open.bigmodel.cn) platform store data under their respective jurisdictions — BigModel keeps data in mainland China. For full control, self-host the open GLM weights.

Q · 06 Why is GLM popular for coding? +

Zhipu tuned GLM-4.6/4.7 for agentic coding and made them drop-in compatible with Claude Code, Cline, Roo Code, and Kilo Code. Positioned as Claude-Sonnet-class on several coding benchmarks at $0.60/$2.20, GLM-4.7 is a popular low-cost backend for coding agents.

Reviewed by Yaroslav Vikhariev Founder · AI//COST · GLM models tested · Pricing from docs.z.ai

Methodology Report a correction More by Y.V.