GLM ARCHITECT · FOUNDED 2019 · TSINGHUA SPIN-OFF · FREE TIERS · OPEN WEIGHTS

GLM API Pricing

The Beijing lab behind the GLM models, also operating internationally as Z.ai. Spun out of Tsinghua University in 2019, Zhipu ships strong coding/agentic models, multiple free tiers, and open weights — and was the first of China's "Six Tigers" to pursue an IPO. It also sits on the US Entity List.

Production models
12
GLM-5 / 4.7 / 4.5
Founded
2019
Tsinghua spin-off
Free tiers
$0/M
GLM-4.7 + 4.5 Flash
Cheapest paid
$0.07/M
GLM-4.7 FlashX
Valuation
>$20B
2025 · first Tiger to IPO
Open weights
GLM open
GLM-4.5 / 4.6 on HF
§ 01 / LINEUP

The full roster.

Side-by-side →
FRONTIER · REASONING

GLM-5.1

Zhipu's current flagship text model.

Input
$1.40/M
Output
$4.40/M
200K ctx · text-only
FRONTIER · REASONING

GLM-5

GLM-5 mid-tier flagship.

Input
$1.00/M
Output
$3.20/M
200K ctx · text-only
FRONTIER · REASONING

GLM-5 Turbo

Latency-optimised GLM-5 variant.

Input
$1.20/M
Output
$4.00/M
200K ctx · text-only
FAST · LIGHTWEIGHT

GLM-4 32B (0414, 128K)

Open-weight dated GLM-4 snapshot — 32B params, 128K context, symmetric $0.10/$0.10 pricing.

Input
$0.10/M
Output
$0.10/M
128K ctx · text-only
BALANCED · MID-TIER

GLM-4.7

Cheaper sibling — same pricing as GLM-4.5/4.6.

Input
$0.60/M
Output
$2.20/M
200K ctx · text-only
FAST · LIGHTWEIGHT

GLM-4.7 FlashX

Ultra-cheap GLM-4.7 tier.

Input
$0.07/M
Output
$0.40/M
200K ctx · text-only
FAST · LIGHTWEIGHT

GLM-4.7 Flash

FREE tier for all registered Zhipu users.

Input
$0.00/M
Output
$0.00/M
200K ctx · text-only
BALANCED · MID-TIER

GLM-4.6

Positioned by Zhipu as on par with Claude Sonnet 4 on several coding benchmarks.

Input
$0.60/M
Output
$2.20/M
200K ctx · text-only
FAST · LIGHTWEIGHT

GLM-4.5 Air

Budget Zhipu tier.

Input
$0.20/M
Output
$1.10/M
128K ctx · text-only
FAST · LIGHTWEIGHT

GLM-4.5 Flash

FREE tier — Zhipu's older free model alongside GLM-4.7-Flash.

Input
$0.00/M
Output
$0.00/M
128K ctx · text-only
FRONTIER · REASONING

GLM-4.5 X

Premium GLM-4.5 variant — most expensive tier in the 4.5 family.

Input
$2.20/M
Output
$8.90/M
128K ctx · text-only
BALANCED · MID-TIER

GLM-4.5 AirX

Mid-tier between GLM-4.5 Air ($0.20/$1.10) and GLM-4.5 X ($2.20/$8.90).

Input
$1.10/M
Output
$4.50/M
128K ctx · text-only
§ 02 / SHELF

All side-by-side.

Methodology →
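| Model | Input /M | Output /M | Cached /M (vs input) | Context | Max output | Vision | Tools | Tier |
| GLM-5.1 | $1.40 | $4.40 | $0.26 (−81%) | 200K | | | | Frontier |
| GLM-5 | $1.00 | $3.20 | $0.20 (−80%) | 200K | | | | Frontier |
| GLM-5 Turbo | $1.20 | $4.00 | $0.24 (−80%) | 200K | | | | Frontier |
| GLM-4 32B (0414, 128K) | $0.10 | $0.10 | | 128K | | | | Light |
| GLM-4.7 | $0.60 | $2.20 | $0.11 (−82%) | 200K | | | | Mid |
| GLM-4.7 FlashX | $0.07 | $0.40 | $0.01 (−86%) | 200K | | | | Light |
| GLM-4.7 Flash | Free | Free | Free | 200K | | | | Light |
| GLM-4.6 | $0.60 | $2.20 | $0.11 (−82%) | 200K | | | | Mid |
| GLM-4.5 Air | $0.20 | $1.10 | $0.03 (−85%) | 128K | | | | Light |
| GLM-4.5 Flash | Free | Free | Free | 128K | | | | Light |
| GLM-4.5 X (FLAGSHIP) | $2.20 | $8.90 | $0.45 (−80%) | 128K | | | | Frontier |
| GLM-4.5 AirX | $1.10 | $4.50 | $0.22 (−80%) | 128K | | | | Mid |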
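To turn the per-million rates in the table above into a per-request figure, a minimal sketch might look like the following. It assumes cached input tokens are billed at the Cached rate in place of the normal input rate, which the table implies but does not spell out. The names `Price`, `PRICES`, `request_cost`, and `cached_discount_pct` are illustrative and not part of any Zhipu or Z.ai SDK.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Price:
    input_per_m: float   # USD per 1M uncached input tokens
    output_per_m: float  # USD per 1M output tokens
    cached_per_m: float  # USD per 1M cached input tokens


# Rates copied from the table above (USD per million tokens).
PRICES = {
    "GLM-5.1": Price(1.40, 4.40, 0.26),
    "GLM-4.7": Price(0.60, 2.20, 0.11),
    "GLM-4.7 FlashX": Price(0.07, 0.40, 0.01),
}


def request_cost(model: str, input_tokens: int, output_tokens: int,
                 cached_tokens: int = 0) -> float:
    """Estimate the USD cost of one request.

    cached_tokens is the portion of input_tokens served from cache.
    Assumption: that portion is billed at the cached rate instead of
    the input rate.
    """
    if not 0 <= cached_tokens <= input_tokens:
        raise ValueError("cached_tokens must be between 0 and input_tokens")
    p = PRICES[model]
    fresh_tokens = input_tokens - cached_tokens
    total = (fresh_tokens * p.input_per_m
             + cached_tokens * p.cached_per_m
             + output_tokens * p.output_per_m)
    return total / 1_000_000


def cached_discount_pct(model: str) -> int:
    """Percentage saved on cached input relative to the normal input rate."""
    p = PRICES[model]
    return round((1 - p.cached_per_m / p.input_per_m) * 100)


if __name__ == "__main__":
    # A coding-agent style call: 100K input tokens (80K cached), 5K output.
    cost = request_cost("GLM-4.7", 100_000, 5_000, cached_tokens=80_000)
    print(f"GLM-4.7 request: ${cost:.4f}")
    print(f"GLM-5.1 cached discount: {cached_discount_pct('GLM-5.1')}%")
```

The discount helper reproduces the percentages shown in the Cached column, for example $0.26 against $1.40 for GLM-5.1.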

The GLM story so far.

Releases and corporate moves from Zhipu / Z.ai. Sourced from Z.ai docs, Zhipu press, and Wikipedia — verified at publication.

APR 03 · 2026
GLM-5.1 released as the current flagship at $1.40/$4.40 per million tokens, 200K context
RELEASE
EARLY · 2026
Zhipu completes its IPO — the first of China's "Six Tigers" to list publicly
CORPORATE
FEB · 2026
Zhipu says it trained a major model on Huawei chips — a workaround amid US chip restrictions
CORPORATE
SEP · 2025
GLM-4.6 released: open-weight, positioned on par with Claude Sonnet 4 on several coding benchmarks
RELEASE
JUL · 2025
GLM-4.5 released as open weights — runs on eight Nvidia H20 chips
RELEASE
JAN · 2025
Added to the US Entity List by the Commerce Department over national-security concerns
CORPORATE
2019
Zhipu AI spun out of Tsinghua University (Tang Jie, Li Juanzi); GLM series follows
CORPORATE
§ 04 / ACCESS

Where to get it.

Methodology →
§ 05 / BEST FOR

Which Zhipu (Z.ai / GLM) for what.

More scenarios →
If you need the current frontier flagship
GLM-5.1
Profile →
If you're doing coding / agentic work (Claude Code, Cline)…
GLM-4.7
Profile →
If you need the hardest reasoning in the 4.5 line…
GLM-4.5 X
Profile →
If you want a cheap budget tier
GLM-4.5 Air
Profile →
If you're prototyping for free
GLM-4.7 Flash
Profile →
If you must self-host open weights
GLM-4 32B
Profile →
If you need China data residency
Zhipu BigModel
BigModel →
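The picks above are editorial. For a purely price-driven shortlist, a rough sketch could filter the lineup by context window and rank by a blended rate. The `cheapest` function and the `output_share` weighting (the fraction of tokens assumed to be output) are hypothetical choices for illustration; the prices and context sizes are copied from the side-by-side table.

```python
# (name, input $/M, output $/M, context window in K tokens)
MODELS = [
    ("GLM-5.1", 1.40, 4.40, 200),
    ("GLM-5", 1.00, 3.20, 200),
    ("GLM-5 Turbo", 1.20, 4.00, 200),
    ("GLM-4 32B (0414, 128K)", 0.10, 0.10, 128),
    ("GLM-4.7", 0.60, 2.20, 200),
    ("GLM-4.7 FlashX", 0.07, 0.40, 200),
    ("GLM-4.7 Flash", 0.00, 0.00, 200),
    ("GLM-4.6", 0.60, 2.20, 200),
    ("GLM-4.5 Air", 0.20, 1.10, 128),
    ("GLM-4.5 Flash", 0.00, 0.00, 128),
    ("GLM-4.5 X", 2.20, 8.90, 128),
    ("GLM-4.5 AirX", 1.10, 4.50, 128),
]


def cheapest(min_context_k: int = 0, paid_only: bool = False,
             output_share: float = 0.25) -> str:
    """Return the model with the lowest blended $/M that fits the constraints.

    output_share is an assumed fraction of tokens that are output; the
    blended rate is a weighted average of the input and output prices.
    Ties go to the model with the larger context window.
    """
    candidates = [
        (name, inp, out, ctx)
        for name, inp, out, ctx in MODELS
        if ctx >= min_context_k and not (paid_only and inp == 0 and out == 0)
    ]
    if not candidates:
        raise ValueError("no model satisfies the constraints")

    def sort_key(model):
        _, inp, out, ctx = model
        blended = (1 - output_share) * inp + output_share * out
        return (blended, -ctx)

    return min(candidates, key=sort_key)[0]


if __name__ == "__main__":
    print(cheapest())                                   # free tier wins on price
    print(cheapest(min_context_k=200, paid_only=True))  # paid, 200K context
    print(cheapest(paid_only=True))                     # paid, any context
```

Price alone ignores quality, rate limits, and tool compatibility, so a ranking like this is best read as a first filter rather than a recommendation.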
§ 06 / BACKGROUND

The company behind it.

z.ai →

Zhipu AI (智谱) was spun out of Tsinghua University in 2019 by professors Tang Jie and Li Juanzi, and is led by CEO Zhang Peng. It develops the GLM (General Language Model) series with Tsinghua's KEG lab, and operates internationally under the Z.ai brand. It is one of China's "Six Little Tigers" of AI.

Zhipu's lineup is unusually broad and price-aggressive: the current GLM-5.1 flagship and GLM-5 base, the coding-focused GLM-4.7 (compatible with Claude Code, Cline, Roo Code), a premium GLM-4.5 X tier, and — distinctively — two free models (GLM-4.7 Flash and GLM-4.5 Flash) plus an ultra-cheap GLM-4.7 FlashX at $0.07/M. Several models, including GLM-4.5 and GLM-4.6, are released as open weights.

On funding, Zhipu raised from Alibaba, Tencent, Ant Group, and Saudi Arabia's Prosperity7, with a valuation pushed above $20 billion by 2025. It was the first of the "Six Tigers" to complete an IPO, listing in early 2026.

Two facts must be stated plainly. First, in January 2025 the US Commerce Department added Zhipu to the Entity List over national-security concerns — which restricts its access to US technology and matters for some Western buyers' procurement. Second, in February 2026 Zhipu said it had trained a major model on Huawei chips, a response to those very chip restrictions.

The trade-offs: GLM models are text/code-focused (no native vision in this lineup), the direct China platform stores data under Chinese rules, and the Entity-List status is a real procurement consideration. But Zhipu's open weights and free tiers let you sidestep the hosted API entirely. Versus DeepSeek and Moonshot, Zhipu competes on coding strength, free access, and breadth.

§ 07 / COMPETITORS

Other frontier labs.

All providers →

Frequently asked.

Practical questions about GLM pricing, free tiers, open weights, and sanctions.

Q · 01 Which GLM model should I start with? +
For coding/agentic work: GLM-4.7 ($0.60/$2.20), which plugs into Claude Code, Cline, and similar tools. For the frontier: GLM-5.1 ($1.40/$4.40). Prototyping for free? GLM-4.7 Flash is $0/M. See the use case picker above.
Q · 02 Are there really free models? +
Yes. GLM-4.7 Flash (200K context) and GLM-4.5 Flash (128K context) are free ($0/M) for registered users: rate-limited, but with no per-token charge. The lowest paid input rate is GLM-4.7 FlashX at $0.07/$0.40. Few labs offer free frontier-adjacent tiers like this.
Q · 03 Are GLM models open-weight? +
Several are. GLM-4.5, GLM-4.6, and the GLM-4 32B snapshot are published as open weights on Hugging Face — self-hostable and a clean way to avoid routing data through the hosted API. The newest GLM-5 tiers are served via the Z.ai / BigModel APIs.
Q · 04 Is Zhipu under US sanctions? +
Yes. In January 2025 the US Commerce Department added Zhipu to the Entity List over national-security concerns, restricting its access to US technology. For Western buyers this is a real procurement consideration. Zhipu has since said it trained a major model on Huawei chips in response to chip restrictions.
Q · 05 Where is my data stored? +
It depends on the endpoint. The Z.ai international API and the China-domestic BigModel (open.bigmodel.cn) platform store data under their respective jurisdictions — BigModel keeps data in mainland China. For full control, self-host the open GLM weights.
Q · 06 Why is GLM popular for coding? +
Zhipu tuned GLM-4.6/4.7 for agentic coding and made them drop-in compatible with Claude Code, Cline, Roo Code, and Kilo Code. Positioned as Claude-Sonnet-class on several coding benchmarks at $0.60/$2.20, GLM-4.7 is a popular low-cost backend for coding agents.