LIVE Cheapest: GLM-4.7-Flash $0/Mtok in 154 models tracked Updated Jul 1, 2026
Jul 1, 2026
ModelPriceWatch$/Mtok
Pricing / Compare / GLM-5.2 vs GPT-5.4

GLM-5.2 vs GPT-5.4

Side-by-side comparison of API pricing, specs, benchmarks, and capabilities

🏆 GLM-5.2 is 201.7% cheaper on blended cost ($2.90 vs $8.75/Mtok)
 
by Z.AI
2 providers: Together $1.40/$4.40 Z.AI $1.40/$4.40
Overview
Status Current flagship Open weights Current flagship
Released Jun 13, 2026 Aug 15, 2025
Pricing per million tokens
Input $1.40/Mtok $2.50/Mtok
Output $4.40/Mtok $15.00/Mtok
Blended avg $2.90/Mtok $8.75/Mtok
Cached input $0.260/Mtok $0.250/Mtok
Specifications
Context window 1M tokens 1M tokens
Parameters Proprietary Proprietary
Speed (TPS)
Modalities
Input
text
textimage
Benchmarks from Vellum LLM Leaderboard
Avg benchmark score 78.7
Perf / dollar 9
GPQA Diamond 68 88
SWE-Bench 45 75
HumanEval 84 94.5
MATH 500 76 92
Providers
Available from
Together — $1.40/$4.40/Mtok
Z.AI — $1.40/$4.40/Mtok
OpenAI — $2.50/$15.00/Mtok

Cost at scale — 1M tokens (50/50 input/output)

VolumeGLM-5.2GPT-5.4Savings
1M tokens $2.9 $8.75 $5.85 (66.9%)
10M tokens $29 $87.5 $58.5 (66.9%)
100M tokens $290 $875 $585 (66.9%)
1000M tokens $2900 $8750 $5850 (66.9%)

Summary

GLM-5.2 by Z.AI costs $1.40/Mtok input and $4.40/Mtok output, with a 1M-token context window. It supports text input and is available from 2 providers.

GPT-5.4 by OpenAI costs $2.50/Mtok input and $15.00/Mtok output, with a 1M-token context window. It supports text, image input.

On a blended cost basis, GLM-5.2 is 201.7% cheaper than GPT-5.4.

On benchmarks, GPT-5.4 scores higher (78.7 vs ) on average. In terms of value, GPT-5.4 has better performance per dollar (9 vs ).

Note: Pricing is per million tokens. Actual costs vary with usage patterns, prompt caching, and batch discounts. Always verify against official provider pricing pages.