LIVE Cheapest: GLM-4.7-Flash $0.000/Mtok in 153 models tracked Updated Jun 25, 2026
Jun 25, 2026
ModelPriceWatch$/Mtok
Pricing / Compare / GLM-5.2 vs Qwen3-Max

GLM-5.2 vs Qwen3-Max

Side-by-side API pricing comparison · Z.AI vs Alibaba

🏆 GLM-5.2 is 3.4% cheaper on blended cost ($2.90 vs $3.00/Mtok)

GLM-5.2

by Z.AI

Current flagship Open weights
Input
$1.40/Mtok
Output
$4.40/Mtok
✓ Cheaper
Blended avg$2.90/Mtok
Cached input$0.260/Mtok
Context128K tokens
Modalitytext
ParametersProprietary
ReleasedJan 1, 2026
Full details →

Qwen3-Max

by Alibaba

Current flagship
Input
$1.20/Mtok
Output
$4.80/Mtok
Blended avg$3.00/Mtok
Context262K tokens
Modalitytext
ParametersProprietary
ReleasedJan 1, 2026
Full details →

Cost at scale — 1M tokens (50/50 input/output)

VolumeGLM-5.2Qwen3-MaxSavings
1M tokens $2.9 $3 $0.1 (3.3%)
10M tokens $29 $30 $1 (3.3%)
100M tokens $290 $300 $10 (3.3%)
1000M tokens $2900 $3000 $100 (3.3%)

Summary

GLM-5.2 by Z.AI costs $1.40/Mtok input and $4.40/Mtok output, with a 128K-token context window. It supports text input.

Qwen3-Max by Alibaba costs $1.20/Mtok input and $4.80/Mtok output, with a 262K-token context window. It supports text input.

On a blended cost basis, GLM-5.2 is 3.4% cheaper than Qwen3-Max.

Note: Pricing is per million tokens. Actual costs vary with usage patterns, prompt caching, and batch discounts. Always verify against official provider pricing pages.