ModelPriceWatch · Updated Jun 25, 2026

GLM-4-32B-0414 vs GLM-4.7-FlashX

Side-by-side API pricing comparison · Z.AI vs Z.AI

🏆 GLM-4-32B-0414 is 57.4% cheaper on blended cost ($0.100 vs $0.235/Mtok)

GLM-4-32B-0414

by Z.AI

Current budget Open weights
Input: $0.100/Mtok
Output: $0.100/Mtok
✓ Cheaper
Blended avg: $0.100/Mtok
Context: 128K tokens
Modality: text
Parameters: 32B
Released: Apr 1, 2025

GLM-4.7-FlashX

by Z.AI

Current budget Open weights
Input: $0.070/Mtok
Output: $0.400/Mtok
Blended avg: $0.235/Mtok
Cached input: $0.010/Mtok
Context: 128K tokens
Modality: text
Parameters: Proprietary
Released: Jun 1, 2025

Cost at scale (50/50 input/output split)

Volume | GLM-4-32B-0414 | GLM-4.7-FlashX | Savings
1M tokens | $0.100 | $0.235 | $0.135 (57.4%)
10M tokens | $1.00 | $2.35 | $1.35 (57.4%)
100M tokens | $10.00 | $23.50 | $13.50 (57.4%)
1,000M tokens | $100.00 | $235.00 | $135.00 (57.4%)
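
The figures in this table follow from the listed per-token prices and a 50/50 split. The sketch below is one way to reproduce them. The helper names (`blended_price`, `cost_usd`, `savings_row`) are illustrative and not part of any provider API. The prices are copied from this page and should be checked against the official pricing before use.

```python
# Prices in USD per million tokens, copied from the comparison on this page.
PRICES = {
    "GLM-4-32B-0414": {"input": 0.100, "output": 0.100},
    "GLM-4.7-FlashX": {"input": 0.070, "output": 0.400},
}


def blended_price(input_price: float, output_price: float, input_share: float = 0.5) -> float:
    """Weighted average $/Mtok for a given share of input tokens."""
    return input_price * input_share + output_price * (1.0 - input_share)


def cost_usd(price_per_mtok: float, tokens: int) -> float:
    """Total cost in USD for a token volume at a $/Mtok price."""
    return price_per_mtok * tokens / 1_000_000


def savings_row(tokens: int, input_share: float = 0.5):
    """Return (cost_a, cost_b, absolute saving, saving as a fraction of cost_b)."""
    a, b = PRICES["GLM-4-32B-0414"], PRICES["GLM-4.7-FlashX"]
    cost_a = cost_usd(blended_price(a["input"], a["output"], input_share), tokens)
    cost_b = cost_usd(blended_price(b["input"], b["output"], input_share), tokens)
    saving = cost_b - cost_a
    return cost_a, cost_b, saving, saving / cost_b


for tokens in (1_000_000, 10_000_000, 100_000_000, 1_000_000_000):
    cost_a, cost_b, saving, pct = savings_row(tokens)
    print(f"{tokens // 1_000_000}M tokens: ${cost_a:.3f} vs ${cost_b:.3f}, "
          f"saves ${saving:.3f} ({pct:.1%})")
```

The saving is expressed as a fraction of the more expensive model's cost, so it stays the same at every volume.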

Summary

GLM-4-32B-0414 by Z.AI costs $0.100/Mtok input and $0.100/Mtok output, with a 128K-token context window. It supports text input.

GLM-4.7-FlashX by Z.AI costs $0.070/Mtok input and $0.400/Mtok output, with a 128K-token context window. It supports text input.

On a blended cost basis (50/50 input/output), GLM-4-32B-0414 is 57.4% cheaper than GLM-4.7-FlashX ($0.100 vs $0.235/Mtok). Put the other way, GLM-4.7-FlashX costs 135% more.
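
The blended result depends on the assumed 50/50 split. GLM-4.7-FlashX has the lower input price, so a sufficiently input-heavy workload can reverse the ranking. The sketch below solves for the input share at which both models cost the same, using only the list prices above. The function name `break_even_input_share` is hypothetical, and cached-input pricing and batch discounts are ignored.

```python
def break_even_input_share(in_a: float, out_a: float, in_b: float, out_b: float) -> float:
    """Input-token share f where model A and model B cost the same.

    Solves in_a*f + out_a*(1 - f) == in_b*f + out_b*(1 - f) for f.
    """
    denom = (in_a - out_a) - (in_b - out_b)
    if denom == 0:
        raise ValueError("price lines are parallel; no single break-even point")
    return (out_b - out_a) / denom


# A = GLM-4-32B-0414, B = GLM-4.7-FlashX (USD per million tokens, from this page).
share = break_even_input_share(0.100, 0.100, 0.070, 0.400)
print(f"Break-even input share: {share:.1%}")
```

With these prices the break-even falls at roughly 91% input tokens. Below that share GLM-4-32B-0414 is cheaper, and above it GLM-4.7-FlashX is.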

Note: Pricing is per million tokens. Actual costs vary with usage patterns, prompt caching, and batch discounts. Always verify against official provider pricing pages.
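
To illustrate how prompt caching can shift these numbers, here is a minimal sketch that mixes GLM-4.7-FlashX's listed cached-input price ($0.010/Mtok) with its regular input price. The cache hit rate is a made-up workload parameter, not a figure from this page, and the assumption that cached and uncached input tokens are billed as a simple weighted mix should be checked against the provider's billing rules.

```python
def effective_input_price(input_price: float, cached_price: float, cache_hit_rate: float) -> float:
    """Average $/Mtok for input when a fraction of input tokens is served from cache."""
    if not 0.0 <= cache_hit_rate <= 1.0:
        raise ValueError("cache_hit_rate must be between 0 and 1")
    return cached_price * cache_hit_rate + input_price * (1.0 - cache_hit_rate)


# GLM-4.7-FlashX list prices from this page (USD per million tokens).
INPUT, CACHED, OUTPUT = 0.070, 0.010, 0.400

# Hypothetical workload: half of all input tokens hit the cache.
eff_input = effective_input_price(INPUT, CACHED, cache_hit_rate=0.5)
blended = 0.5 * eff_input + 0.5 * OUTPUT  # 50/50 input/output split
print(f"Effective input: ${eff_input:.3f}/Mtok, blended: ${blended:.3f}/Mtok")
```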