GLM-4-32B-0414 vs GLM-4.7-FlashX
Side-by-side API pricing comparison · Z.AI vs Z.AI
🏆
GLM-4-32B-0414 is 135% cheaper on blended cost ($0.100 vs $0.235/Mtok)
GLM-4-32B-0414
by Z.AI
Current budget Open weightsInput
$0.100/Mtok
Output
$0.100/Mtok
✓ Cheaper
| Blended avg | $0.100/Mtok |
|---|---|
| Context | 128K tokens |
| Modality | text |
| Parameters | 32B |
| Released | Apr 1, 2025 |
GLM-4.7-FlashX
by Z.AI
Current budget Open weightsInput
$0.070/Mtok
Output
$0.400/Mtok
| Blended avg | $0.235/Mtok |
|---|---|
| Cached input | $0.010/Mtok |
| Context | 128K tokens |
| Modality | text |
| Parameters | Proprietary |
| Released | Jun 1, 2025 |
Cost at scale — 1M tokens (50/50 input/output)
| Volume | GLM-4-32B-0414 | GLM-4.7-FlashX | Savings |
|---|---|---|---|
| 1M tokens | $0.1 | $0.24 | $0.14 (59.6%) |
| 10M tokens | $1 | $2.35 | $1.35 (57.4%) |
| 100M tokens | $10 | $23.5 | $13.5 (57.4%) |
| 1000M tokens | $100 | $235 | $135 (57.4%) |
Sponsored
Get API key →
Try GLM-4-32B-0414 on Z.AI
Get started with Z.AI's API — GLM-4-32B-0414 costs $0.100/Mtok blended.
Summary
GLM-4-32B-0414 by Z.AI costs $0.100/Mtok input and $0.100/Mtok output, with a 128K-token context window. It supports text input.
GLM-4.7-FlashX by Z.AI costs $0.070/Mtok input and $0.400/Mtok output, with a 128K-token context window. It supports text input.
On a blended cost basis, GLM-4-32B-0414 is 135% cheaper than GLM-4.7-FlashX.
Note: Pricing is per million tokens. Actual costs vary with usage patterns, prompt caching, and batch discounts. Always verify against official provider pricing pages.