Pricing / Compare / GLM-4-32B-0414 vs GLM-4.7-FlashX

GLM-4-32B-0414 vs GLM-4.7-FlashX

Side-by-side API pricing comparison · Z.AI vs Z.AI

🏆 GLM-4-32B-0414 is 135% cheaper on blended cost ($0.100 vs $0.235/Mtok)

GLM-4-32B-0414

by Z.AI

Current budget Open weights

Input

$0.100/Mtok

Output

$0.100/Mtok

✓ Cheaper

Blended avg	$0.100/Mtok
Context	128K tokens
Modality	text
Parameters	32B
Released	Apr 1, 2025

Full details →

GLM-4.7-FlashX

by Z.AI

Current budget Open weights

Input

$0.070/Mtok

Output

$0.400/Mtok

Blended avg	$0.235/Mtok
Cached input	$0.010/Mtok
Context	128K tokens
Modality	text
Parameters	Proprietary
Released	Jun 1, 2025

Full details →

Cost at scale — 1M tokens (50/50 input/output)

Volume	GLM-4-32B-0414	GLM-4.7-FlashX	Savings
1M tokens	$0.1	$0.24	$0.14 (59.6%)
10M tokens	$1	$2.35	$1.35 (57.4%)
100M tokens	$10	$23.5	$13.5 (57.4%)
1000M tokens	$100	$235	$135 (57.4%)

Summary

GLM-4-32B-0414 by Z.AI costs $0.100/Mtok input and $0.100/Mtok output, with a 128K-token context window. It supports text input.

GLM-4.7-FlashX by Z.AI costs $0.070/Mtok input and $0.400/Mtok output, with a 128K-token context window. It supports text input.

On a blended cost basis, GLM-4-32B-0414 is 135% cheaper than GLM-4.7-FlashX.

Note: Pricing is per million tokens. Actual costs vary with usage patterns, prompt caching, and batch discounts. Always verify against official provider pricing pages.

GLM-4-32B-0414 vs GLM-4.7-FlashX

GLM-4-32B-0414

GLM-4.7-FlashX

Cost at scale — 1M tokens (50/50 input/output)

Summary

More comparisons