Pricing / Compare / GLM-4.7-Flash vs Granite 4.0 Micro

GLM-4.7-Flash vs Granite 4.0 Micro

Side-by-side API pricing comparison · Z.AI vs IBM

GLM-4.7-Flash

by Z.AI

Current budget Open weights

Input

$0.000/Mtok

Output

$0.000/Mtok

✓ Cheaper

Blended avg	$0.000/Mtok
Context	128K tokens
Modality	text
Parameters	Proprietary
Released	Jun 1, 2025

Full details →

Granite 4.0 Micro

by IBM

Current budget Open weights

Input

$0.017/Mtok

Output

$0.112/Mtok

Blended avg	$0.065/Mtok
Context	128K tokens
Modality	text
Parameters	Proprietary
Released	Oct 19, 2025

Full details →

Cost at scale — 1M tokens (50/50 input/output)

Volume	GLM-4.7-Flash	Granite 4.0 Micro	Savings
1M tokens	$0	$0.06	$0.06 (93%)
10M tokens	$0	$0.65	$0.65 (100.8%)
100M tokens	$0	$6.45	$6.45 (100%)
1000M tokens	$0	$64.5	$64.5 (100%)

Summary

GLM-4.7-Flash by Z.AI costs $0.000/Mtok input and $0.000/Mtok output, with a 128K-token context window. It supports text input.

Granite 4.0 Micro by IBM costs $0.017/Mtok input and $0.112/Mtok output, with a 128K-token context window. It supports text input.

Note: Pricing is per million tokens. Actual costs vary with usage patterns, prompt caching, and batch discounts. Always verify against official provider pricing pages.

GLM-4.7-Flash vs Granite 4.0 Micro

GLM-4.7-Flash

Granite 4.0 Micro

Cost at scale — 1M tokens (50/50 input/output)

Summary

More comparisons