GLM-4.7-Flash vs Granite 4.0 Micro
Side-by-side API pricing comparison · Zhipu vs IBM
GLM-4.7-Flash
by Zhipu
Current budget Open weightsInput
$0.000/Mtok
Output
$0.000/Mtok
✓ Cheaper
| Blended avg | $0.000/Mtok |
|---|---|
| Context | 128K tokens |
| Modality | text |
| Parameters | Proprietary |
| Released | Jun 1, 2025 |
Granite 4.0 Micro
by IBM
Current budget Open weightsInput
$0.017/Mtok
Output
$0.112/Mtok
| Blended avg | $0.065/Mtok |
|---|---|
| Context | 128K tokens |
| Modality | text |
| Parameters | Proprietary |
| Released | Oct 19, 2025 |
Cost at scale — 1M tokens (50/50 input/output)
| Volume | GLM-4.7-Flash | Granite 4.0 Micro | Savings |
|---|---|---|---|
| 1M tokens | $0 | $0.06 | $0.06 (93%) |
| 10M tokens | $0 | $0.65 | $0.65 (100.8%) |
| 100M tokens | $0 | $6.45 | $6.45 (100%) |
| 1000M tokens | $0 | $64.5 | $64.5 (100%) |
Summary
GLM-4.7-Flash by Zhipu costs $0.000/Mtok input and $0.000/Mtok output, with a 128K-token context window. It supports text input.
Granite 4.0 Micro by IBM costs $0.017/Mtok input and $0.112/Mtok output, with a 128K-token context window. It supports text input.
Note: Pricing is per million tokens. Actual costs vary with usage patterns, prompt caching, and batch discounts. Always verify against official provider pricing pages.