Granite 4.0 Micro vs Granite 4 H Small
Side-by-side comparison of API pricing, specs, benchmarks, and capabilities
🏆
Granite 4.0 Micro is 140.3% cheaper on blended cost ($0.065 vs $0.155/Mtok)
|
by IBM
|
by IBM
|
|
|---|---|---|
| Overview | ||
| Status | Current budget Open weights | Current budget Open weights |
| Released | Oct 19, 2025 | Oct 19, 2025 |
| Pricing per million tokens | ||
| Input | $0.017/Mtok | $0.060/Mtok |
| Output | $0.112/Mtok | $0.250/Mtok |
| Blended avg | $0.065/Mtok | $0.155/Mtok |
| Specifications | ||
| Context window | 128K tokens | 128K tokens |
| Parameters | Proprietary | Proprietary |
| Speed (TPS) | — | — |
| Modalities | ||
| Input |
text
|
text
|
| Providers | ||
| Available from |
IBM — $0.017/$0.112/Mtok
|
IBM — $0.060/$0.250/Mtok
|
Cost at scale — 1M tokens (50/50 input/output)
| Volume | Granite 4.0 Micro | Granite 4 H Small | Savings |
|---|---|---|---|
| 1M tokens | $0.06 | $0.16 | $0.09 (58.1%) |
| 10M tokens | $0.65 | $1.55 | $0.91 (58.7%) |
| 100M tokens | $6.45 | $15.5 | $9.05 (58.4%) |
| 1000M tokens | $64.5 | $155 | $90.5 (58.4%) |
Try Granite 4.0 Micro on IBM
The cheaper option here — Granite 4.0 Micro costs $0.065/Mtok blended on IBM.
Summary
Granite 4.0 Micro by IBM costs $0.017/Mtok input and $0.112/Mtok output, with a 128K-token context window. It supports text input.
Granite 4 H Small by IBM costs $0.060/Mtok input and $0.250/Mtok output, with a 128K-token context window. It supports text input.
On a blended cost basis, Granite 4.0 Micro is 140.3% cheaper than Granite 4 H Small.
Note: Pricing is per million tokens. Actual costs vary with usage patterns, prompt caching, and batch discounts. Always verify against official provider pricing pages.