Granite 4 H Small vs Granite 4 H Medium
Side-by-side API pricing comparison · IBM vs IBM
🏆
Granite 4 H Small is 141.9% cheaper on blended cost ($0.155 vs $0.375/Mtok)
Granite 4 H Small
by IBM
Current budget Open weightsInput
$0.060/Mtok
Output
$0.250/Mtok
✓ Cheaper
| Blended avg | $0.155/Mtok |
|---|---|
| Context | 128K tokens |
| Modality | text |
| Parameters | Proprietary |
| Released | Oct 19, 2025 |
Granite 4 H Medium
by IBM
Current mid tier Open weightsInput
$0.150/Mtok
Output
$0.600/Mtok
| Blended avg | $0.375/Mtok |
|---|---|
| Context | 128K tokens |
| Modality | text |
| Parameters | Proprietary |
| Released | Oct 19, 2025 |
Cost at scale — 1M tokens (50/50 input/output)
| Volume | Granite 4 H Small | Granite 4 H Medium | Savings |
|---|---|---|---|
| 1M tokens | $0.16 | $0.38 | $0.22 (58.7%) |
| 10M tokens | $1.55 | $3.75 | $2.2 (58.7%) |
| 100M tokens | $15.5 | $37.5 | $22 (58.7%) |
| 1000M tokens | $155 | $375 | $220 (58.7%) |
Summary
Granite 4 H Small by IBM costs $0.060/Mtok input and $0.250/Mtok output, with a 128K-token context window. It supports text input.
Granite 4 H Medium by IBM costs $0.150/Mtok input and $0.600/Mtok output, with a 128K-token context window. It supports text input.
On a blended cost basis, Granite 4 H Small is 141.9% cheaper than Granite 4 H Medium.
Note: Pricing is per million tokens. Actual costs vary with usage patterns, prompt caching, and batch discounts. Always verify against official provider pricing pages.