Granite 4.0 Micro vs Granite Embedding 278M Multilingual
Side-by-side API pricing comparison · IBM vs IBM
🏆
Granite 4.0 Micro is 39.2% cheaper on blended cost ($0.0645 vs $0.106/Mtok)
Granite 4.0 Micro
by IBM
Current budget Open weights
Input
$0.017/Mtok
Output
$0.112/Mtok
✓ Cheaper
| Metric | Value |
|---|---|
| Blended avg | $0.0645/Mtok |
| Context | 128K tokens |
| Modality | text |
| Parameters | Proprietary |
| Released | Oct 19, 2025 |
Granite Embedding 278M Multilingual
by IBM
Current embedding Open weights
Input
$0.106/Mtok
Output
$0.106/Mtok
| Metric | Value |
|---|---|
| Blended avg | $0.106/Mtok |
| Context | Not listed |
| Modality | text |
| Parameters | 278M |
| Released | Jun 1, 2025 |
Cost at scale (50/50 input/output split)
| Volume | Granite 4.0 Micro | Granite Embedding 278M Multilingual | Savings |
|---|---|---|---|
| 1M tokens | $0.0645 | $0.106 | $0.0415 (39.2%) |
| 10M tokens | $0.645 | $1.06 | $0.415 (39.2%) |
| 100M tokens | $6.45 | $10.60 | $4.15 (39.2%) |
| 1000M tokens | $64.50 | $106.00 | $41.50 (39.2%) |
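The figures in this table can be reproduced from the listed per-Mtok prices. The following is a minimal sketch, assuming a 50/50 input/output split and no caching or batch discounts; the names `PRICES`, `cost_usd`, and `savings` are illustrative and not part of any provider API.

```python
from decimal import Decimal

# Per-million-token prices (USD) copied from the comparison above:
# (input price, output price).
PRICES = {
    "Granite 4.0 Micro": (Decimal("0.017"), Decimal("0.112")),
    "Granite Embedding 278M Multilingual": (Decimal("0.106"), Decimal("0.106")),
}


def cost_usd(model, total_tokens, input_share=Decimal("0.5")):
    """Cost of total_tokens split between input and output at the listed rates."""
    input_price, output_price = PRICES[model]
    blended = input_price * input_share + output_price * (1 - input_share)
    return blended * Decimal(total_tokens) / Decimal(1_000_000)


def savings(total_tokens):
    """Absolute and percentage savings of Micro relative to the embedding model."""
    micro = cost_usd("Granite 4.0 Micro", total_tokens)
    embed = cost_usd("Granite Embedding 278M Multilingual", total_tokens)
    return embed - micro, (embed - micro) / embed * 100


for volume in (1_000_000, 10_000_000, 100_000_000, 1_000_000_000):
    saved, pct = savings(volume)
    print(f"{volume:>13,} tokens: saves ${saved:.4f} ({pct:.1f}%)")
```

Because both costs scale linearly with volume, the percentage saving is the same at every row; only the absolute dollar amount changes.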
Summary
Granite 4.0 Micro by IBM costs $0.017/Mtok input and $0.112/Mtok output, with a 128K-token context window. It supports text input.
Granite Embedding 278M Multilingual by IBM costs $0.106/Mtok input and $0.106/Mtok output; no context window is listed. It supports text input.
On a blended cost basis (50/50 input/output), Granite 4.0 Micro is 39.2% cheaper than Granite Embedding 278M Multilingual. Context windows cannot be compared because none is listed for the embedding model.
Note: Pricing is per million tokens. Actual costs vary with usage patterns, prompt caching, and batch discounts. Always verify against official provider pricing pages.
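To illustrate how usage patterns shift the comparison, here is a rough sketch that recomputes Granite 4.0 Micro's blended price for different input/output mixes and finds the mix at which it matches the flat $0.106/Mtok rate. It uses only the listed prices and ignores caching and batch discounts; the function names are hypothetical.

```python
# Listed per-million-token prices (USD) from the comparison above.
MICRO_INPUT, MICRO_OUTPUT = 0.017, 0.112
EMBEDDING_FLAT = 0.106  # same listed rate for input and output


def micro_blended(input_share):
    """Granite 4.0 Micro price per Mtok when input_share of tokens are input."""
    return MICRO_INPUT * input_share + MICRO_OUTPUT * (1.0 - input_share)


def break_even_input_share():
    """Input share at which Micro's blended price equals the flat embedding rate."""
    return (MICRO_OUTPUT - EMBEDDING_FLAT) / (MICRO_OUTPUT - MICRO_INPUT)


for share in (0.0, 0.25, 0.5, 0.75, 1.0):
    price = micro_blended(share)
    print(f"input share {share:.0%}: ${price:.4f}/Mtok vs ${EMBEDDING_FLAT:.3f}/Mtok")

print(f"break-even input share: {break_even_input_share():.1%}")
```

Under these assumptions, Granite 4.0 Micro stays cheaper as long as more than roughly 6% of tokens are input; a workload that is almost entirely output would pay the $0.112/Mtok output rate, slightly above $0.106/Mtok.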