Granite 4 H Medium vs Gemini 3.1 Flash
Side-by-side API pricing comparison · IBM vs Google
🏆
Granite 4 H Medium is 273.3% cheaper on blended cost ($0.375 vs $1.40/Mtok)
Granite 4 H Medium
by IBM
Current mid tier Open weightsInput
$0.150/Mtok
Output
$0.600/Mtok
✓ Cheaper
| Blended avg | $0.375/Mtok |
|---|---|
| Context | 128K tokens |
| Modality | text |
| Parameters | Proprietary |
| Released | Oct 19, 2025 |
Gemini 3.1 Flash
by Google
Current mid tierInput
$0.300/Mtok
Output
$2.50/Mtok
| Blended avg | $1.40/Mtok |
|---|---|
| Context | 1M tokens |
| Modality | text, image, audio, video |
| Parameters | Proprietary |
| Released | Nov 1, 2025 |
Cost at scale — 1M tokens (50/50 input/output)
| Volume | Granite 4 H Medium | Gemini 3.1 Flash | Savings |
|---|---|---|---|
| 1M tokens | $0.38 | $1.4 | $1.02 (72.9%) |
| 10M tokens | $3.75 | $14 | $10.25 (73.2%) |
| 100M tokens | $37.5 | $140 | $102.5 (73.2%) |
| 1000M tokens | $375 | $1400 | $1025 (73.2%) |
Summary
Granite 4 H Medium by IBM costs $0.150/Mtok input and $0.600/Mtok output, with a 128K-token context window. It supports text input.
Gemini 3.1 Flash by Google costs $0.300/Mtok input and $2.50/Mtok output, with a 1M-token context window. It supports text, image, audio, video input.
On a blended cost basis, Granite 4 H Medium is 273.3% cheaper than Gemini 3.1 Flash.
Note: Pricing is per million tokens. Actual costs vary with usage patterns, prompt caching, and batch discounts. Always verify against official provider pricing pages.