Updated Jun 25, 2026

Granite 4 H Medium vs Gemini 3.1 Flash

Side-by-side API pricing comparison · IBM vs Google

🏆 Granite 4 H Medium is 73.2% cheaper on blended cost ($0.375 vs $1.40/Mtok)

Granite 4 H Medium

by IBM

Current mid tier · Open weights
Input: $0.150/Mtok
Output: $0.600/Mtok
✓ Cheaper
Blended avg: $0.375/Mtok
Context: 128K tokens
Modality: text
Parameters: Proprietary
Released: Oct 19, 2025

Gemini 3.1 Flash

by Google

Current mid tier
Input: $0.300/Mtok
Output: $2.50/Mtok
Blended avg: $1.40/Mtok
Context: 1M tokens
Modality: text, image, audio, video
Parameters: Proprietary
Released: Nov 1, 2025

Cost at scale (50/50 input/output split)

Volume          Granite 4 H Medium   Gemini 3.1 Flash   Savings
1M tokens       $0.375               $1.40              $1.025 (73.2%)
10M tokens      $3.75                $14.00             $10.25 (73.2%)
100M tokens     $37.50               $140.00            $102.50 (73.2%)
1,000M tokens   $375.00              $1,400.00          $1,025.00 (73.2%)
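The figures in this table can be reproduced from the listed per-Mtok prices. The following is a minimal sketch, assuming an even 50/50 split between input and output tokens; the names `PRICES`, `blended_price`, and `cost_at_volume` are illustrative and not part of any provider API.

```python
# Listed prices in USD per million tokens: (input, output).
PRICES = {
    "Granite 4 H Medium": (0.150, 0.600),
    "Gemini 3.1 Flash": (0.300, 2.50),
}


def blended_price(input_price: float, output_price: float) -> float:
    """Average of input and output price, assuming a 50/50 token split."""
    return 0.5 * input_price + 0.5 * output_price


def cost_at_volume(model: str, total_mtok: float) -> float:
    """Total cost in USD for `total_mtok` million tokens at the blended price."""
    input_price, output_price = PRICES[model]
    return blended_price(input_price, output_price) * total_mtok


for volume in (1, 10, 100, 1000):
    granite = cost_at_volume("Granite 4 H Medium", volume)
    gemini = cost_at_volume("Gemini 3.1 Flash", volume)
    savings = gemini - granite
    print(f"{volume}M tokens: ${granite:,.3f} vs ${gemini:,.2f}, "
          f"savings ${savings:,.3f} ({savings / gemini:.1%})")
```

Because both costs scale linearly with volume, the savings percentage is the same on every row; any variation in a published table comes from rounding the dollar amounts before dividing.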

Summary

Granite 4 H Medium by IBM costs $0.150/Mtok input and $0.600/Mtok output, with a 128K-token context window. It supports text input.

Gemini 3.1 Flash by Google costs $0.300/Mtok input and $2.50/Mtok output, with a 1M-token context window. It supports text, image, audio, and video input.

On a blended cost basis, Granite 4 H Medium is 73.2% cheaper than Gemini 3.1 Flash ($0.375 vs $1.40/Mtok). Equivalently, Gemini 3.1 Flash costs 273.3% more than Granite 4 H Medium.
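A price gap can be expressed relative to either model, and the two percentages differ considerably. The sketch below works through both using the blended prices listed on this page; the function names are illustrative only.

```python
def percent_cheaper(lower: float, higher: float) -> float:
    """How much cheaper `lower` is, as a percentage of the higher price."""
    return (higher - lower) / higher * 100


def percent_more_expensive(lower: float, higher: float) -> float:
    """How much more `higher` costs, as a percentage of the lower price."""
    return (higher - lower) / lower * 100


granite_blended = 0.375  # USD per Mtok, Granite 4 H Medium
gemini_blended = 1.40    # USD per Mtok, Gemini 3.1 Flash

cheaper = percent_cheaper(granite_blended, gemini_blended)
pricier = percent_more_expensive(granite_blended, gemini_blended)
print(f"Granite 4 H Medium is {cheaper:.1f}% cheaper")
print(f"Gemini 3.1 Flash is {pricier:.1f}% more expensive")
```

A "percent cheaper" figure is bounded by 100%, since a price cannot drop by more than its full amount. A value above 100% therefore describes how much more the pricier model costs, measured against the cheaper one.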

Note: Pricing is per million tokens. Actual costs vary with usage patterns, prompt caching, and batch discounts. Always verify against official provider pricing pages.
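To illustrate how usage patterns shift the comparison, here is a rough sketch that recomputes the blended price for different input/output mixes. The `input_share` parameter and the function names are assumptions introduced for illustration; the sketch uses only the list prices above and does not model prompt caching or batch discounts.

```python
# Listed prices in USD per million tokens: (input, output).
GRANITE = (0.150, 0.600)  # Granite 4 H Medium
GEMINI = (0.300, 2.50)    # Gemini 3.1 Flash


def mixed_price(prices: tuple[float, float], input_share: float) -> float:
    """Price per Mtok when `input_share` of all tokens are input tokens."""
    if not 0.0 <= input_share <= 1.0:
        raise ValueError("input_share must be between 0 and 1")
    input_price, output_price = prices
    return input_share * input_price + (1.0 - input_share) * output_price


def savings_percent(input_share: float) -> float:
    """Percentage by which Granite undercuts Gemini at a given token mix."""
    granite = mixed_price(GRANITE, input_share)
    gemini = mixed_price(GEMINI, input_share)
    return (gemini - granite) / gemini * 100


for share in (0.1, 0.5, 0.9):
    print(f"{share:.0%} input tokens: "
          f"${mixed_price(GRANITE, share):.3f} vs ${mixed_price(GEMINI, share):.3f}/Mtok, "
          f"{savings_percent(share):.1f}% cheaper")
```

Under these list prices the gap narrows for input-heavy workloads, because the input prices differ by a factor of 2 while the output prices differ by a factor of roughly 4.2.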