Pricing / Compare / Granite 4 H Medium vs Gemini 3.1 Flash

Granite 4 H Medium vs Gemini 3.1 Flash

Side-by-side API pricing comparison · IBM vs Google

🏆 Granite 4 H Medium is 273.3% cheaper on blended cost ($0.375 vs $1.40/Mtok)

Granite 4 H Medium

by IBM

Current mid tier Open weights

Input

$0.150/Mtok

Output

$0.600/Mtok

✓ Cheaper

Blended avg	$0.375/Mtok
Context	128K tokens
Modality	text
Parameters	Proprietary
Released	Oct 19, 2025

Full details →

Gemini 3.1 Flash

by Google

Current mid tier

Input

$0.300/Mtok

Output

$2.50/Mtok

Blended avg	$1.40/Mtok
Context	1M tokens
Modality	text, image, audio, video
Parameters	Proprietary
Released	Nov 1, 2025

Full details →

Cost at scale — 1M tokens (50/50 input/output)

Volume	Granite 4 H Medium	Gemini 3.1 Flash	Savings
1M tokens	$0.38	$1.4	$1.02 (72.9%)
10M tokens	$3.75	$14	$10.25 (73.2%)
100M tokens	$37.5	$140	$102.5 (73.2%)
1000M tokens	$375	$1400	$1025 (73.2%)

Summary

Granite 4 H Medium by IBM costs $0.150/Mtok input and $0.600/Mtok output, with a 128K-token context window. It supports text input.

Gemini 3.1 Flash by Google costs $0.300/Mtok input and $2.50/Mtok output, with a 1M-token context window. It supports text, image, audio, video input.

On a blended cost basis, Granite 4 H Medium is 273.3% cheaper than Gemini 3.1 Flash.

Note: Pricing is per million tokens. Actual costs vary with usage patterns, prompt caching, and batch discounts. Always verify against official provider pricing pages.

Granite 4 H Medium vs Gemini 3.1 Flash

Granite 4 H Medium

Gemini 3.1 Flash

Cost at scale — 1M tokens (50/50 input/output)

Summary

More comparisons