Pricing / Compare / Granite 4 H Large vs Gemini 3.1 Flash

Granite 4 H Large vs Gemini 3.1 Flash

Side-by-side API pricing comparison · IBM vs Google

🏆 Granite 4 H Large is 86.7% cheaper on blended cost ($0.750 vs $1.40/Mtok)

Granite 4 H Large

by IBM

Current mid tier Open weights

Input

$0.300/Mtok

Output

$1.20/Mtok

✓ Cheaper

Blended avg	$0.750/Mtok
Context	128K tokens
Modality	text
Parameters	Proprietary
Released	Oct 19, 2025

Full details →

Gemini 3.1 Flash

by Google

Current mid tier

Input

$0.300/Mtok

Output

$2.50/Mtok

Blended avg	$1.40/Mtok
Context	1M tokens
Modality	text, image, audio, video
Parameters	Proprietary
Released	Nov 1, 2025

Full details →

Cost at scale — 1M tokens (50/50 input/output)

Volume	Granite 4 H Large	Gemini 3.1 Flash	Savings
1M tokens	$0.75	$1.4	$0.65 (46.4%)
10M tokens	$7.5	$14	$6.5 (46.4%)
100M tokens	$75	$140	$65 (46.4%)
1000M tokens	$750	$1400	$650 (46.4%)

Summary

Granite 4 H Large by IBM costs $0.300/Mtok input and $1.20/Mtok output, with a 128K-token context window. It supports text input.

Gemini 3.1 Flash by Google costs $0.300/Mtok input and $2.50/Mtok output, with a 1M-token context window. It supports text, image, audio, video input.

On a blended cost basis, Granite 4 H Large is 86.7% cheaper than Gemini 3.1 Flash.

Note: Pricing is per million tokens. Actual costs vary with usage patterns, prompt caching, and batch discounts. Always verify against official provider pricing pages.

Granite 4 H Large vs Gemini 3.1 Flash

Granite 4 H Large

Gemini 3.1 Flash

Cost at scale — 1M tokens (50/50 input/output)

Summary

More comparisons