Updated Jul 1, 2026

Granite 4.0 Micro vs Granite 4 H Small

Side-by-side comparison of API pricing, specs, benchmarks, and capabilities

🏆 Granite 4.0 Micro is 58.4% cheaper on blended cost ($0.065 vs $0.155/Mtok); equivalently, Granite 4 H Small costs 140.3% more
 
                      Granite 4.0 Micro             Granite 4 H Small
                      by IBM                        by IBM

Overview
Status                Current budget Open weights   Current budget Open weights
Released              Oct 19, 2025                  Oct 19, 2025

Pricing per million tokens
Input                 $0.017/Mtok                   $0.060/Mtok
Output                $0.112/Mtok                   $0.250/Mtok
Blended avg           $0.065/Mtok                   $0.155/Mtok

Specifications
Context window        128K tokens                   128K tokens
Parameters            Proprietary                   Proprietary
Speed (TPS)

Modalities
Input                 text                          text

Providers
Available from        IBM ($0.017/$0.112/Mtok)      IBM ($0.060/$0.250/Mtok)

Cost at scale (50/50 input/output split)

Volume          Granite 4.0 Micro   Granite 4 H Small   Savings
1M tokens       $0.0645             $0.155              $0.0905 (58.4%)
10M tokens      $0.645              $1.55               $0.905 (58.4%)
100M tokens     $6.45               $15.50              $9.05 (58.4%)
1000M tokens    $64.50              $155.00             $90.50 (58.4%)
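
The cost-at-scale figures follow from simple arithmetic on the listed per-token prices. Here is a minimal Python sketch of that calculation, assuming a 50/50 input/output split by default. The function name `cost_usd` and the `input_share` parameter are illustrative only and not part of any provider API.

```python
def cost_usd(total_tokens, input_price, output_price, input_share=0.5):
    """Cost in USD for total_tokens, with prices given in $/Mtok.

    input_share is the fraction of tokens billed as input (hypothetical
    parameter; 0.5 matches the 50/50 split used in the table).
    """
    if not 0.0 <= input_share <= 1.0:
        raise ValueError("input_share must be between 0 and 1")
    mtok = total_tokens / 1_000_000
    blended = input_share * input_price + (1 - input_share) * output_price
    return mtok * blended


# Listed prices in $/Mtok as (input, output).
MICRO = (0.017, 0.112)
H_SMALL = (0.060, 0.250)

for volume in (1_000_000, 10_000_000, 100_000_000, 1_000_000_000):
    micro = cost_usd(volume, *MICRO)
    small = cost_usd(volume, *H_SMALL)
    saved = small - micro
    print(f"{volume // 1_000_000}M tokens: ${micro:.4f} vs ${small:.4f}, "
          f"saves ${saved:.4f} ({saved / small:.1%})")
```

Because both the cost and the savings scale linearly with volume, the percentage saved is the same at every volume. Passing a different `input_share` (for example `0.9` for an input-heavy workload) shows how the gap shifts with the usage pattern.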

Summary

Granite 4.0 Micro by IBM costs $0.017/Mtok input and $0.112/Mtok output, with a 128K-token context window. It supports text input.

Granite 4 H Small by IBM costs $0.060/Mtok input and $0.250/Mtok output, with a 128K-token context window. It supports text input.

On a blended cost basis, Granite 4.0 Micro is 58.4% cheaper than Granite 4 H Small. Put the other way, Granite 4 H Small costs 140.3% more than Granite 4.0 Micro.
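
A saving can never exceed 100% of the higher price, so it helps to keep "percent cheaper" and "percent more expensive" apart. The two figures 58.4% and 140.3% describe the same price gap measured against different baselines. The short sketch below shows both calculations from the listed prices; the helper names are illustrative.

```python
def pct_cheaper(low, high):
    """How much cheaper `low` is, relative to the higher price."""
    return (high - low) / high * 100


def pct_more_expensive(low, high):
    """How much more `high` costs, relative to the lower price."""
    return (high - low) / low * 100


# Blended averages: simple mean of the listed input and output prices ($/Mtok).
micro_blended = (0.017 + 0.112) / 2
small_blended = (0.060 + 0.250) / 2

print(f"Micro is {pct_cheaper(micro_blended, small_blended):.1f}% cheaper")
print(f"H Small is {pct_more_expensive(micro_blended, small_blended):.1f}% more expensive")
```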

Note: Pricing is per million tokens. Actual costs vary with usage patterns, prompt caching, and batch discounts. Always verify against official provider pricing pages.