LIVE Cheapest: GLM-4.7-Flash $0.000/Mtok in 153 models tracked Updated Jun 25, 2026
Jun 25, 2026
ModelPriceWatch$/Mtok
Pricing / Compare / Granite 4.0 Micro vs Baichuan M2-32B

Granite 4.0 Micro vs Baichuan M2-32B

Side-by-side API pricing comparison · IBM vs Baichuan

🏆 Granite 4.0 Micro is 8.5% cheaper on blended cost ($0.065 vs $0.070/Mtok)

Granite 4.0 Micro

by IBM

Current budget Open weights
Input
$0.017/Mtok
Output
$0.112/Mtok
✓ Cheaper
Blended avg$0.065/Mtok
Context128K tokens
Modalitytext
ParametersProprietary
ReleasedOct 19, 2025
Full details →

Baichuan M2-32B

by Baichuan

Current budget Open weights
Input
$0.070/Mtok
Output
$0.070/Mtok
Blended avg$0.070/Mtok
Context33K tokens
Modalitytext
Parameters32B
ReleasedJun 1, 2025
Full details →

Cost at scale — 1M tokens (50/50 input/output)

VolumeGranite 4.0 MicroBaichuan M2-32BSavings
1M tokens $0.06 $0.07 $0.01 (14.3%)
10M tokens $0.65 $0.7 $0.06 (8.6%)
100M tokens $6.45 $7 $0.55 (7.9%)
1000M tokens $64.5 $70 $5.5 (7.9%)

Summary

Granite 4.0 Micro by IBM costs $0.017/Mtok input and $0.112/Mtok output, with a 128K-token context window. It supports text input.

Baichuan M2-32B by Baichuan costs $0.070/Mtok input and $0.070/Mtok output, with a 33K-token context window. It supports text input.

On a blended cost basis, Granite 4.0 Micro is 8.5% cheaper than Baichuan M2-32B. It also has a larger context window.

Note: Pricing is per million tokens. Actual costs vary with usage patterns, prompt caching, and batch discounts. Always verify against official provider pricing pages.