ModelPriceWatch · Updated Jun 25, 2026 · Cheapest of 153 models tracked: GLM-4.7-Flash at $0.000/Mtok

Granite 4 H Large vs Qwen-Plus

Side-by-side API pricing comparison · IBM vs Alibaba

🏆 Granite 4 H Large is 6.3% cheaper on blended cost ($0.750 vs $0.800/Mtok)

Granite 4 H Large

by IBM

Current · mid tier · Open weights
Input: $0.300/Mtok
Output: $1.20/Mtok
Blended avg: $0.750/Mtok (✓ cheaper)
Context: 128K tokens
Modality: text
Parameters: Proprietary
Released: Oct 19, 2025

Qwen-Plus

by Alibaba

Current · mid tier
Input: $0.400/Mtok
Output: $1.20/Mtok
Blended avg: $0.800/Mtok
Context: 131K tokens
Modality: text
Parameters: Proprietary
Released: Oct 1, 2025

Cost at scale (50/50 input/output split)

Volume          Granite 4 H Large   Qwen-Plus   Savings
1M tokens       $0.75               $0.80       $0.05 (6.3%)
10M tokens      $7.50               $8.00       $0.50 (6.3%)
100M tokens     $75.00              $80.00      $5.00 (6.3%)
1,000M tokens   $750.00             $800.00     $50.00 (6.3%)
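
The figures in this table follow from simple arithmetic on the listed prices. Below is a minimal sketch, assuming the blended rate is the plain average of the input and output prices (which matches the $0.750 and $0.800 figures on this page). The function names are illustrative and not part of any provider API.

```python
def blended_rate(input_price: float, output_price: float) -> float:
    """Blended $/Mtok for a 50/50 input/output split (assumed definition)."""
    return (input_price + output_price) / 2.0


def cost_usd(volume_mtok: float, rate_per_mtok: float) -> float:
    """Total cost in USD for a volume given in millions of tokens."""
    return volume_mtok * rate_per_mtok


# List prices taken from the comparison above ($/Mtok).
granite = blended_rate(0.300, 1.20)
qwen = blended_rate(0.400, 1.20)

for volume in (1, 10, 100, 1000):
    g = cost_usd(volume, granite)
    q = cost_usd(volume, qwen)
    saving = q - g
    # Savings are expressed relative to the more expensive model.
    pct = saving / q * 100.0
    print(f"{volume}M tokens: ${g:.2f} vs ${q:.2f}, saving ${saving:.2f} ({pct:.2f}%)")
```

The percentage is computed against the Qwen-Plus cost, which is why it comes out at roughly 6.25% rather than 6.7%.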

Summary

Granite 4 H Large by IBM costs $0.300/Mtok input and $1.20/Mtok output, with a 128K-token context window. It supports text input.

Qwen-Plus by Alibaba costs $0.400/Mtok input and $1.20/Mtok output, with a 131K-token context window. It supports text input.

On a blended cost basis, Granite 4 H Large is 6.3% cheaper than Qwen-Plus ($0.750 vs $0.800/Mtok); equivalently, Qwen-Plus costs 6.7% more.

Note: Pricing is per million tokens. Actual costs vary with usage patterns, prompt caching, and batch discounts. Always verify against official provider pricing pages.
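
Because both models list the same output price, the size of the gap depends on how input-heavy a workload is. The sketch below assumes only the list prices on this page and ignores prompt caching and batch discounts, whose rates are not given here. `input_share` is a hypothetical parameter for the fraction of tokens that are input.

```python
# List prices from this page: (input $/Mtok, output $/Mtok).
PRICES = {
    "Granite 4 H Large": (0.300, 1.20),
    "Qwen-Plus": (0.400, 1.20),
}


def rate(model: str, input_share: float) -> float:
    """Effective $/Mtok when `input_share` of all tokens are input tokens."""
    if not 0.0 <= input_share <= 1.0:
        raise ValueError("input_share must be between 0 and 1")
    input_price, output_price = PRICES[model]
    return input_price * input_share + output_price * (1.0 - input_share)


def savings_pct(input_share: float) -> float:
    """How much cheaper Granite 4 H Large is than Qwen-Plus, in percent."""
    g = rate("Granite 4 H Large", input_share)
    q = rate("Qwen-Plus", input_share)
    return (q - g) / q * 100.0


for share in (0.0, 0.5, 0.8, 1.0):
    print(f"{share:.0%} input: Granite 4 H Large is {savings_pct(share):.1f}% cheaper")
```

Under these assumptions an output-only workload shows no difference between the two models, while an input-only workload shows the full 25% gap between the input prices.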