LIVE Cheapest: GLM-4.7-Flash $0.000/Mtok in 153 models tracked Updated Jun 25, 2026
Jun 25, 2026
ModelPriceWatch$/Mtok
Pricing / Compare / GLM-OCR vs Granite 4.0 Micro

GLM-OCR vs Granite 4.0 Micro

Side-by-side API pricing comparison · Z.AI vs IBM

🏆 GLM-OCR is 115% cheaper on blended cost ($0.030 vs $0.065/Mtok)

GLM-OCR

by Z.AI

Current budget Open weights
Input
$0.030/Mtok
Output
$0.030/Mtok
✓ Cheaper
Blended avg$0.030/Mtok
Context128K tokens
Modalitytext, image
ParametersProprietary
ReleasedJun 1, 2025
Full details →

Granite 4.0 Micro

by IBM

Current budget Open weights
Input
$0.017/Mtok
Output
$0.112/Mtok
Blended avg$0.065/Mtok
Context128K tokens
Modalitytext
ParametersProprietary
ReleasedOct 19, 2025
Full details →

Cost at scale — 1M tokens (50/50 input/output)

VolumeGLM-OCRGranite 4.0 MicroSavings
1M tokens $0.03 $0.06 $0.03 (46.5%)
10M tokens $0.3 $0.65 $0.35 (54.3%)
100M tokens $3 $6.45 $3.45 (53.5%)
1000M tokens $30 $64.5 $34.5 (53.5%)

Summary

GLM-OCR by Z.AI costs $0.030/Mtok input and $0.030/Mtok output, with a 128K-token context window. It supports text, image input.

Granite 4.0 Micro by IBM costs $0.017/Mtok input and $0.112/Mtok output, with a 128K-token context window. It supports text input.

On a blended cost basis, GLM-OCR is 115% cheaper than Granite 4.0 Micro.

Note: Pricing is per million tokens. Actual costs vary with usage patterns, prompt caching, and batch discounts. Always verify against official provider pricing pages.