GLM-5.2 vs Gemini 3.5 Flash
Side-by-side API pricing comparison · Zhipu vs Google
🏆
GLM-5.2 is 81% cheaper on blended cost ($2.90 vs $5.25/Mtok)
GLM-5.2
by Zhipu
Current flagship Open weightsInput
$1.40/Mtok
Output
$4.40/Mtok
✓ Cheaper
| Blended avg | $2.90/Mtok |
|---|---|
| Cached input | $0.260/Mtok |
| Context | 128K tokens |
| Modality | text |
| Parameters | Proprietary |
| Released | Jan 1, 2026 |
Gemini 3.5 Flash
by Google
Current flagshipInput
$1.50/Mtok
Output
$9.00/Mtok
| Blended avg | $5.25/Mtok |
|---|---|
| Context | 1M tokens |
| Modality | text, image, audio, video |
| Parameters | Proprietary |
| Released | Mar 1, 2026 |
Cost at scale — 1M tokens (50/50 input/output)
| Volume | GLM-5.2 | Gemini 3.5 Flash | Savings |
|---|---|---|---|
| 1M tokens | $2.9 | $5.25 | $2.35 (44.8%) |
| 10M tokens | $29 | $52.5 | $23.5 (44.8%) |
| 100M tokens | $290 | $525 | $235 (44.8%) |
| 1000M tokens | $2900 | $5250 | $2350 (44.8%) |
Summary
GLM-5.2 by Zhipu costs $1.40/Mtok input and $4.40/Mtok output, with a 128K-token context window. It supports text input.
Gemini 3.5 Flash by Google costs $1.50/Mtok input and $9.00/Mtok output, with a 1M-token context window. It supports text, image, audio, video input.
On a blended cost basis, GLM-5.2 is 81% cheaper than Gemini 3.5 Flash.
Note: Pricing is per million tokens. Actual costs vary with usage patterns, prompt caching, and batch discounts. Always verify against official provider pricing pages.