GLM-5.2 vs Gemini 3.5 Flash

Side-by-side API pricing comparison · Zhipu vs Google

🏆 GLM-5.2 is 44.8% cheaper on blended cost ($2.90 vs $5.25/Mtok)

GLM-5.2

by Zhipu

Current flagship · Open weights

Input

$1.40/Mtok

Output

$4.40/Mtok

✓ Cheaper

Blended avg	$2.90/Mtok
Cached input	$0.260/Mtok
Context	128K tokens
Modality	text
Parameters	Proprietary
Released	Jan 1, 2026

Gemini 3.5 Flash

by Google

Current flagship

Input

$1.50/Mtok

Output

$9.00/Mtok

Blended avg	$5.25/Mtok
Context	1M tokens
Modality	text, image, audio, video
Parameters	Proprietary
Released	Mar 1, 2026

Cost at scale (50/50 input/output split)

Volume	GLM-5.2	Gemini 3.5 Flash	Savings
1M tokens	$2.90	$5.25	$2.35 (44.8%)
10M tokens	$29.00	$52.50	$23.50 (44.8%)
100M tokens	$290.00	$525.00	$235.00 (44.8%)
1B tokens	$2,900.00	$5,250.00	$2,350.00 (44.8%)
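
The table can be reproduced from the per-token prices listed above. The following is a minimal sketch, not an official calculator: the `PRICES` dictionary and the `cost_usd` function are illustrative names, and the 50/50 split is the same assumption the table makes.

```python
# Prices in USD per million tokens, copied from the comparison above.
PRICES = {
    "GLM-5.2": {"input": 1.40, "output": 4.40},
    "Gemini 3.5 Flash": {"input": 1.50, "output": 9.00},
}


def cost_usd(model, total_mtok, input_share=0.5):
    """Cost of total_mtok million tokens, split between input and output."""
    price = PRICES[model]
    input_mtok = total_mtok * input_share
    output_mtok = total_mtok - input_mtok
    return input_mtok * price["input"] + output_mtok * price["output"]


for volume in (1, 10, 100, 1000):
    glm = cost_usd("GLM-5.2", volume)
    gemini = cost_usd("Gemini 3.5 Flash", volume)
    saved = gemini - glm
    print(
        f"{volume}M tokens: ${glm:,.2f} vs ${gemini:,.2f}, "
        f"saves ${saved:,.2f} ({saved / gemini:.1%})"
    )
```

Changing `input_share` shows how sensitive the gap is to workload shape: because the input prices are close and the output prices are not, input-heavy workloads narrow the savings and output-heavy workloads widen them.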

Summary

GLM-5.2 by Zhipu costs $1.40/Mtok input and $4.40/Mtok output, with a 128K-token context window. It accepts text input only.

Gemini 3.5 Flash by Google costs $1.50/Mtok input and $9.00/Mtok output, with a 1M-token context window. It accepts text, image, audio, and video input.

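On a blended cost basis (the simple average of input and output prices), GLM-5.2 is 44.8% cheaper than Gemini 3.5 Flash ($2.90 vs $5.25/Mtok). Put the other way, Gemini 3.5 Flash costs about 81% more than GLM-5.2.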
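
The size of the percentage depends on which model's price is used as the baseline. A minimal sketch of both directions, using only the input and output prices listed above (the helper name `blended` is illustrative):

```python
def blended(input_price, output_price):
    """Simple average of input and output price, in USD per million tokens."""
    return (input_price + output_price) / 2


glm = blended(1.40, 4.40)      # GLM-5.2
gemini = blended(1.50, 9.00)   # Gemini 3.5 Flash

# "Cheaper" is measured against the more expensive model's price.
glm_cheaper_by = (gemini - glm) / gemini

# "More expensive" is measured against the cheaper model's price.
gemini_pricier_by = (gemini - glm) / glm

print(f"GLM-5.2 blended: ${glm:.2f}/Mtok, Gemini 3.5 Flash blended: ${gemini:.2f}/Mtok")
print(f"GLM-5.2 is {glm_cheaper_by:.1%} cheaper")
print(f"Gemini 3.5 Flash is {gemini_pricier_by:.1%} more expensive")
```

The same $2.35 gap reads as roughly 45% when divided by $5.25 and roughly 81% when divided by $2.90, so the two figures describe one difference from opposite baselines.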

Note: Pricing is per million tokens. Actual costs vary with usage patterns, prompt caching, and batch discounts. Always verify against official provider pricing pages.
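
As a rough illustration of how prompt caching changes the picture, the sketch below estimates an effective input price for GLM-5.2 from the standard and cached input rates listed above. It assumes that cached tokens are billed at the cached rate in place of the standard input rate, and the cache hit rates are hypothetical values chosen for illustration, not measurements. No cached rate is listed for Gemini 3.5 Flash here, so it is left out.

```python
def effective_input_price(standard, cached, cache_hit_rate):
    """Weighted input price in USD/Mtok for a given share of cached tokens."""
    if not 0.0 <= cache_hit_rate <= 1.0:
        raise ValueError("cache_hit_rate must be between 0 and 1")
    return cache_hit_rate * cached + (1.0 - cache_hit_rate) * standard


GLM_INPUT = 1.40          # USD/Mtok, standard input price from the table
GLM_CACHED_INPUT = 0.26   # USD/Mtok, cached input price from the table

# Hypothetical cache hit rates; real values depend on the workload.
for rate in (0.0, 0.5, 0.9):
    price = effective_input_price(GLM_INPUT, GLM_CACHED_INPUT, rate)
    print(f"{rate:.0%} cache hits: ${price:.3f}/Mtok input")
```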
