GLM-5.2 vs Qwen3-Max

Side-by-side API pricing comparison · Z.AI vs Alibaba

🏆 GLM-5.2 is 3.3% cheaper on blended cost ($2.90 vs $3.00/Mtok)

GLM-5.2

by Z.AI

Current flagship · Open weights

Input

$1.40/Mtok

Output

$4.40/Mtok

✓ Cheaper

Blended avg	$2.90/Mtok
Cached input	$0.260/Mtok
Context	128K tokens
Modality	text
Parameters	Proprietary
Released	Jan 1, 2026

Qwen3-Max

by Alibaba

Current flagship

Input

$1.20/Mtok

Output

$4.80/Mtok

Blended avg	$3.00/Mtok
Context	262K tokens
Modality	text
Parameters	Proprietary
Released	Jan 1, 2026

Cost at scale (50/50 input/output split)

Volume	GLM-5.2	Qwen3-Max	Savings
1M tokens	$2.90	$3.00	$0.10 (3.3%)
10M tokens	$29.00	$30.00	$1.00 (3.3%)
100M tokens	$290.00	$300.00	$10.00 (3.3%)
1,000M tokens	$2,900.00	$3,000.00	$100.00 (3.3%)
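
The figures in this table can be reproduced with a few lines of arithmetic. The following is a minimal sketch, assuming the list prices shown on this page; the function names and the `input_share` parameter are illustrative and not part of either provider's API.

```python
# List prices from this page, in USD per million tokens: (input, output).
GLM_5_2 = (1.40, 4.40)
QWEN3_MAX = (1.20, 4.80)


def blended_cost(input_price, output_price, total_mtok, input_share=0.5):
    """Cost in USD for total_mtok million tokens.

    input_share is the fraction of tokens that are input (0.5 = 50/50 split).
    """
    input_mtok = total_mtok * input_share
    output_mtok = total_mtok * (1 - input_share)
    return input_mtok * input_price + output_mtok * output_price


def compare(total_mtok, input_share=0.5):
    """Return (glm_cost, qwen_cost, savings_usd, savings_pct_of_qwen)."""
    glm = blended_cost(*GLM_5_2, total_mtok, input_share)
    qwen = blended_cost(*QWEN3_MAX, total_mtok, input_share)
    saved = qwen - glm
    return glm, qwen, saved, saved / qwen * 100


for volume in (1, 10, 100, 1000):
    glm, qwen, saved, pct = compare(volume)
    print(f"{volume}M tokens: ${glm:,.2f} vs ${qwen:,.2f}, saves ${saved:,.2f} ({pct:.1f}%)")
```

The savings percentage is measured against the Qwen3-Max cost. The result depends on the input/output mix: at these list prices the two models break even when about two thirds of tokens are input, and Qwen3-Max comes out cheaper for more input-heavy workloads (for example, `compare(1, input_share=0.8)`).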

Summary

GLM-5.2 by Z.AI costs $1.40/Mtok input and $4.40/Mtok output, with a 128K-token context window. It supports text input.

Qwen3-Max by Alibaba costs $1.20/Mtok input and $4.80/Mtok output, with a 262K-token context window. It supports text input.

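On a blended cost basis (50/50 input/output), GLM-5.2 is 3.3% cheaper than Qwen3-Max ($2.90 vs $3.00/Mtok). Qwen3-Max has the lower input price and the larger context window.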

Note: Pricing is per million tokens. Actual costs vary with usage patterns, prompt caching, and batch discounts. Always verify against official provider pricing pages.
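
As a rough illustration of how prompt caching changes the picture, the sketch below estimates an effective input price for GLM-5.2 from its listed cached-input rate ($0.260/Mtok). The cache hit rate is an assumed workload parameter, not a figure from either provider, and the function name is hypothetical. No cached rate is listed here for Qwen3-Max, so it is left out.

```python
def effective_input_price(input_price, cached_price, cache_hit_rate):
    """Average input price per Mtok when a fraction of input tokens is served from cache.

    cache_hit_rate is an assumption about the workload (0.0 = no caching, 1.0 = all cached).
    """
    if not 0.0 <= cache_hit_rate <= 1.0:
        raise ValueError("cache_hit_rate must be between 0 and 1")
    return cache_hit_rate * cached_price + (1 - cache_hit_rate) * input_price


# GLM-5.2 list prices from this page (USD per million tokens).
glm_input, glm_cached = 1.40, 0.26

for hit_rate in (0.0, 0.25, 0.5, 0.75):
    price = effective_input_price(glm_input, glm_cached, hit_rate)
    print(f"hit rate {hit_rate:.0%}: effective input ${price:.3f}/Mtok")
```

The effective price can be passed to a blended-cost calculation in place of the list input price to see how caching shifts the comparison for a given workload.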
