GLM-5.2 vs Qwen3-Max
Side-by-side API pricing comparison · Zhipu vs Alibaba
🏆
GLM-5.2 is 3.4% cheaper on blended cost ($2.90 vs $3.00/Mtok)
GLM-5.2
by Zhipu
Current flagship Open weightsInput
$1.40/Mtok
Output
$4.40/Mtok
✓ Cheaper
| Blended avg | $2.90/Mtok |
|---|---|
| Cached input | $0.260/Mtok |
| Context | 128K tokens |
| Modality | text |
| Parameters | Proprietary |
| Released | Jan 1, 2026 |
Qwen3-Max
by Alibaba
Current flagshipInput
$1.20/Mtok
Output
$4.80/Mtok
| Blended avg | $3.00/Mtok |
|---|---|
| Context | 262K tokens |
| Modality | text |
| Parameters | Proprietary |
| Released | Jan 1, 2026 |
Cost at scale — 1M tokens (50/50 input/output)
| Volume | GLM-5.2 | Qwen3-Max | Savings |
|---|---|---|---|
| 1M tokens | $2.9 | $3 | $0.1 (3.3%) |
| 10M tokens | $29 | $30 | $1 (3.3%) |
| 100M tokens | $290 | $300 | $10 (3.3%) |
| 1000M tokens | $2900 | $3000 | $100 (3.3%) |
Summary
GLM-5.2 by Zhipu costs $1.40/Mtok input and $4.40/Mtok output, with a 128K-token context window. It supports text input.
Qwen3-Max by Alibaba costs $1.20/Mtok input and $4.80/Mtok output, with a 262K-token context window. It supports text input.
On a blended cost basis, GLM-5.2 is 3.4% cheaper than Qwen3-Max.
Note: Pricing is per million tokens. Actual costs vary with usage patterns, prompt caching, and batch discounts. Always verify against official provider pricing pages.