Updated Jul 4, 2026

GLM-5.2 vs Command A

Side-by-side comparison of API pricing, specs, benchmarks, and capabilities

GLM-5.2 is 53.6% cheaper on blended cost ($2.90 vs $6.25/Mtok)
 
GLM-5.2 by Z.AI (2 providers: Together $1.40/$4.40, Z.AI $1.40/$4.40)
Command A by Cohere
Field | GLM-5.2 | Command A

Overview
Status | Current flagship, open weights | Current flagship
Released | Jun 13, 2026 | Mar 13, 2025

Pricing per million tokens
Input | $1.40/Mtok | $2.50/Mtok
Output | $4.40/Mtok | $10.00/Mtok
Blended avg | $2.90/Mtok | $6.25/Mtok
Cached input | $0.260/Mtok | -

Specifications
Context window | 1M tokens | 256K tokens
Parameters | Proprietary | 111B
Speed (TPS) | - | -
Input modalities | text | text

Benchmarks (sources: Z.AI model card / Cohere model card)
Avg benchmark score | - | -
Perf / dollar | - | -
GPQA Diamond | 68 | 50
SWE-Bench | 45 | 25
HumanEval | 84 | 78
MATH 500 | 76 | 62
Providers
GLM-5.2: Together ($1.40/$4.40 per Mtok), Z.AI ($1.40/$4.40 per Mtok)
Command A: Cohere ($2.50/$10.00 per Mtok)

Cost at scale — 1M tokens (50/50 input/output)

Volume | GLM-5.2 | Command A | Savings
1M tokens | $2.90 | $6.25 | $3.35 (53.6%)
10M tokens | $29.00 | $62.50 | $33.50 (53.6%)
100M tokens | $290.00 | $625.00 | $335.00 (53.6%)
1000M tokens | $2,900.00 | $6,250.00 | $3,350.00 (53.6%)
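
The figures in this table can be reproduced from the listed input and output prices. The sketch below assumes the blended price is a plain weighted average of the two prices, which matches the 50/50 split in the heading and the $2.90 and $6.25 blended figures. The function names and the `input_share` parameter are illustrative and not part of any provider API.

```python
# Prices in USD per million tokens, copied from the pricing rows above.
PRICES = {
    "GLM-5.2": {"input": 1.40, "output": 4.40},
    "Command A": {"input": 2.50, "output": 10.00},
}


def blended_price(model: str, input_share: float = 0.5) -> float:
    """Weighted average price per Mtok for a given input/output mix (assumed formula)."""
    p = PRICES[model]
    return input_share * p["input"] + (1.0 - input_share) * p["output"]


def cost_usd(model: str, total_mtok: float, input_share: float = 0.5) -> float:
    """Total cost in USD for total_mtok million tokens at the given mix."""
    return blended_price(model, input_share) * total_mtok


def savings(total_mtok: float, input_share: float = 0.5) -> tuple[float, float]:
    """Return (USD saved, percent saved) when using GLM-5.2 instead of Command A."""
    cheap = cost_usd("GLM-5.2", total_mtok, input_share)
    dear = cost_usd("Command A", total_mtok, input_share)
    return dear - cheap, 100.0 * (dear - cheap) / dear


if __name__ == "__main__":
    for mtok in (1, 10, 100, 1000):
        saved, pct = savings(mtok)
        print(
            f"{mtok}M tokens: "
            f"${cost_usd('GLM-5.2', mtok):,.2f} vs "
            f"${cost_usd('Command A', mtok):,.2f}, "
            f"saving ${saved:,.2f} ({pct:.1f}%)"
        )
```

The percentage saving depends on the mix: with an input-heavy 80/20 split, the same functions give $2.00 vs $4.00 per Mtok, a 50% saving rather than 53.6%.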

Summary

GLM-5.2 by Z.AI costs $1.40/Mtok input and $4.40/Mtok output, with a 1M-token context window. It supports text input and is available from 2 providers.

Command A by Cohere costs $2.50/Mtok input and $10.00/Mtok output, with a 256K-token context window. It supports text input.

On a blended cost basis, GLM-5.2 is 53.6% cheaper than Command A. It also has a larger context window.

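On benchmarks, GLM-5.2 scores higher on all four listed tests: GPQA Diamond (68 vs 50), SWE-Bench (45 vs 25), HumanEval (84 vs 78), and MATH 500 (76 vs 62). The page lists no average score or performance-per-dollar figure, but because GLM-5.2 both scores higher and costs less on a blended basis, it also offers better performance per dollar.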
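
The average and performance-per-dollar comparison can be checked directly from the four listed benchmark scores. The page does not define either metric, so this sketch assumes the average is an unweighted mean of the four scores and that performance per dollar is that mean divided by the blended price per Mtok. Both definitions, and the function names, are assumptions made for illustration.

```python
from statistics import mean

# Benchmark scores as listed in the comparison (GLM-5.2 vs Command A).
SCORES = {
    "GLM-5.2": {"GPQA Diamond": 68, "SWE-Bench": 45, "HumanEval": 84, "MATH 500": 76},
    "Command A": {"GPQA Diamond": 50, "SWE-Bench": 25, "HumanEval": 78, "MATH 500": 62},
}

# Blended prices in USD per million tokens, from the pricing rows.
BLENDED_USD_PER_MTOK = {"GLM-5.2": 2.90, "Command A": 6.25}


def avg_score(model: str) -> float:
    """Unweighted mean of the listed benchmark scores (assumed definition)."""
    return mean(list(SCORES[model].values()))


def points_per_dollar(model: str) -> float:
    """Average score divided by blended $/Mtok (assumed definition)."""
    return avg_score(model) / BLENDED_USD_PER_MTOK[model]


if __name__ == "__main__":
    for model in SCORES:
        print(f"{model}: avg {avg_score(model):.2f}, "
              f"{points_per_dollar(model):.2f} points per $/Mtok")
```

Under these assumptions the averages are 68.25 for GLM-5.2 and 53.75 for Command A, so GLM-5.2 leads on both the average and the per-dollar measure.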

Note: Pricing is per million tokens. Actual costs vary with usage patterns, prompt caching, and batch discounts. Always verify against official provider pricing pages.
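
As a rough illustration of how prompt caching shifts the input price, the sketch below blends the regular and cached input rates by a cache hit rate. It assumes the single listed cached-input price ($0.260/Mtok) belongs to GLM-5.2, and the hit rate is a hypothetical workload parameter, not a figure from the page or from any provider.

```python
def effective_input_price(regular: float, cached: float, cache_hit_rate: float) -> float:
    """Average input price per Mtok when a fraction of input tokens is served from cache."""
    if not 0.0 <= cache_hit_rate <= 1.0:
        raise ValueError("cache_hit_rate must be between 0 and 1")
    return cache_hit_rate * cached + (1.0 - cache_hit_rate) * regular


if __name__ == "__main__":
    # Assumed: $1.40 regular and $0.260 cached input for GLM-5.2; 60% hit rate is hypothetical.
    price = effective_input_price(regular=1.40, cached=0.260, cache_hit_rate=0.6)
    print(f"Effective input price: ${price:.3f}/Mtok")
```

At a 60% hit rate this works out to about $0.716/Mtok of input instead of $1.40, which is why the listed prices should be treated as an upper bound for cache-friendly workloads.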