LIVE Cheapest paid: Granite 4.0 Micro $0.017/Mtok in 159 models tracked Updated Jul 4, 2026
Jul 4, 2026
ModelPriceWatch$/Mtok
Pricing / Compare / Qwen3-Max vs Command A

Qwen3-Max vs Command A

Side-by-side comparison of API pricing, specs, benchmarks, and capabilities

Qwen3-Max is 42.4% cheaper on blended cost ($3.60 vs $6.25/Mtok)
 
Cby Cohere
Overview
StatusCurrent flagship Current flagship
Released Jan 1, 2026 Mar 13, 2025
Pricing per million tokens
Input $1.20/Mtok $2.50/Mtok
Output $6.00/Mtok $10.00/Mtok
Blended avg $3.60/Mtok $6.25/Mtok
Specifications
Context window 262K tokens 256K tokens
Parameters Proprietary 111B
Speed (TPS)
Modalities
Input
text
text
Benchmarks sources: Alibaba model card / Cohere model card
Avg benchmark score
Perf / dollar
GPQA Diamond 70 50
SWE-Bench 48 25
HumanEval 86 78
MATH 500 78 62
Providers
Available from
Alibaba — $1.20/$6.00/Mtok
Cohere — $2.50/$10.00/Mtok

Cost at scale — 1M tokens (50/50 input/output)

VolumeQwen3-MaxCommand ASavings
1M tokens $3.6 $6.25 $2.65 (42.4%)
10M tokens $36 $62.5 $26.5 (42.4%)
100M tokens $360 $625 $265 (42.4%)
1000M tokens $3600 $6250 $2650 (42.4%)

Summary

Qwen3-Max by Alibaba costs $1.20/Mtok input and $6.00/Mtok output, with a 262K-token context window. It supports text input.

Command A by Cohere costs $2.50/Mtok input and $10.00/Mtok output, with a 256K-token context window. It supports text input.

On a blended cost basis, Qwen3-Max is 42.4% cheaper than Command A. It also has a larger context window.

On benchmarks, Command A scores higher ( vs ) on average. In terms of value, Command A has better performance per dollar ( vs ).

Note: Pricing is per million tokens. Actual costs vary with usage patterns, prompt caching, and batch discounts. Always verify against official provider pricing pages.