Qwen3-Max vs Gemini 3.5 Flash
Side-by-side API pricing comparison · Alibaba vs Google
🏆
Qwen3-Max is 75% cheaper on blended cost ($3.00 vs $5.25/Mtok)
Qwen3-Max
by Alibaba
Current flagshipInput
$1.20/Mtok
Output
$4.80/Mtok
✓ Cheaper
| Blended avg | $3.00/Mtok |
|---|---|
| Context | 262K tokens |
| Modality | text |
| Parameters | Proprietary |
| Released | Jan 1, 2026 |
Gemini 3.5 Flash
by Google
Current flagshipInput
$1.50/Mtok
Output
$9.00/Mtok
| Blended avg | $5.25/Mtok |
|---|---|
| Context | 1M tokens |
| Modality | text, image, audio, video |
| Parameters | Proprietary |
| Released | Mar 1, 2026 |
Cost at scale — 1M tokens (50/50 input/output)
| Volume | Qwen3-Max | Gemini 3.5 Flash | Savings |
|---|---|---|---|
| 1M tokens | $3 | $5.25 | $2.25 (42.9%) |
| 10M tokens | $30 | $52.5 | $22.5 (42.9%) |
| 100M tokens | $300 | $525 | $225 (42.9%) |
| 1000M tokens | $3000 | $5250 | $2250 (42.9%) |
Summary
Qwen3-Max by Alibaba costs $1.20/Mtok input and $4.80/Mtok output, with a 262K-token context window. It supports text input.
Gemini 3.5 Flash by Google costs $1.50/Mtok input and $9.00/Mtok output, with a 1M-token context window. It supports text, image, audio, video input.
On a blended cost basis, Qwen3-Max is 75% cheaper than Gemini 3.5 Flash.
Note: Pricing is per million tokens. Actual costs vary with usage patterns, prompt caching, and batch discounts. Always verify against official provider pricing pages.