Gemini 3.5 Flash vs Command A
Side-by-side comparison of API pricing, specs, benchmarks, and capabilities
|
Gby Google
|
Cby Cohere
|
|
|---|---|---|
| Overview | ||
| Status | Current flagship | Current flagship |
| Released | May 20, 2026 | Mar 13, 2025 |
| Pricing per million tokens | ||
| Input | $1.50/Mtok | $2.50/Mtok |
| Output | $9.00/Mtok | $10.00/Mtok |
| Blended avg | $5.25/Mtok | $6.25/Mtok |
| Specifications | ||
| Context window | 1M tokens | 256K tokens |
| Parameters | Proprietary | 111B |
| Speed (TPS) | — | — |
| Modalities | ||
| Input |
textimageaudiovideo
|
text
|
| Benchmarks sources: Google model card / Cohere model card | ||
| Avg benchmark score | ||
| Perf / dollar | ||
| GPQA Diamond | 72 | 50 |
| SWE-Bench | 55 | 25 |
| HumanEval | 88 | 78 |
| MATH 500 | 82 | 62 |
| Providers | ||
| Available from |
Google — $1.50/$9.00/Mtok
|
Cohere — $2.50/$10.00/Mtok
|
Cost at scale — 1M tokens (50/50 input/output)
| Volume | Gemini 3.5 Flash | Command A | Savings |
|---|---|---|---|
| 1M tokens | $5.25 | $6.25 | $1 (16%) |
| 10M tokens | $52.5 | $62.5 | $10 (16%) |
| 100M tokens | $525 | $625 | $100 (16%) |
| 1000M tokens | $5250 | $6250 | $1000 (16%) |
Summary
Gemini 3.5 Flash by Google costs $1.50/Mtok input and $9.00/Mtok output, with a 1M-token context window. It supports text, image, audio, video input.
Command A by Cohere costs $2.50/Mtok input and $10.00/Mtok output, with a 256K-token context window. It supports text input.
On a blended cost basis, Gemini 3.5 Flash is 16% cheaper than Command A. It also has a larger context window.
On benchmarks, Command A scores higher ( vs ) on average. In terms of value, Command A has better performance per dollar ( vs ).
Note: Pricing is per million tokens. Actual costs vary with usage patterns, prompt caching, and batch discounts. Always verify against official provider pricing pages.