LIVE Cheapest paid: Granite 4.0 Micro $0.017/Mtok in 159 models tracked Updated Jul 4, 2026
Jul 4, 2026
ModelPriceWatch$/Mtok
Pricing / Compare / Gemini 3.5 Flash vs Command A

Gemini 3.5 Flash vs Command A

Side-by-side comparison of API pricing, specs, benchmarks, and capabilities

Gemini 3.5 Flash is 16% cheaper on blended cost ($5.25 vs $6.25/Mtok)
 
Gby Google
Cby Cohere
Overview
StatusCurrent flagship Current flagship
Released May 20, 2026 Mar 13, 2025
Pricing per million tokens
Input $1.50/Mtok $2.50/Mtok
Output $9.00/Mtok $10.00/Mtok
Blended avg $5.25/Mtok $6.25/Mtok
Specifications
Context window 1M tokens 256K tokens
Parameters Proprietary 111B
Speed (TPS)
Modalities
Input
textimageaudiovideo
text
Benchmarks sources: Google model card / Cohere model card
Avg benchmark score
Perf / dollar
GPQA Diamond 72 50
SWE-Bench 55 25
HumanEval 88 78
MATH 500 82 62
Providers
Available from
Google — $1.50/$9.00/Mtok
Cohere — $2.50/$10.00/Mtok

Cost at scale — 1M tokens (50/50 input/output)

VolumeGemini 3.5 FlashCommand ASavings
1M tokens $5.25 $6.25 $1 (16%)
10M tokens $52.5 $62.5 $10 (16%)
100M tokens $525 $625 $100 (16%)
1000M tokens $5250 $6250 $1000 (16%)

Summary

Gemini 3.5 Flash by Google costs $1.50/Mtok input and $9.00/Mtok output, with a 1M-token context window. It supports text, image, audio, video input.

Command A by Cohere costs $2.50/Mtok input and $10.00/Mtok output, with a 256K-token context window. It supports text input.

On a blended cost basis, Gemini 3.5 Flash is 16% cheaper than Command A. It also has a larger context window.

On benchmarks, Command A scores higher ( vs ) on average. In terms of value, Command A has better performance per dollar ( vs ).

Note: Pricing is per million tokens. Actual costs vary with usage patterns, prompt caching, and batch discounts. Always verify against official provider pricing pages.