LIVE Cheapest: GLM-4.7-Flash $0/Mtok in 154 models tracked Updated Jul 1, 2026
Jul 1, 2026
ModelPriceWatch$/Mtok
Pricing / Compare / Claude Sonnet 5 vs Gemini 3.5 Flash

Claude Sonnet 5 vs Gemini 3.5 Flash

Side-by-side comparison of API pricing, specs, benchmarks, and capabilities

🏆 Gemini 3.5 Flash is 14.3% cheaper on blended cost ($5.25 vs $6.00/Mtok)
 
Overview
Status Current mid tier Current flagship
Released Jun 30, 2026 May 20, 2026
Pricing per million tokens
Input $2.00/Mtok $1.50/Mtok
Output $10.00/Mtok $9.00/Mtok
Blended avg $6.00/Mtok $5.25/Mtok
Cached input $0.200/Mtok
Specifications
Context window 1M tokens 1M tokens
Parameters Proprietary Proprietary
Speed (TPS)
Modalities
Input
textimage
textimageaudiovideo
Benchmarks from Vellum LLM Leaderboard
Avg benchmark score
Perf / dollar
GPQA Diamond 72
SWE-Bench 55
HumanEval 88
MATH 500 82
Providers
Available from
Anthropic — $2.00/$10.00/Mtok
Google — $1.50/$9.00/Mtok

Cost at scale — 1M tokens (50/50 input/output)

VolumeClaude Sonnet 5Gemini 3.5 FlashSavings
1M tokens $6 $5.25 $0.75 (12.5%)
10M tokens $60 $52.5 $7.5 (12.5%)
100M tokens $600 $525 $75 (12.5%)
1000M tokens $6000 $5250 $750 (12.5%)

Summary

Claude Sonnet 5 by Anthropic costs $2.00/Mtok input and $10.00/Mtok output, with a 1M-token context window. It supports text, image input.

Gemini 3.5 Flash by Google costs $1.50/Mtok input and $9.00/Mtok output, with a 1M-token context window. It supports text, image, audio, video input.

On a blended cost basis, Gemini 3.5 Flash is 14.3% cheaper than Claude Sonnet 5.

On benchmarks, Gemini 3.5 Flash scores higher ( vs ) on average. In terms of value, Gemini 3.5 Flash has better performance per dollar ( vs ).

Note: Pricing is per million tokens. Actual costs vary with usage patterns, prompt caching, and batch discounts. Always verify against official provider pricing pages.