LIVE Cheapest: GLM-4.7-Flash $0/Mtok in 154 models tracked Updated Jul 1, 2026
Jul 1, 2026
ModelPriceWatch$/Mtok
Pricing / Compare / Gemini 3.1 Pro vs Gemini 2.5 Flash

Gemini 3.1 Pro vs Gemini 2.5 Flash

Side-by-side comparison of API pricing, specs, benchmarks, and capabilities

🏆 Gemini 2.5 Flash is 3633.3% cheaper on blended cost ($0.188 vs $7.00/Mtok)
 
Overview
Status Current flagship Current budget
Released Nov 1, 2025 Jun 17, 2025
Pricing per million tokens
Input $2.00/Mtok $0.075/Mtok
Output $12.00/Mtok $0.300/Mtok
Blended avg $7.00/Mtok $0.188/Mtok
Specifications
Context window 2M tokens 1M tokens
Parameters Proprietary Proprietary
Speed (TPS)
Modalities
Input
textimageaudiovideo
textimageaudiovideo
Benchmarks from Vellum LLM Leaderboard
Avg benchmark score 79.3 53.7
Perf / dollar 11.3 286.4
GPQA Diamond 88 55
SWE-Bench 75 38
HumanEval 93 82
MATH 500 95 72
Providers
Available from
Google — $2.00/$12.00/Mtok
Google — $0.075/$0.300/Mtok

Cost at scale — 1M tokens (50/50 input/output)

VolumeGemini 3.1 ProGemini 2.5 FlashSavings
1M tokens $7 $0.19 $6.81 (97.3%)
10M tokens $70 $1.88 $68.13 (97.3%)
100M tokens $700 $18.75 $681.25 (97.3%)
1000M tokens $7000 $187.5 $6812.5 (97.3%)

Summary

Gemini 3.1 Pro by Google costs $2.00/Mtok input and $12.00/Mtok output, with a 2M-token context window. It supports text, image, audio, video input.

Gemini 2.5 Flash by Google costs $0.075/Mtok input and $0.300/Mtok output, with a 1M-token context window. It supports text, image, audio, video input.

On a blended cost basis, Gemini 2.5 Flash is 3633.3% cheaper than Gemini 3.1 Pro.

On benchmarks, Gemini 3.1 Pro scores higher (79.3 vs 53.7) on average. In terms of value, Gemini 2.5 Flash has better performance per dollar (286.4 vs 11.3).

Note: Pricing is per million tokens. Actual costs vary with usage patterns, prompt caching, and batch discounts. Always verify against official provider pricing pages.