LIVE Cheapest: GLM-4.7-Flash $0/Mtok in 154 models tracked Updated Jul 1, 2026
Jul 1, 2026
ModelPriceWatch$/Mtok
Pricing / Compare / Claude Opus 4.8 vs Gemini 3.1 Pro

Claude Opus 4.8 vs Gemini 3.1 Pro

Side-by-side comparison of API pricing, specs, benchmarks, and capabilities

🏆 Gemini 3.1 Pro is 114.3% cheaper on blended cost ($7.00 vs $15.00/Mtok)
 
Overview
Status Current flagship Current flagship
Released May 28, 2026 Nov 1, 2025
Pricing per million tokens
Input $5.00/Mtok $2.00/Mtok
Output $25.00/Mtok $12.00/Mtok
Blended avg $15.00/Mtok $7.00/Mtok
Cached input $0.500/Mtok
Specifications
Context window 200K tokens 2M tokens
Parameters Proprietary Proprietary
Speed (TPS)
Modalities
Input
textimage
textimageaudiovideo
Benchmarks from Vellum LLM Leaderboard
Avg benchmark score 80 79.3
Perf / dollar 5.3 11.3
GPQA Diamond 93.6 88
SWE-Bench 88.6 75
HumanEval 93
MATH 500 95
Providers
Available from
Anthropic — $5.00/$25.00/Mtok
Google — $2.00/$12.00/Mtok

Cost at scale — 1M tokens (50/50 input/output)

VolumeClaude Opus 4.8Gemini 3.1 ProSavings
1M tokens $15 $7 $8 (53.3%)
10M tokens $150 $70 $80 (53.3%)
100M tokens $1500 $700 $800 (53.3%)
1000M tokens $15000 $7000 $8000 (53.3%)

Summary

Claude Opus 4.8 by Anthropic costs $5.00/Mtok input and $25.00/Mtok output, with a 200K-token context window. It supports text, image input.

Gemini 3.1 Pro by Google costs $2.00/Mtok input and $12.00/Mtok output, with a 2M-token context window. It supports text, image, audio, video input.

On a blended cost basis, Gemini 3.1 Pro is 114.3% cheaper than Claude Opus 4.8. It also has a larger context window.

On benchmarks, Claude Opus 4.8 scores higher (80 vs 79.3) on average. In terms of value, Gemini 3.1 Pro has better performance per dollar (11.3 vs 5.3).

Note: Pricing is per million tokens. Actual costs vary with usage patterns, prompt caching, and batch discounts. Always verify against official provider pricing pages.