Updated Jul 1, 2026

o4-mini vs Gemini 2.5 Pro

Side-by-side comparison of API pricing, specs, benchmarks, and capabilities

🏆 o4-mini is 51.1% cheaper on blended cost ($2.75 vs $5.63/Mtok)
 
Overview                     o4-mini              Gemini 2.5 Pro
Status                       Current reasoning    Current mid tier
Released                     Apr 17, 2025         Mar 1, 2025

Pricing per million tokens   o4-mini              Gemini 2.5 Pro
Input                        $1.10/Mtok           $1.25/Mtok
Output                       $4.40/Mtok           $10.00/Mtok
Blended avg                  $2.75/Mtok           $5.63/Mtok
Cached input $0.550/Mtok

Specifications               o4-mini              Gemini 2.5 Pro
Context window               200K tokens          2M tokens
Parameters                   Proprietary          Proprietary
Speed (TPS)

Modalities                   o4-mini              Gemini 2.5 Pro
Input                        text, image          text, image, audio, video

Benchmarks from Vellum LLM Leaderboard
                             o4-mini              Gemini 2.5 Pro
Avg benchmark score          73.7                 62.8
Perf / dollar                26.8                 11.2
GPQA Diamond                 82                   68
SWE-Bench                    68                   50
HumanEval                    92                   87
MATH 500                     90                   80
Providers
Available from
OpenAI: $1.10 input / $4.40 output per Mtok
Google: $1.25 input / $10.00 output per Mtok

Cost at scale (50/50 input/output split)

Volume          o4-mini      Gemini 2.5 Pro   Savings
1M tokens       $2.75        $5.63            $2.88 (51.1%)
10M tokens      $27.50       $56.25           $28.75 (51.1%)
100M tokens     $275.00      $562.50          $287.50 (51.1%)
1,000M tokens   $2,750.00    $5,625.00        $2,875.00 (51.1%)
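
The table values can be reproduced with a short calculation. The sketch below assumes the blended rate is a token-weighted average of the input and output prices, which matches the $2.75 and $5.63 (unrounded: $5.625) figures at a 50/50 split. The function names and the `input_share` parameter are illustrative and not part of any provider API.

```python
# Prices in USD per million tokens, taken from the pricing table above.
PRICES = {
    "o4-mini": {"input": 1.10, "output": 4.40},
    "Gemini 2.5 Pro": {"input": 1.25, "output": 10.00},
}


def blended_rate(model: str, input_share: float = 0.5) -> float:
    """Token-weighted average price per Mtok (assumed definition of 'blended')."""
    p = PRICES[model]
    return input_share * p["input"] + (1.0 - input_share) * p["output"]


def cost_usd(model: str, total_mtok: float, input_share: float = 0.5) -> float:
    """Cost of `total_mtok` million tokens at the given input/output split."""
    return total_mtok * blended_rate(model, input_share)


# Recompute each row of the cost-at-scale table.
for mtok in (1, 10, 100, 1000):
    cheap = cost_usd("o4-mini", mtok)
    dear = cost_usd("Gemini 2.5 Pro", mtok)
    saving = dear - cheap
    print(f"{mtok:>5}M tokens  ${cheap:>9,.2f}  ${dear:>9,.2f}  "
          f"${saving:>9,.2f} ({saving / dear:.1%})")
```

Because the two models differ mostly in output price, a prompt-heavy split such as `input_share=0.8` narrows the gap, while an output-heavy split widens it.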

Summary

o4-mini by OpenAI costs $1.10/Mtok input and $4.40/Mtok output, with a 200K-token context window. It accepts text and image input.

Gemini 2.5 Pro by Google costs $1.25/Mtok input and $10.00/Mtok output, with a 2M-token context window. It accepts text, image, audio, and video input.

On a blended cost basis, o4-mini is 51.1% cheaper than Gemini 2.5 Pro ($2.75 vs $5.63/Mtok). Put the other way, Gemini 2.5 Pro costs 104.5% more than o4-mini.
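
The price gap can be stated as two different percentages depending on which price is the baseline, and the two are easy to mix up. A minimal sketch using the blended rates above (the function names are illustrative):

```python
def percent_cheaper(low: float, high: float) -> float:
    """Saving relative to the more expensive price (cannot exceed 100%)."""
    return (high - low) / high * 100.0


def percent_more_expensive(low: float, high: float) -> float:
    """Premium relative to the cheaper price (can exceed 100%)."""
    return (high - low) / low * 100.0


O4_MINI = 2.75         # blended $/Mtok
GEMINI_25_PRO = 5.625  # blended $/Mtok, before rounding to $5.63

# prints "o4-mini is 51.1% cheaper"
print(f"o4-mini is {percent_cheaper(O4_MINI, GEMINI_25_PRO):.1f}% cheaper")
# prints "Gemini 2.5 Pro is 104.5% more expensive"
print(f"Gemini 2.5 Pro is "
      f"{percent_more_expensive(O4_MINI, GEMINI_25_PRO):.1f}% more expensive")
```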

On benchmarks, o4-mini scores higher on average (73.7 vs 62.8). It also delivers better performance per dollar (26.8 vs 11.2).
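
The performance-per-dollar figures are consistent with dividing the average benchmark score by the blended price per Mtok. That definition is inferred from the numbers on this page rather than stated by it, so the sketch below should be read as an assumption:

```python
def perf_per_dollar(avg_score: float, blended_usd_per_mtok: float) -> float:
    """Average benchmark points per blended dollar per Mtok (assumed definition)."""
    return avg_score / blended_usd_per_mtok


print(round(perf_per_dollar(73.7, 2.75), 1))   # o4-mini: 26.8
print(round(perf_per_dollar(62.8, 5.625), 1))  # Gemini 2.5 Pro: 11.2
```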

Note: Pricing is per million tokens. Actual costs vary with usage patterns, prompt caching, and batch discounts. Always verify against official provider pricing pages.