Which is cheaper: GPT-4.1 mini or Gemini 2.5 Flash?

Gemini 2.5 Flash is cheaper at $0.188/Mtok blended, compared to GPT-4.1 mini at $1.00/Mtok blended — a 433.3% difference.

What is the context window difference between GPT-4.1 mini and Gemini 2.5 Flash?

GPT-4.1 mini has a 1M-token context window, while Gemini 2.5 Flash has 1M tokens.

Pricing / Compare / GPT-4.1 mini vs Gemini 2.5 Flash

GPT-4.1 mini vs Gemini 2.5 Flash

Side-by-side comparison of API pricing, specs, benchmarks, and capabilities

🏆 Gemini 2.5 Flash is 433.3% cheaper on blended cost ($0.188 vs $1.00/Mtok)

🔔 Email me when GPT-4.1 mini gets cheaper

🔔 Email me when Gemini 2.5 Flash gets cheaper

	GPT-4.1 mini by OpenAI	Gemini 2.5 Flash by Google
Overview
Status	Current budget	Current budget
Released	Apr 14, 2025	Jun 17, 2025
Pricing per million tokens
Input	$0.400/Mtok	$0.075/Mtok
Output	$1.60/Mtok	$0.300/Mtok
Blended avg	$1.00/Mtok	$0.188/Mtok
Cached input	$0.200/Mtok	—
Specifications
Context window	1M tokens	1M tokens
Parameters	Proprietary	Proprietary
Speed (TPS)	—	—
Modalities
Input	textimage	textimageaudiovideo
Benchmarks from Vellum LLM Leaderboard
Avg benchmark score	50.6	53.7
Perf / dollar	50.6	286.4
GPQA Diamond	55	55
SWE-Bench	30	38
HumanEval	85	82
MATH 500	70	72
Providers
Available from	OpenAI — $0.400/$1.60/Mtok	Google — $0.075/$0.300/Mtok

Cost at scale — 1M tokens (50/50 input/output)

Volume	GPT-4.1 mini	Gemini 2.5 Flash	Savings
1M tokens	$1	$0.19	$0.81 (81%)
10M tokens	$10	$1.88	$8.13 (81.3%)
100M tokens	$100	$18.75	$81.25 (81.3%)
1000M tokens	$1000	$187.5	$812.5 (81.3%)

Try Gemini 2.5 Flash on Google

The cheaper option here — Gemini 2.5 Flash costs $0.188/Mtok blended on Google.

Get API key →

Summary

GPT-4.1 mini by OpenAI costs $0.400/Mtok input and $1.60/Mtok output, with a 1M-token context window. It supports text, image input.

Gemini 2.5 Flash by Google costs $0.075/Mtok input and $0.300/Mtok output, with a 1M-token context window. It supports text, image, audio, video input.

On a blended cost basis, Gemini 2.5 Flash is 433.3% cheaper than GPT-4.1 mini.

On benchmarks, Gemini 2.5 Flash scores higher (53.7 vs 50.6) on average. In terms of value, Gemini 2.5 Flash has better performance per dollar (286.4 vs 50.6).

Note: Pricing is per million tokens. Actual costs vary with usage patterns, prompt caching, and batch discounts. Always verify against official provider pricing pages.

GPT-4.1 mini vs Gemini 2.5 Flash

Cost at scale — 1M tokens (50/50 input/output)

Summary

More comparisons