LIVE Cheapest: GLM-4.7-Flash $0/Mtok in 154 models tracked Updated Jul 1, 2026
Jul 1, 2026
ModelPriceWatch$/Mtok
Pricing / Compare / o4-mini vs Grok 4

o4-mini vs Grok 4

Side-by-side comparison of API pricing, specs, benchmarks, and capabilities

🏆 o4-mini is 227.3% cheaper on blended cost ($2.75 vs $9.00/Mtok)
 
by xAI
Overview
Status Current reasoning Current flagship
Released Apr 17, 2025 Jul 9, 2025
Pricing per million tokens
Input $1.10/Mtok $3.00/Mtok
Output $4.40/Mtok $15.00/Mtok
Blended avg $2.75/Mtok $9.00/Mtok
Cached input $0.550/Mtok
Specifications
Context window 200K tokens 256K tokens
Parameters Proprietary Proprietary
Speed (TPS)
Modalities
Input
textimage
textimage
Benchmarks from Vellum LLM Leaderboard
Avg benchmark score 73.7 66.2
Perf / dollar 26.8 7.4
GPQA Diamond 82 75
SWE-Bench 68 55
HumanEval 92 87
MATH 500 90 82
Providers
Available from
OpenAI — $1.10/$4.40/Mtok
xAI — $3.00/$15.00/Mtok

Cost at scale — 1M tokens (50/50 input/output)

Volumeo4-miniGrok 4Savings
1M tokens $2.75 $9 $6.25 (69.4%)
10M tokens $27.5 $90 $62.5 (69.4%)
100M tokens $275 $900 $625 (69.4%)
1000M tokens $2750 $9000 $6250 (69.4%)

Summary

o4-mini by OpenAI costs $1.10/Mtok input and $4.40/Mtok output, with a 200K-token context window. It supports text, image input.

Grok 4 by xAI costs $3.00/Mtok input and $15.00/Mtok output, with a 256K-token context window. It supports text, image input.

On a blended cost basis, o4-mini is 227.3% cheaper than Grok 4.

On benchmarks, o4-mini scores higher (73.7 vs 66.2) on average. In terms of value, o4-mini has better performance per dollar (26.8 vs 7.4).

Note: Pricing is per million tokens. Actual costs vary with usage patterns, prompt caching, and batch discounts. Always verify against official provider pricing pages.