LIVE Cheapest: GLM-4.7-Flash $0/Mtok in 154 models tracked Updated Jul 1, 2026
Jul 1, 2026
ModelPriceWatch$/Mtok
Pricing / Compare / o4-mini vs Claude Sonnet 4.6

o4-mini vs Claude Sonnet 4.6

Side-by-side comparison of API pricing, specs, benchmarks, and capabilities

🏆 o4-mini is 227.3% cheaper on blended cost ($2.75 vs $9.00/Mtok)
 
Overview
Status Current reasoning Current mid tier
Released Apr 17, 2025 Jan 15, 2026
Pricing per million tokens
Input $1.10/Mtok $3.00/Mtok
Output $4.40/Mtok $15.00/Mtok
Blended avg $2.75/Mtok $9.00/Mtok
Cached input $0.550/Mtok $0.300/Mtok
Specifications
Context window 200K tokens 200K tokens
Parameters Proprietary Proprietary
Speed (TPS)
Modalities
Input
textimage
textimage
Benchmarks from Vellum LLM Leaderboard
Avg benchmark score 73.7 73.8
Perf / dollar 26.8 8.2
GPQA Diamond 82
SWE-Bench 68
HumanEval 92
MATH 500 90
Providers
Available from
OpenAI — $1.10/$4.40/Mtok
Anthropic — $3.00/$15.00/Mtok

Cost at scale — 1M tokens (50/50 input/output)

Volumeo4-miniClaude Sonnet 4.6Savings
1M tokens $2.75 $9 $6.25 (69.4%)
10M tokens $27.5 $90 $62.5 (69.4%)
100M tokens $275 $900 $625 (69.4%)
1000M tokens $2750 $9000 $6250 (69.4%)

Summary

o4-mini by OpenAI costs $1.10/Mtok input and $4.40/Mtok output, with a 200K-token context window. It supports text, image input.

Claude Sonnet 4.6 by Anthropic costs $3.00/Mtok input and $15.00/Mtok output, with a 200K-token context window. It supports text, image input.

On a blended cost basis, o4-mini is 227.3% cheaper than Claude Sonnet 4.6.

On benchmarks, Claude Sonnet 4.6 scores higher (73.8 vs 73.7) on average. In terms of value, o4-mini has better performance per dollar (26.8 vs 8.2).

Note: Pricing is per million tokens. Actual costs vary with usage patterns, prompt caching, and batch discounts. Always verify against official provider pricing pages.