LIVE Cheapest: GLM-4.7-Flash $0/Mtok in 154 models tracked Updated Jul 1, 2026
Jul 1, 2026
ModelPriceWatch$/Mtok
Pricing / Compare / GPT-5.4 vs Claude Sonnet 4.6

GPT-5.4 vs Claude Sonnet 4.6

Side-by-side comparison of API pricing, specs, benchmarks, and capabilities

🏆 GPT-5.4 is 2.9% cheaper on blended cost ($8.75 vs $9.00/Mtok)
 
Overview
Status Current flagship Current mid tier
Released Aug 15, 2025 Jan 15, 2026
Pricing per million tokens
Input $2.50/Mtok $3.00/Mtok
Output $15.00/Mtok $15.00/Mtok
Blended avg $8.75/Mtok $9.00/Mtok
Cached input $0.250/Mtok $0.300/Mtok
Specifications
Context window 1M tokens 200K tokens
Parameters Proprietary Proprietary
Speed (TPS)
Modalities
Input
textimage
textimage
Benchmarks from Vellum LLM Leaderboard
Avg benchmark score 78.7 73.8
Perf / dollar 9 8.2
GPQA Diamond 88
SWE-Bench 75
HumanEval 94.5
MATH 500 92
Providers
Available from
OpenAI — $2.50/$15.00/Mtok
Anthropic — $3.00/$15.00/Mtok

Cost at scale — 1M tokens (50/50 input/output)

VolumeGPT-5.4Claude Sonnet 4.6Savings
1M tokens $8.75 $9 $0.25 (2.8%)
10M tokens $87.5 $90 $2.5 (2.8%)
100M tokens $875 $900 $25 (2.8%)
1000M tokens $8750 $9000 $250 (2.8%)

Summary

GPT-5.4 by OpenAI costs $2.50/Mtok input and $15.00/Mtok output, with a 1M-token context window. It supports text, image input.

Claude Sonnet 4.6 by Anthropic costs $3.00/Mtok input and $15.00/Mtok output, with a 200K-token context window. It supports text, image input.

On a blended cost basis, GPT-5.4 is 2.9% cheaper than Claude Sonnet 4.6. It also has a larger context window.

On benchmarks, GPT-5.4 scores higher (78.7 vs 73.8) on average. In terms of value, GPT-5.4 has better performance per dollar (9 vs 8.2).

Note: Pricing is per million tokens. Actual costs vary with usage patterns, prompt caching, and batch discounts. Always verify against official provider pricing pages.