LIVE Cheapest: GLM-4.7-Flash $0/Mtok in 154 models tracked Updated Jul 1, 2026
Jul 1, 2026
ModelPriceWatch$/Mtok
Pricing / Compare / GPT-4.1 mini vs Claude Haiku 4.5

GPT-4.1 mini vs Claude Haiku 4.5

Side-by-side comparison of API pricing, specs, benchmarks, and capabilities

🏆 GPT-4.1 mini is 200% cheaper on blended cost ($1.00 vs $3.00/Mtok)
 
Overview
Status Current budget Current budget
Released Apr 14, 2025 Oct 15, 2025
Pricing per million tokens
Input $0.400/Mtok $1.00/Mtok
Output $1.60/Mtok $5.00/Mtok
Blended avg $1.00/Mtok $3.00/Mtok
Cached input $0.200/Mtok $0.100/Mtok
Specifications
Context window 1M tokens 200K tokens
Parameters Proprietary Proprietary
Speed (TPS)
Modalities
Input
textimage
textimage
Benchmarks from Vellum LLM Leaderboard
Avg benchmark score 50.6 82.9
Perf / dollar 50.6 27.6
GPQA Diamond 55
SWE-Bench 30
HumanEval 85 87.8
MATH 500 70 78
Providers
Available from
OpenAI — $0.400/$1.60/Mtok
Anthropic — $1.00/$5.00/Mtok

Cost at scale — 1M tokens (50/50 input/output)

VolumeGPT-4.1 miniClaude Haiku 4.5Savings
1M tokens $1 $3 $2 (66.7%)
10M tokens $10 $30 $20 (66.7%)
100M tokens $100 $300 $200 (66.7%)
1000M tokens $1000 $3000 $2000 (66.7%)

Summary

GPT-4.1 mini by OpenAI costs $0.400/Mtok input and $1.60/Mtok output, with a 1M-token context window. It supports text, image input.

Claude Haiku 4.5 by Anthropic costs $1.00/Mtok input and $5.00/Mtok output, with a 200K-token context window. It supports text, image input.

On a blended cost basis, GPT-4.1 mini is 200% cheaper than Claude Haiku 4.5. It also has a larger context window.

On benchmarks, Claude Haiku 4.5 scores higher (82.9 vs 50.6) on average. In terms of value, GPT-4.1 mini has better performance per dollar (50.6 vs 27.6).

Note: Pricing is per million tokens. Actual costs vary with usage patterns, prompt caching, and batch discounts. Always verify against official provider pricing pages.