LIVE Cheapest: GLM-4.7-Flash $0/Mtok in 154 models tracked Updated Jul 1, 2026

Claude Sonnet 4.6 vs Claude Haiku 4.5

Side-by-side comparison of API pricing, specs, benchmarks, and capabilities

🏆 Claude Haiku 4.5 is 66.7% cheaper on blended cost ($3.00 vs $9.00/Mtok)
 
Overview
                Claude Sonnet 4.6     Claude Haiku 4.5
Status          Current, mid tier     Current, budget
Released        Jan 15, 2026          Oct 15, 2025
Pricing per million tokens
Input           $3.00/Mtok            $1.00/Mtok
Output          $15.00/Mtok           $5.00/Mtok
Blended avg     $9.00/Mtok            $3.00/Mtok
Cached input    $0.300/Mtok           $0.100/Mtok
Specifications
Context window  200K tokens           200K tokens
Parameters      Proprietary           Proprietary
Speed (TPS)
Modalities
Input           text, image           text, image
Benchmarks from Vellum LLM Leaderboard
Avg benchmark score 73.8 82.9
Perf / dollar 8.2 27.6
HumanEval 87.8
MATH 500 78
Providers
Available from  Anthropic: $3.00/$15.00/Mtok  Anthropic: $1.00/$5.00/Mtok

Cost at scale (50/50 input/output split)

Volume          Claude Sonnet 4.6     Claude Haiku 4.5      Savings
1M tokens       $9                    $3                    $6 (66.7%)
10M tokens      $90                   $30                   $60 (66.7%)
100M tokens     $900                  $300                  $600 (66.7%)
1000M tokens    $9,000                $3,000                $6,000 (66.7%)
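
The cost-at-scale rows can be reproduced from the per-token prices. The following is a minimal sketch, assuming list prices only (no caching or batch discounts); the `Pricing` class, `cost_usd` function, and `input_share` parameter are illustrative names, not part of any provider API.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Pricing:
    """List prices in USD per million tokens (Mtok)."""
    input_per_mtok: float
    output_per_mtok: float


# Prices copied from the comparison table on this page.
SONNET_4_6 = Pricing(input_per_mtok=3.00, output_per_mtok=15.00)
HAIKU_4_5 = Pricing(input_per_mtok=1.00, output_per_mtok=5.00)


def cost_usd(pricing: Pricing, total_tokens: int, input_share: float = 0.5) -> float:
    """Cost of total_tokens when input_share of them are input tokens."""
    if not 0.0 <= input_share <= 1.0:
        raise ValueError("input_share must be between 0 and 1")
    input_tokens = total_tokens * input_share
    output_tokens = total_tokens - input_tokens
    return (input_tokens * pricing.input_per_mtok
            + output_tokens * pricing.output_per_mtok) / 1_000_000


# Reproduce the table rows for a 50/50 input/output split.
for volume in (1_000_000, 10_000_000, 100_000_000, 1_000_000_000):
    sonnet = cost_usd(SONNET_4_6, volume)
    haiku = cost_usd(HAIKU_4_5, volume)
    print(f"{volume // 1_000_000}M tokens: ${sonnet:,.0f} vs ${haiku:,.0f}, "
          f"savings ${sonnet - haiku:,.0f}")
```

Changing `input_share` shifts the absolute cost (for example, an 80/20 input-heavy mix costs less than 50/50 on both models), but because each Claude Haiku 4.5 rate listed here is one third of the corresponding Claude Sonnet 4.6 rate, the percentage savings stays at 66.7% for any mix.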

Summary

Claude Sonnet 4.6 by Anthropic costs $3.00/Mtok input and $15.00/Mtok output, with a 200K-token context window. It supports text and image input.

Claude Haiku 4.5 by Anthropic costs $1.00/Mtok input and $5.00/Mtok output, with a 200K-token context window. It supports text and image input.

On a blended cost basis, Claude Haiku 4.5 is 66.7% cheaper than Claude Sonnet 4.6 ($3.00 vs $9.00/Mtok).
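
A relative price difference depends on which price is used as the base, and a price cannot be more than 100% cheaper than another. The sketch below computes both directions from the blended prices listed on this page; the function names are illustrative.

```python
def percent_cheaper(lower: float, higher: float) -> float:
    """Savings expressed relative to the higher price."""
    return (higher - lower) / higher * 100


def percent_more_expensive(higher: float, lower: float) -> float:
    """Premium expressed relative to the lower price."""
    return (higher - lower) / lower * 100


SONNET_BLENDED = 9.00  # $/Mtok, from the pricing table
HAIKU_BLENDED = 3.00   # $/Mtok, from the pricing table

print(f"Haiku is {percent_cheaper(HAIKU_BLENDED, SONNET_BLENDED):.1f}% cheaper")
print(f"Sonnet is {percent_more_expensive(SONNET_BLENDED, HAIKU_BLENDED):.0f}% more expensive")
```

The first figure (66.7%) matches the savings column in the cost-at-scale table. The second (200%) describes how much more Claude Sonnet 4.6 costs than Claude Haiku 4.5, which is the same gap measured against the lower price.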

On benchmarks, Claude Haiku 4.5 scores higher (82.9 vs 73.8) on average. In terms of value, Claude Haiku 4.5 has better performance per dollar (27.6 vs 8.2).
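
The perf/dollar figures are consistent with dividing the average benchmark score by the blended price. That formula is inferred from the numbers on this page rather than documented by the site, so treat the sketch below as an assumption.

```python
def perf_per_dollar(avg_score: float, blended_usd_per_mtok: float) -> float:
    """Average benchmark score per blended dollar per Mtok (inferred formula)."""
    if blended_usd_per_mtok <= 0:
        raise ValueError("blended price must be positive")
    return avg_score / blended_usd_per_mtok


# Scores and blended prices copied from the comparison table.
sonnet_value = round(perf_per_dollar(73.8, 9.00), 1)
haiku_value = round(perf_per_dollar(82.9, 3.00), 1)
print(sonnet_value, haiku_value)
```

Rounded to one decimal place, these reproduce the listed values of 8.2 and 27.6. The metric is sensitive to which benchmarks go into the average, so it is only as reliable as the underlying scores.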

Note: Pricing is per million tokens. Actual costs vary with usage patterns, prompt caching, and batch discounts. Always verify against official provider pricing pages.
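
To illustrate how prompt caching changes the picture, the sketch below blends the listed input and cached-input prices by a cache hit rate. The `cache_hit_rate` parameter is hypothetical, and the sketch deliberately ignores any cache-write surcharge or batch discount, since this page does not list those rates.

```python
def effective_input_price(input_per_mtok: float,
                          cached_per_mtok: float,
                          cache_hit_rate: float) -> float:
    """Average input price per Mtok when a fraction of input tokens hit the cache."""
    if not 0.0 <= cache_hit_rate <= 1.0:
        raise ValueError("cache_hit_rate must be between 0 and 1")
    return (cache_hit_rate * cached_per_mtok
            + (1.0 - cache_hit_rate) * input_per_mtok)


# Input and cached-input prices copied from the pricing table ($/Mtok).
for name, full, cached in (("Claude Sonnet 4.6", 3.00, 0.30),
                           ("Claude Haiku 4.5", 1.00, 0.10)):
    for hit_rate in (0.0, 0.5, 0.9):
        price = effective_input_price(full, cached, hit_rate)
        print(f"{name}, {hit_rate:.0%} cache hits: ${price:.2f}/Mtok input")
```

Output tokens are unaffected by caching in this sketch, so the benefit is largest for workloads dominated by repeated input context.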