Updated Jun 25, 2026

Llama 4 Scout vs Llama 3.3 70B

Side-by-side API pricing comparison · Meta vs Together

🏆 Llama 4 Scout is 78.4% cheaper on blended cost ($0.225 vs $1.04/Mtok)

Llama 4 Scout

by Meta

Current · Open weights
Input: $0.110/Mtok
Output: $0.340/Mtok
✓ Cheaper
Blended avg: $0.225/Mtok
Context: 10M tokens
Modality: text, image
Parameters: 17B active (16 experts)
Released: Apr 6, 2025

Llama 3.3 70B

by Together

Current · Open weights
Input: $1.04/Mtok
Output: $1.04/Mtok
Blended avg: $1.04/Mtok
Context: 128K tokens
Modality: text
Parameters: 70B
Released: Dec 6, 2024
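The page does not state how the blended average is derived, but the listed figures match a simple mean of the input and output prices. A minimal sketch under that assumption (the function name `blended_price` is illustrative, not part of any provider API):

```python
def blended_price(input_per_mtok: float, output_per_mtok: float) -> float:
    """Assumed definition: simple mean of input and output price ($/Mtok)."""
    return (input_per_mtok + output_per_mtok) / 2


# Prices as listed on this page, in USD per million tokens.
scout = blended_price(0.110, 0.340)
llama33 = blended_price(1.04, 1.04)

print(f"Llama 4 Scout: ${scout:.3f}/Mtok")
print(f"Llama 3.3 70B: ${llama33:.2f}/Mtok")
```

A simple mean treats input and output tokens as equally common, which is the same 50/50 split used in the cost table.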

Cost at scale (50/50 input/output split)

| Volume | Llama 4 Scout | Llama 3.3 70B | Savings |
| --- | --- | --- | --- |
| 1M tokens | $0.225 | $1.04 | $0.815 (78.4%) |
| 10M tokens | $2.25 | $10.40 | $8.15 (78.4%) |
| 100M tokens | $22.50 | $104.00 | $81.50 (78.4%) |
| 1,000M tokens | $225.00 | $1,040.00 | $815.00 (78.4%) |
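The rows above scale linearly with volume, so they can be reproduced from the two blended prices alone. The sketch below assumes the blended figures listed on this page and a fixed 50/50 split; `cost_at_volume` is a hypothetical helper name.

```python
# Blended prices from this page, in USD per million tokens.
BLENDED = {"Llama 4 Scout": 0.225, "Llama 3.3 70B": 1.04}


def cost_at_volume(total_tokens: int, blended_per_mtok: float) -> float:
    """Cost in USD for a token volume at a given blended $/Mtok price."""
    return total_tokens / 1_000_000 * blended_per_mtok


for volume in (1_000_000, 10_000_000, 100_000_000, 1_000_000_000):
    scout = cost_at_volume(volume, BLENDED["Llama 4 Scout"])
    llama33 = cost_at_volume(volume, BLENDED["Llama 3.3 70B"])
    saving = llama33 - scout
    # The saving percentage is relative to the more expensive model.
    print(f"{volume // 1_000_000:>5}M tokens  ${scout:>8.3f}  "
          f"${llama33:>8.2f}  ${saving:>8.3f} ({saving / llama33:.1%})")
```

Because both costs scale by the same factor, the percentage saving is identical at every volume.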

Summary

Llama 4 Scout by Meta costs $0.110/Mtok input and $0.340/Mtok output, with a 10M-token context window. It supports text and image input.

Llama 3.3 70B by Together costs $1.04/Mtok input and $1.04/Mtok output, with a 128K-token context window. It supports text input.

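On a blended cost basis, Llama 4 Scout is 78.4% cheaper than Llama 3.3 70B; put the other way, Llama 3.3 70B costs 362.2% more (about 4.6x as much). Llama 4 Scout also has a larger context window (10M vs 128K tokens).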
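Two different percentages can describe the same price gap, depending on which price is the baseline. A saving is measured against the more expensive price and can never exceed 100%, while a premium is measured against the cheaper price and can. The sketch below shows both calculations for the blended prices on this page; the function names are illustrative.

```python
def pct_cheaper(cheap: float, expensive: float) -> float:
    """Saving relative to the more expensive price (always below 100%)."""
    return (expensive - cheap) / expensive * 100


def pct_more_expensive(cheap: float, expensive: float) -> float:
    """Premium relative to the cheaper price (can exceed 100%)."""
    return (expensive - cheap) / cheap * 100


scout, llama33 = 0.225, 1.04  # blended $/Mtok from this page

print(f"Scout is {pct_cheaper(scout, llama33):.1f}% cheaper")
print(f"Llama 3.3 70B is {pct_more_expensive(scout, llama33):.1f}% more expensive")
print(f"Price ratio: {llama33 / scout:.2f}x")
```

With these inputs the saving works out to 78.4% and the premium to 362.2%, so the 362.2% figure describes how much more Llama 3.3 70B costs, not how much cheaper Llama 4 Scout is.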

Note: Pricing is per million tokens. Actual costs vary with usage patterns, prompt caching, and batch discounts. Always verify against official provider pricing pages.
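To illustrate how usage patterns shift the effective price, the sketch below weights the listed input and output prices by the share of input tokens. The 80/20 and 20/80 splits are hypothetical workloads chosen for illustration, and the calculation ignores prompt caching and batch discounts, whose rates this page does not list.

```python
def effective_price(input_price: float, output_price: float,
                    input_share: float) -> float:
    """Effective $/Mtok when `input_share` of all tokens are input tokens."""
    if not 0.0 <= input_share <= 1.0:
        raise ValueError("input_share must be between 0 and 1")
    return input_share * input_price + (1 - input_share) * output_price


# Listed prices in USD per million tokens: (input, output).
PRICES = {"Llama 4 Scout": (0.110, 0.340), "Llama 3.3 70B": (1.04, 1.04)}

# Hypothetical workloads: input-heavy, balanced, output-heavy.
for share in (0.8, 0.5, 0.2):
    for model, (inp, out) in PRICES.items():
        price = effective_price(inp, out, share)
        print(f"{share:.0%} input  {model:<14} ${price:.3f}/Mtok")
```

Llama 3.3 70B is priced the same for input and output, so its effective price does not move with the split. Llama 4 Scout gets cheaper as the workload becomes more input-heavy.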