Llama 4 Scout vs Llama 3.3 70B
Side-by-side API pricing comparison · Meta vs Meta
🏆
Llama 4 Scout is 67.4% cheaper on blended cost ($0.225 vs $0.690/Mtok)
Llama 4 Scout
by Meta
Current · Open weights
Input
$0.110/Mtok
Output
$0.340/Mtok
✓ Cheaper
| Spec | Value |
|---|---|
| Blended avg | $0.225/Mtok |
| Context | 10M tokens |
| Modality | text, image |
| Parameters | 17B active (16 experts) |
| Released | Apr 6, 2025 |
Llama 3.3 70B
by Meta
Current · Open weights
Input
$0.590/Mtok
Output
$0.790/Mtok
| Spec | Value |
|---|---|
| Blended avg | $0.690/Mtok |
| Context | 128K tokens |
| Modality | text |
| Parameters | 70B |
| Released | Dec 6, 2024 |
Cost at scale (50/50 input/output split)
| Volume | Llama 4 Scout | Llama 3.3 70B | Savings |
|---|---|---|---|
| 1M tokens | $0.225 | $0.69 | $0.465 (67.4%) |
| 10M tokens | $2.25 | $6.90 | $4.65 (67.4%) |
| 100M tokens | $22.50 | $69.00 | $46.50 (67.4%) |
| 1,000M tokens | $225.00 | $690.00 | $465.00 (67.4%) |
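The figures in the table above can be reproduced with a few lines of arithmetic. The following is a minimal sketch, assuming the blended price is the simple average of the input and output prices (a 50/50 split); the function names are illustrative and not part of any provider SDK.

```python
# Prices in USD per million tokens, taken from the comparison above.
PRICES = {
    "Llama 4 Scout": {"input": 0.110, "output": 0.340},
    "Llama 3.3 70B": {"input": 0.590, "output": 0.790},
}


def blended_price(model: str) -> float:
    """Blended USD/Mtok, assuming half the tokens are input and half output."""
    p = PRICES[model]
    return (p["input"] + p["output"]) / 2


def savings(total_mtok: float) -> tuple[float, float]:
    """Return (USD saved, percent saved) when using Scout instead of 3.3 70B."""
    scout = total_mtok * blended_price("Llama 4 Scout")
    llama33 = total_mtok * blended_price("Llama 3.3 70B")
    saved = llama33 - scout
    return saved, saved / llama33 * 100


# Same volumes as the table, in millions of tokens.
for mtok in (1, 10, 100, 1000):
    saved, pct = savings(mtok)
    print(f"{mtok:>5}M tokens: save ${saved:,.3f} ({pct:.1f}%)")
```

Because both costs scale linearly with volume, the percentage saved is the same at every row; only the dollar amount grows.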
Summary
Llama 4 Scout by Meta costs $0.110/Mtok input and $0.340/Mtok output, with a 10M-token context window. It supports text and image input.
Llama 3.3 70B by Meta costs $0.590/Mtok input and $0.790/Mtok output, with a 128K-token context window. It supports text input.
On a blended cost basis, Llama 4 Scout is 67.4% cheaper than Llama 3.3 70B; put the other way, Llama 3.3 70B costs about 3.07 times as much. Llama 4 Scout also has a larger context window (10M vs 128K tokens).
Note: Pricing is per million tokens. Actual costs vary with usage patterns, prompt caching, and batch discounts. Always verify against official provider pricing pages.
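Since real workloads are rarely an even split, it can help to price a specific input/output mix. The sketch below uses the list prices quoted on this page and a hypothetical workload of 8M input and 2M output tokens; it does not model prompt caching or batch discounts, and `workload_cost` is an illustrative helper rather than a provider API.

```python
# USD per million tokens as (input, output), from the pricing above.
PRICE_PER_MTOK = {
    "Llama 4 Scout": (0.110, 0.340),
    "Llama 3.3 70B": (0.590, 0.790),
}


def workload_cost(model: str, input_tokens: int, output_tokens: int) -> float:
    """USD cost for a workload given raw input and output token counts."""
    in_price, out_price = PRICE_PER_MTOK[model]
    return (input_tokens * in_price + output_tokens * out_price) / 1_000_000


# Hypothetical input-heavy workload (an assumption for illustration only).
INPUT_TOKENS = 8_000_000
OUTPUT_TOKENS = 2_000_000

for model in PRICE_PER_MTOK:
    usd = workload_cost(model, INPUT_TOKENS, OUTPUT_TOKENS)
    print(f"{model}: ${usd:.2f}")
```

An input-heavy mix widens the gap relative to the 50/50 figure, because the input price difference between the two models is proportionally larger than the output price difference.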