Mixtral 8x7B Instruct vs Llama 3.3 70B
Side-by-side API pricing comparison · Mistral vs Together
🏆
Mixtral 8x7B Instruct is 92.6% cheaper on blended cost ($0.540 vs $1.04/Mtok)
Mixtral 8x7B Instruct
by Mistral
Current open weights Open weightsInput
$0.540/Mtok
Output
$0.540/Mtok
✓ Cheaper
| Blended avg | $0.540/Mtok |
|---|---|
| Context | 32K tokens |
| Modality | text |
| Parameters | 46.7B (8x7B MoE) |
| Released | Jul 1, 2024 |
Llama 3.3 70B
by Together
Current open weights Open weightsInput
$1.04/Mtok
Output
$1.04/Mtok
| Blended avg | $1.04/Mtok |
|---|---|
| Context | 128K tokens |
| Modality | text |
| Parameters | 70B |
| Released | Dec 6, 2024 |
Cost at scale — 1M tokens (50/50 input/output)
| Volume | Mixtral 8x7B Instruct | Llama 3.3 70B | Savings |
|---|---|---|---|
| 1M tokens | $0.54 | $1.04 | $0.5 (48.1%) |
| 10M tokens | $5.4 | $10.4 | $5 (48.1%) |
| 100M tokens | $54 | $104 | $50 (48.1%) |
| 1000M tokens | $540 | $1040 | $500 (48.1%) |
Summary
Mixtral 8x7B Instruct by Mistral costs $0.540/Mtok input and $0.540/Mtok output, with a 32K-token context window. It supports text input.
Llama 3.3 70B by Together costs $1.04/Mtok input and $1.04/Mtok output, with a 128K-token context window. It supports text input.
On a blended cost basis, Mixtral 8x7B Instruct is 92.6% cheaper than Llama 3.3 70B.
Note: Pricing is per million tokens. Actual costs vary with usage patterns, prompt caching, and batch discounts. Always verify against official provider pricing pages.