LIVE Cheapest: GLM-4.7-Flash $0.000/Mtok in 153 models tracked Updated Jun 25, 2026
Jun 25, 2026
ModelPriceWatch$/Mtok
Pricing / Compare / Mixtral 8x7B Instruct vs Llama 3.3 70B

Mixtral 8x7B Instruct vs Llama 3.3 70B

Side-by-side API pricing comparison · Mistral vs Meta

🏆 Mixtral 8x7B Instruct is 27.8% cheaper on blended cost ($0.540 vs $0.690/Mtok)

Mixtral 8x7B Instruct

by Mistral

Current open weights Open weights
Input
$0.540/Mtok
Output
$0.540/Mtok
✓ Cheaper
Blended avg$0.540/Mtok
Context32K tokens
Modalitytext
Parameters46.7B (8x7B MoE)
ReleasedJul 1, 2024
Full details →

Llama 3.3 70B

by Meta

Current open weights Open weights
Input
$0.590/Mtok
Output
$0.790/Mtok
Blended avg$0.690/Mtok
Context128K tokens
Modalitytext
Parameters70B
ReleasedDec 6, 2024
Full details →

Cost at scale — 1M tokens (50/50 input/output)

VolumeMixtral 8x7B InstructLlama 3.3 70BSavings
1M tokens $0.54 $0.69 $0.15 (21.7%)
10M tokens $5.4 $6.9 $1.5 (21.7%)
100M tokens $54 $69 $15 (21.7%)
1000M tokens $540 $690 $150 (21.7%)

Summary

Mixtral 8x7B Instruct by Mistral costs $0.540/Mtok input and $0.540/Mtok output, with a 32K-token context window. It supports text input.

Llama 3.3 70B by Meta costs $0.590/Mtok input and $0.790/Mtok output, with a 128K-token context window. It supports text input.

On a blended cost basis, Mixtral 8x7B Instruct is 27.8% cheaper than Llama 3.3 70B.

Note: Pricing is per million tokens. Actual costs vary with usage patterns, prompt caching, and batch discounts. Always verify against official provider pricing pages.