Pricing / Compare / Mixtral 8x7B Instruct vs Llama 3.3 70B

Mixtral 8x7B Instruct vs Llama 3.3 70B

Side-by-side API pricing comparison · Mistral vs Together

🏆 Mixtral 8x7B Instruct is 92.6% cheaper on blended cost ($0.540 vs $1.04/Mtok)

Mixtral 8x7B Instruct

by Mistral

Current open weights Open weights

Input

$0.540/Mtok

Output

$0.540/Mtok

✓ Cheaper

Blended avg	$0.540/Mtok
Context	32K tokens
Modality	text
Parameters	46.7B (8x7B MoE)
Released	Jul 1, 2024

Full details →

Llama 3.3 70B

by Together

Current open weights Open weights

Input

$1.04/Mtok

Output

$1.04/Mtok

Blended avg	$1.04/Mtok
Context	128K tokens
Modality	text
Parameters	70B
Released	Dec 6, 2024

Full details →

Cost at scale — 1M tokens (50/50 input/output)

Volume	Mixtral 8x7B Instruct	Llama 3.3 70B	Savings
1M tokens	$0.54	$1.04	$0.5 (48.1%)
10M tokens	$5.4	$10.4	$5 (48.1%)
100M tokens	$54	$104	$50 (48.1%)
1000M tokens	$540	$1040	$500 (48.1%)

Summary

Mixtral 8x7B Instruct by Mistral costs $0.540/Mtok input and $0.540/Mtok output, with a 32K-token context window. It supports text input.

Llama 3.3 70B by Together costs $1.04/Mtok input and $1.04/Mtok output, with a 128K-token context window. It supports text input.

On a blended cost basis, Mixtral 8x7B Instruct is 92.6% cheaper than Llama 3.3 70B.

Note: Pricing is per million tokens. Actual costs vary with usage patterns, prompt caching, and batch discounts. Always verify against official provider pricing pages.

Mixtral 8x7B Instruct vs Llama 3.3 70B

Mixtral 8x7B Instruct

Llama 3.3 70B

Cost at scale — 1M tokens (50/50 input/output)

Summary

More comparisons