Pricing / Compare / Pixtral 12B vs Llama Nemotron Ultra 253B

Pixtral 12B vs Llama Nemotron Ultra 253B

Side-by-side API pricing comparison · Mistral vs NVIDIA

🏆 Pixtral 12B is 2000% cheaper on blended cost ($0.100 vs $2.10/Mtok)

Pixtral 12B

by Mistral

Current open weights Open weights

Input

$0.100/Mtok

Output

$0.100/Mtok

✓ Cheaper

Blended avg	$0.100/Mtok
Context	128K tokens
Modality	text, image
Parameters	12B
Released	Nov 1, 2024

Full details →

Llama Nemotron Ultra 253B

by NVIDIA

Current open weights Open weights

Input

$0.600/Mtok

Output

$3.60/Mtok

Blended avg	$2.10/Mtok
Context	128K tokens
Modality	text
Parameters	253B
Released	Jan 1, 2025

Full details →

Cost at scale — 1M tokens (50/50 input/output)

Volume	Pixtral 12B	Llama Nemotron Ultra 253B	Savings
1M tokens	$0.1	$2.1	$2 (95.2%)
10M tokens	$1	$21	$20 (95.2%)
100M tokens	$10	$210	$200 (95.2%)
1000M tokens	$100	$2100	$2000 (95.2%)

Summary

Pixtral 12B by Mistral costs $0.100/Mtok input and $0.100/Mtok output, with a 128K-token context window. It supports text, image input.

Llama Nemotron Ultra 253B by NVIDIA costs $0.600/Mtok input and $3.60/Mtok output, with a 128K-token context window. It supports text input.

On a blended cost basis, Pixtral 12B is 2000% cheaper than Llama Nemotron Ultra 253B.

Note: Pricing is per million tokens. Actual costs vary with usage patterns, prompt caching, and batch discounts. Always verify against official provider pricing pages.

Pixtral 12B vs Llama Nemotron Ultra 253B

Pixtral 12B

Llama Nemotron Ultra 253B

Cost at scale — 1M tokens (50/50 input/output)

Summary

More comparisons