LIVE Cheapest: GLM-4.7-Flash $0.000/Mtok in 153 models tracked Updated Jun 25, 2026
Jun 25, 2026
ModelPriceWatch$/Mtok
Pricing / Compare / Llama Nemotron Ultra 253B vs Nemotron 3 Ultra

Llama Nemotron Ultra 253B vs Nemotron 3 Ultra

Side-by-side API pricing comparison · NVIDIA vs NVIDIA

Llama Nemotron Ultra 253B

by NVIDIA

Current open weights Open weights
Input
$0.600/Mtok
Output
$3.60/Mtok
✓ Cheaper
Blended avg$2.10/Mtok
Context128K tokens
Modalitytext
Parameters253B
ReleasedJan 1, 2025
Full details →

Nemotron 3 Ultra

by NVIDIA

Current mid tier Open weights
Input
$0.600/Mtok
Output
$3.60/Mtok
Blended avg$2.10/Mtok
Cached input$0.120/Mtok
Context128K tokens
Modalitytext
ParametersProprietary
ReleasedJan 1, 2026
Full details →

Cost at scale — 1M tokens (50/50 input/output)

VolumeLlama Nemotron Ultra 253BNemotron 3 UltraSavings
1M tokens $2.1 $2.1 $0 (0%)
10M tokens $21 $21 $0 (0%)
100M tokens $210 $210 $0 (0%)
1000M tokens $2100 $2100 $0 (0%)

Summary

Llama Nemotron Ultra 253B by NVIDIA costs $0.600/Mtok input and $3.60/Mtok output, with a 128K-token context window. It supports text input.

Nemotron 3 Ultra by NVIDIA costs $0.600/Mtok input and $3.60/Mtok output, with a 128K-token context window. It supports text input.

Note: Pricing is per million tokens. Actual costs vary with usage patterns, prompt caching, and batch discounts. Always verify against official provider pricing pages.