
NVIDIA Nemotron 3 Ultra

by Fireworks

Current · Open weights · Mid tier

Pricing · per 1M tokens

Input: $0.600 per million tokens
Output: $2.40 per million tokens
Cached input: $0.120/Mtok (20% of the input price, with prompt caching)
Blended avg cost*: $1.50/Mtok
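
The blended and cached-input figures above follow from the two base prices. The footnote defining the blended average is not shown here, so the sketch below assumes an even 1:1 mix of input and output tokens, which reproduces the listed $1.50/Mtok. The function names are illustrative and not part of any Fireworks API.

```python
# Listed prices in USD per 1M tokens (taken from the pricing table above).
INPUT_PER_MTOK = 0.600
OUTPUT_PER_MTOK = 2.40
CACHED_INPUT_PER_MTOK = 0.120


def blended_price(input_price: float, output_price: float,
                  input_share: float = 0.5) -> float:
    """Weighted average price per 1M tokens.

    input_share is the fraction of all tokens that are input tokens.
    The default of 0.5 (a 1:1 mix) is an assumption, not a documented rule.
    """
    if not 0.0 <= input_share <= 1.0:
        raise ValueError("input_share must be between 0 and 1")
    return input_price * input_share + output_price * (1.0 - input_share)


def cached_ratio(cached_price: float, input_price: float) -> float:
    """Cached-input price expressed as a fraction of the regular input price."""
    return cached_price / input_price


if __name__ == "__main__":
    blended = blended_price(INPUT_PER_MTOK, OUTPUT_PER_MTOK)
    ratio = cached_ratio(CACHED_INPUT_PER_MTOK, INPUT_PER_MTOK)
    print(f"Blended (1:1 mix): ${blended:.2f}/Mtok")
    print(f"Cached input as share of input price: {ratio:.0%}")
```

An output-heavy workload moves the blended figure toward the output price; for example, `blended_price(0.600, 2.40, input_share=0.25)` weights output tokens three to one.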

Overview

NVIDIA Nemotron 3 Ultra (preview), served by Fireworks, is priced at $0.60 input and $2.40 output per 1M tokens.

Specifications

Provider: Fireworks
Context window: 128K tokens
Modality: text
Parameters: Proprietary
Open source: Yes, open weights available
Released: Jan 1, 2026
Status: Current
Last updated: Jun 24, 2026
Tags: open-weights, mid-tier, reasoning, caching

Cost calculator

1K tokens (in): $0.0006
1K tokens (out): $0.0024
100K tokens (in): $0.06
100K tokens (out): $0.24
1M tokens (in): $0.600
1M tokens (out): $2.40
10M tokens (blended): $15
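
The rows above are straight multiples of the per-million prices. As a rough sketch, the same arithmetic can be applied to a single request. It assumes that cached tokens are a subset of the input tokens and are billed at the cached rate instead of the regular input rate; that billing rule and the `request_cost` helper are assumptions for illustration, not something this page documents.

```python
MTOK = 1_000_000  # tokens per pricing unit


def request_cost(input_tokens: int, output_tokens: int,
                 cached_input_tokens: int = 0,
                 input_price: float = 0.600,
                 output_price: float = 2.40,
                 cached_price: float = 0.120) -> float:
    """Estimated USD cost of one request at the listed per-1M-token prices.

    Assumption: cached_input_tokens is counted within input_tokens and is
    charged at cached_price rather than input_price.
    """
    if min(input_tokens, output_tokens, cached_input_tokens) < 0:
        raise ValueError("token counts must be non-negative")
    if cached_input_tokens > input_tokens:
        raise ValueError("cached_input_tokens cannot exceed input_tokens")
    uncached = input_tokens - cached_input_tokens
    total = (uncached * input_price
             + cached_input_tokens * cached_price
             + output_tokens * output_price)
    return total / MTOK


if __name__ == "__main__":
    # Recompute the input and output columns of the calculator.
    for n in (1_000, 100_000, 1_000_000):
        print(f"{n:>9,} tokens  in: ${request_cost(n, 0):.4f}"
              f"  out: ${request_cost(0, n):.4f}")
    # 10M tokens split evenly between input and output (the blended row).
    print(f"10M blended: ${request_cost(5_000_000, 5_000_000):.2f}")
```

Passing `cached_input_tokens` shows the effect of prompt caching on a long, repeated prefix: under the stated assumption, a fully cached 1M-token prompt costs the cached rate rather than the full input rate.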

At a glance

Input: $0.600/M
Output: $2.40/M
Blended: $1.50/M
Context: 128K
Tier: mid

Price history

Jun 24, 2026: Baseline · $0.60 input / $2.40 output per Mtok

Tracking since Jun 24, 2026 · 1 data point

Related models