Pricing / Compare / DeepSeek V4 Flash vs GPT OSS 120B

DeepSeek V4 Flash vs GPT OSS 120B

Side-by-side API pricing comparison · Fireworks vs Fireworks

🏆 DeepSeek V4 Flash is 78.6% cheaper on blended cost ($0.210 vs $0.375/Mtok)

DeepSeek V4 Flash

by Fireworks

Current budget

Input

$0.140/Mtok

Output

$0.280/Mtok

✓ Cheaper

Blended avg	$0.210/Mtok
Cached input	$0.028/Mtok
Context	1M tokens
Modality	text
Parameters	Proprietary
Released	Dec 1, 2025

Full details →

GPT OSS 120B

by Fireworks

Current budget Open weights

Input

$0.150/Mtok

Output

$0.600/Mtok

Blended avg	$0.375/Mtok
Cached input	$0.015/Mtok
Context	128K tokens
Modality	text
Parameters	120B
Released	Jan 1, 2026

Full details →

Cost at scale — 1M tokens (50/50 input/output)

Volume	DeepSeek V4 Flash	GPT OSS 120B	Savings
1M tokens	$0.21	$0.38	$0.16 (42.7%)
10M tokens	$2.1	$3.75	$1.65 (44%)
100M tokens	$21	$37.5	$16.5 (44%)
1000M tokens	$210	$375	$165 (44%)

Summary

DeepSeek V4 Flash by Fireworks costs $0.140/Mtok input and $0.280/Mtok output, with a 1M-token context window. It supports text input.

GPT OSS 120B by Fireworks costs $0.150/Mtok input and $0.600/Mtok output, with a 128K-token context window. It supports text input.

On a blended cost basis, DeepSeek V4 Flash is 78.6% cheaper than GPT OSS 120B. It also has a larger context window.

Note: Pricing is per million tokens. Actual costs vary with usage patterns, prompt caching, and batch discounts. Always verify against official provider pricing pages.

DeepSeek V4 Flash vs GPT OSS 120B

DeepSeek V4 Flash

GPT OSS 120B

Cost at scale — 1M tokens (50/50 input/output)

Summary

More comparisons