LIVE Cheapest: GLM-4.7-Flash $0.000/Mtok in 153 models tracked Updated Jun 25, 2026
Jun 25, 2026
ModelPriceWatch$/Mtok
Pricing / Compare / Rerank 3.5 vs Embed 4

Rerank 3.5 vs Embed 4

Side-by-side API pricing comparison · Cohere vs Cohere

🏆 Rerank 3.5 is 500% cheaper on blended cost ($0.020 vs $0.120/Mtok)

Rerank 3.5

by Cohere

Current embedding
Input
$0.020/Mtok
Output
$0.020/Mtok
✓ Cheaper
Blended avg$0.020/Mtok
Context— tokens
Modalitytext
ParametersProprietary
ReleasedMar 1, 2025
Full details →

Embed 4

by Cohere

Current embedding
Input
$0.120/Mtok
Output
$0.120/Mtok
Blended avg$0.120/Mtok
Context— tokens
Modalitytext, image
ParametersProprietary
ReleasedJun 1, 2025
Full details →

Cost at scale — 1M tokens (50/50 input/output)

VolumeRerank 3.5Embed 4Savings
1M tokens $0.02 $0.12 $0.1 (83.3%)
10M tokens $0.2 $1.2 $1 (83.3%)
100M tokens $2 $12 $10 (83.3%)
1000M tokens $20 $120 $100 (83.3%)

Summary

Rerank 3.5 by Cohere costs $0.020/Mtok input and $0.020/Mtok output, with a —-token context window. It supports text input.

Embed 4 by Cohere costs $0.120/Mtok input and $0.120/Mtok output, with a —-token context window. It supports text, image input.

On a blended cost basis, Rerank 3.5 is 500% cheaper than Embed 4.

Note: Pricing is per million tokens. Actual costs vary with usage patterns, prompt caching, and batch discounts. Always verify against official provider pricing pages.