Cheapest: GLM-4.7-Flash at $0.000/Mtok, among 153 models tracked. Updated Jun 25, 2026.

Best LLM API for Chatbots

Find the best LLM API for chatbots and conversational AI. Compare pricing for models optimized for dialogue, customer support, and interactive assistants.

66 models qualify. Showing the top 15, sorted by blended cost.
1. Granite 4 H Medium (IBM): $0.150 in, $0.600 out; $0.375/Mtok blended; 128K ctx
2. Llama 3.1 8B Instant (Groq): $0.050 in, $1.00 out; $0.525/Mtok blended; 128K ctx
3. GPT-OSS 20B (Groq): $0.075 in, $1.00 out; $0.537/Mtok blended; 128K ctx

Full ranking — top 15 models

#   Model                    Provider    Input $/Mtok  Output $/Mtok  Blended $/Mtok  Context
1   Granite 4 H Medium       IBM         $0.150        $0.600         $0.375          128K
2   Llama 3.1 8B Instant     Groq        $0.050        $1.00          $0.525          128K
3   GPT-OSS 20B              Groq        $0.075        $1.00          $0.537          128K
4   Llama 4 Scout            Groq        $0.110        $1.00          $0.555          10M
5   GPT-OSS 120B             Groq        $0.150        $1.00          $0.575          128K
6   Qwen3 32B                Groq        $0.290        $1.00          $0.645          128K
7   DeepSeek V4 Pro          DeepSeek    $0.435        $0.870         $0.652          1M
8   Granite 4 H Large        IBM         $0.300        $1.20          $0.750          128K
9   Llama 3.3 70B Versatile  Groq        $0.590        $1.00          $0.795          128K
10  Qwen-Plus                Alibaba     $0.400        $1.20          $0.800          131K
11  Qwen 3.6 27B             Groq        $0.600        $1.00          $0.800          128K
12  Mistral Large 3          Mistral     $0.500        $1.50          $1.00           128K
13  Sonar                    Perplexity  $1.00         $1.00          $1.00           200K
14  Gemini 3.1 Flash         Google      $0.300        $2.50          $1.40           1M
15  Reka Flash               Reka        $0.800        $2.00          $1.40           128K

How models are selected

The ranking covers mid-tier and flagship models suitable for conversational use, sorted by blended cost from lowest to highest.

Prices are in US dollars per million tokens (Mtok), sourced directly from official provider pricing pages and verified by our automated scraper pipeline, which runs three times daily. "Blended cost" is the arithmetic mean of the input and output prices, a quick proxy for workloads that split tokens roughly 50/50 between input and output.
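
As a rough illustration of the blended-cost definition above, here is a minimal Python sketch that recomputes the blend and the ranking for the top three models. The names `Price`, `blended`, `cost_usd`, and the `input_share` parameter are hypothetical and introduced only for this example; they are not part of any provider's API. The prices are copied from the ranking table.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Price:
    model: str
    input_per_mtok: float   # USD per million input tokens
    output_per_mtok: float  # USD per million output tokens


def blended(p: Price, input_share: float = 0.5) -> float:
    """Weighted mean of input and output price.

    input_share=0.5 reproduces the 50/50 blend used for the ranking;
    other values are an assumption for modelling different workloads.
    """
    return input_share * p.input_per_mtok + (1.0 - input_share) * p.output_per_mtok


def cost_usd(p: Price, input_tokens: int, output_tokens: int) -> float:
    """Total cost in USD for a given number of input and output tokens."""
    return (input_tokens * p.input_per_mtok
            + output_tokens * p.output_per_mtok) / 1_000_000


# Prices taken from the top three rows of the ranking table.
PRICES = [
    Price("Llama 3.1 8B Instant", 0.050, 1.00),
    Price("GPT-OSS 20B", 0.075, 1.00),
    Price("Granite 4 H Medium", 0.150, 0.600),
]

# Sort by the 50/50 blended cost, cheapest first.
ranked = sorted(PRICES, key=blended)

if __name__ == "__main__":
    for p in ranked:
        print(f"{p.model}: ${blended(p):.3f}/Mtok blended")
```

Because the blend assumes an even split, the ordering can change for other workloads. With the prices above and a 90/10 input-to-output mix, `blended(p, 0.9)` gives $0.145 for Llama 3.1 8B Instant and $0.195 for Granite 4 H Medium, so it may be worth re-ranking with your own token ratio before choosing a model.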