What is the best LLM API for chatbot?

Based on our verified pricing data, the cheapest model that qualifies is Granite 4 H Medium by IBM at $0.150/Mtok input. See the full ranking above for more options.

How often are prices updated?

Prices are verified against official provider pricing pages 3 times daily (8am, 2pm, 8pm UTC) by our automated scraper pipeline.

Pricing / Best For / Best LLM API for Chatbots

Best LLM API for Chatbots

Find the best LLM API for chatbots and conversational AI. Compare pricing for models optimized for dialogue, customer support, and interactive assistants.

66 models qualify Showing top 15 Sorted by blended cost

Granite 4 H Medium

IBM

$0.150 in $0.600 out

$0.375/Mtok blended

128K ctx

Llama 3.1 8B Instant

Groq

$0.050 in $1.00 out

$0.525/Mtok blended

128K ctx

GPT-OSS 20B

Groq

$0.075 in $1.00 out

$0.537/Mtok blended

128K ctx

Cost calculator for this use case

Tokens per day

Input/output ratio: 70/30

Days per month

🥇 Granite 4 H Medium $—

🥈 Llama 3.1 8B Instant $—

🥉 GPT-OSS 20B $—

Full ranking — top 15 models

#	Model	Provider	Input $/Mtok	Output $/Mtok	Blended	Context
1	Granite 4 H Medium	IBM	$0.150	$0.600	$0.375	128K	→
2	Llama 3.1 8B Instant	Groq	$0.050	$1.00	$0.525	128K	→
3	GPT-OSS 20B	Groq	$0.075	$1.00	$0.537	128K	→
4	Llama 4 Scout	Groq	$0.110	$1.00	$0.555	10M	→
5	GPT-OSS 120B	Groq	$0.150	$1.00	$0.575	128K	→
6	Qwen3 32B	Groq	$0.290	$1.00	$0.645	128K	→
7	DeepSeek V4 Pro	DeepSeek	$0.435	$0.870	$0.652	1M	→
8	Granite 4 H Large	IBM	$0.300	$1.20	$0.750	128K	→
9	Llama 3.3 70B Versatile	Groq	$0.590	$1.00	$0.795	128K	→
10	Qwen-Plus	Alibaba	$0.400	$1.20	$0.800	131K	→
11	Qwen 3.6 27B	Groq	$0.600	$1.00	$0.800	128K	→
12	Mistral Large 3	Mistral	$0.500	$1.50	$1.00	128K	→
13	Sonar	Perplexity	$1.00	$1.00	$1.00	200K	→
14	Gemini 3.1 Flash	Google	$0.300	$2.50	$1.40	1M	→
15	Reka Flash	Reka	$0.800	$2.00	$1.40	128K	→

How models are selected

Mid-tier and flagship models suitable for conversational use, sorted by blended cost.

Prices are per million tokens (Mtok), sourced directly from official provider pricing pages and verified by our automated scraper pipeline that runs 3x daily. "Blended cost" is the average of input and output pricing — a quick proxy for typical 50/50 usage patterns.

Best LLM API for Chatbots

Granite 4 H Medium

Llama 3.1 8B Instant

GPT-OSS 20B

Cost calculator for this use case

Full ranking — top 15 models

How models are selected

Other use case rankings