API Pricing

Every major LLM provider in one table. Sortable, filterable. Prices per 1 million tokens.

Model	Provider	Input / 1M	Output / 1M	Context	Max Output	Speed	Quality	Value
GPT-OSS 120B	OpenAI	$0.039	$0.190	131K	33K	--	--	--
GPT-5 Nano	OpenAI	$0.050	$0.400	128K	16K	148 tok/s	27	536.0
Qwen3.5-9B	Alibaba	$0.050	$0.150	262K	33K	--	--	--
Qwen3.5-Omni Flash	Alibaba	$0.065	$0.260	262K	33K	--	--	--
Hunyuan HY3 Preview	Tencent	$0.066	$0.260	262K	33K	--	--	--
GPT-OSS 20B (Bedrock)	OpenAI	$0.070	$0.300	16K	16K	--	--	--
GLM-4.7-flash	Zhipu	$0.070	$0.400	200K	8K	--	--	--
GPT-OSS 20B	OpenAI	$0.075	$0.300	131K	33K	--	--	--
Gemini 2.0 Flash-Lite	Google	$0.075	$0.300	1.0M	8K	--	15	193.3
Llama 4 Scout	Meta	$0.080	$0.300	1.0M	16K	134 tok/s	14	168.8
Qwen3-Next-80B-A3B-Thinking	Alibaba	$0.098	$0.780	262K	33K	--	--	--
GPT-4.1 Nano	OpenAI	$0.100	$0.400	1.0M	33K	126 tok/s	13	130.0
Gemini 2.5 Flash-Lite	Google	$0.100	$0.400	1.0M	66K	268 tok/s	13	127.0
Gemini 2.0 Flash	Google	$0.100	$0.400	1.0M	8K	--	19	185.0
Devstral Small	Mistral	$0.100	$0.300	256K	33K	--	--	--
Mistral Small 3.2	Mistral	$0.100	$0.300	128K	4K	143 tok/s	10	102.0
Gemma 4 26B A4B	Google	$0.130	$0.400	262K	33K	--	--	--
Gemma 4 31B	Google	$0.140	$0.400	262K	33K	--	--	--
DeepSeek V4-Flash	DeepSeek	$0.140	$0.280	1.0M	384K	--	--	--
GPT-OSS 120B (Bedrock)	OpenAI	$0.150	$0.600	16K	16K	--	--	--
GPT-4o Mini	OpenAI	$0.150	$0.600	128K	16K	75 tok/s	13	84.0
Mistral Small 4	Mistral	$0.150	$0.600	256K	33K	--	--	--
Ministral 3 8B	Mistral	$0.150	$0.150	262K	33K	--	--	--
Pixtral 12B	Mistral	$0.150	$0.150	128K	4K	--	--	--
Command R	Cohere	$0.150	$0.600	128K	4K	--	--	--
Command R 08 2024	Cohere	$0.150	$0.600	128K	4K	--	--	--
Command R7b 12 2024	Cohere	$0.150	$0.038	128K	4K	--	--	--
Llama 3.3 70B	Meta	$0.180	$0.180	131K	4K	--	--	--
GPT-5.4 Nano	OpenAI	$0.200	$1.25	400K	128K	--	--	--
Grok 4.1 Fast	xAI	$0.200	$0.500	2.0M	16K	75 tok/s	24	118.0
Grok 4.1 Fast Reasoning	xAI	$0.200	$0.500	2.0M	16K	88 tok/s	39	193.0
Grok Code Fast	xAI	$0.200	$1.50	256K	16K	--	--	--
Ministral 3 14B	Mistral	$0.200	$0.200	262K	33K	--	--	--
Jamba 1.5	AI21	$0.200	$0.400	256K	256K	--	--	--
Jamba 1.5 Mini	AI21	$0.200	$0.400	256K	256K	--	--	--
Jamba 1.5 Mini@001	AI21	$0.200	$0.400	256K	256K	--	--	--
Jamba Mini 1.6	AI21	$0.200	$0.400	256K	256K	--	--	--
Jamba Mini 1.7	AI21	$0.200	$0.400	256K	256K	--	--	--
GPT-5 Mini	OpenAI	$0.250	$2.00	400K	16K	85 tok/s	41	164.8
Gemini 3.1 Flash-Lite	Google	$0.250	$1.50	1.0M	66K	350 tok/s	34	134.0
Claude 3 Haiku 20240307	Anthropic	$0.250	$1.25	200K	4K	--	--	--
Llama 4 Maverick	Meta	$0.270	$0.850	1.0M	16K	113 tok/s	18	68.1
Qwen3.6-Plus	Alibaba	$0.276	$1.65	1.0M	33K	--	--	--
DeepSeek V3.2 (Chat)	DeepSeek	$0.280	$0.420	128K	8K	--	32	114.6
DeepSeek V3.2 (Reasoner)	DeepSeek	$0.280	$0.420	128K	8K	--	--	--
DeepSeek Reasoner	DeepSeek	$0.280	$0.420	131K	66K	--	--	--
Gemini 2.5 Flash	Google	$0.300	$2.50	1.0M	66K	215 tok/s	21	68.7
Grok 3 Mini	xAI	$0.300	$0.500	131K	16K	--	--	--
Codestral	Mistral	$0.300	$0.900	256K	33K	--	--	--
Qwen 3.5 27B	Alibaba	$0.300	$2.40	128K	33K	--	--	--
Nova 2.0 Lite	Amazon	$0.300	$2.50	1.0M	64K	229 tok/s	18	60.0
Nemotron 3 Super 120B	NVIDIA	$0.300	$0.800	1.0M	33K	159 tok/s	36	120.0
MiniMax M2.5	MiniMax	$0.300	$1.20	128K	33K	86 tok/s	42	139.7
Command Light	Cohere	$0.300	$0.600	4K	4K	--	--	--
GPT-4.1 Mini	OpenAI	$0.400	$1.60	1.0M	33K	79 tok/s	23	57.2
Mistral Medium	Mistral	$0.400	$2.00	131K	16K	--	--	--
Devstral	Mistral	$0.400	$2.00	256K	33K	--	--	--
Qwen3.5-Omni Plus	Alibaba	$0.400	$4.80	262K	33K	--	--	--
Gemini 3 Flash	Google	$0.500	$3.00	1.0M	66K	204 tok/s	35	70.0
Gemini 3 Flash Reasoning	Google	$0.500	$3.00	1.0M	66K	203 tok/s	46	92.8
Mistral Large 3	Mistral	$0.500	$1.50	262K	8K	55 tok/s	23	45.6
Magistral Small	Mistral	$0.500	$1.50	40K	16K	--	--	--
DeepSeek R1	DeepSeek	$0.550	$2.19	128K	33K	--	27	49.3
Grok 3 Mini Fast	xAI	$0.600	$4.00	131K	16K	--	--	--
Qwen 3.5 397B	Alibaba	$0.600	$3.60	128K	33K	--	--	--
Kimi K2.5	Moonshot	$0.600	$3.00	262K	33K	42 tok/s	47	78.0
Kimi K2 Thinking	Moonshot	$0.600	$2.50	262K	33K	--	--	--
GLM-4.7	Zhipu	$0.600	$2.20	200K	128K	--	--	--
GPT-5.4 Mini	OpenAI	$0.750	$4.50	400K	128K	--	--	--
Gemini 3.1 Flash Live	Google	$0.750	$4.50	1.0M	66K	--	--	--
Claude Haiku 3.5	Anthropic	$0.800	$4.00	200K	8K	--	19	23.4
QwQ-Plus	Alibaba	$0.800	$2.40	131K	16K	--	--	--
Kimi K2.6	Moonshot	$0.950	$4.00	262K	33K	--	--	--
Claude Haiku 4.5	Anthropic	$1.00	$5.00	200K	8K	100 tok/s	31	31.1
Claude 4.5 Haiku Reasoning	Anthropic	$1.00	$5.00	200K	8K	139 tok/s	37	37.1
Gemini 3.1 Flash TTS	Google	$1.00	$20.00	32K	32K	--	--	--
GLM-5	Zhipu	$1.00	$3.20	128K	33K	68 tok/s	50	49.8
MiMo-V2-Pro	Xiaomi	$1.00	$3.00	1.0M	131K	--	--	--
Claude Haiku 4 5 20251001	Anthropic	$1.00	$5.00	200K	64K	--	--	--
Claude Haiku 4 5	Anthropic	$1.00	$5.00	200K	64K	--	--	--
o4 Mini	OpenAI	$1.10	$4.40	200K	100K	171 tok/s	33	30.1
o3 Mini	OpenAI	$1.10	$4.40	200K	100K	162 tok/s	26	23.5
Kimi K2 Thinking Turbo	Moonshot	$1.15	$8.00	262K	33K	--	--	--
GLM-5 Turbo	Zhipu	$1.20	$4.00	200K	128K	--	--	--
GPT-5.1	OpenAI	$1.25	$10.00	400K	16K	168 tok/s	48	38.2
GPT-5	OpenAI	$1.25	$10.00	400K	16K	88 tok/s	45	35.7
GPT-5 Medium	OpenAI	$1.25	$10.00	400K	16K	84 tok/s	42	33.6
Gemini 2.5 Pro	Google	$1.25	$10.00	1.0M	66K	135 tok/s	35	27.7
Grok 4.3	xAI	$1.25	$2.50	1.0M	16K	--	--	--
Nova 2.0 Pro Reasoning	Amazon	$1.25	$10.00	128K	33K	162 tok/s	32	25.5
GLM-5.1	Zhipu	$1.40	$4.40	200K	128K	--	--	--
Mistral Medium 3.5	Mistral	$1.50	$7.50	256K	33K	--	--	--
DeepSeek V4-Pro	DeepSeek	$1.74	$3.48	1.0M	384K	--	--	--
GPT-5.2	OpenAI	$1.75	$14.00	400K	16K	80 tok/s	51	29.3
GPT-5.3 Codex	OpenAI	$1.75	$14.00	400K	33K	96 tok/s	54	30.6
GPT-4.1	OpenAI	$2.00	$8.00	1.0M	33K	127 tok/s	26	13.2
o3	OpenAI	$2.00	$8.00	200K	100K	111 tok/s	38	19.2
o4 Mini Deep Research	OpenAI	$2.00	$8.00	200K	100K	--	--	--
Gemini 3.1 Pro	Google	$2.00	$12.00	1.0M	66K	130 tok/s	57	28.6
Gemini 3 Pro	Google	$2.00	$12.00	1.0M	66K	141 tok/s	48	24.2
Grok 4.20	xAI	$2.00	$6.00	2.0M	16K	97 tok/s	49	24.6
Grok 2	xAI	$2.00	$10.00	131K	16K	--	--	--
Magistral Medium	Mistral	$2.00	$5.00	40K	16K	--	--	--
Pixtral Large	Mistral	$2.00	$6.00	128K	4K	--	--	--
Jamba 1.5 Large	AI21	$2.00	$8.00	256K	256K	--	--	--
Jamba 1.5 Large@001	AI21	$2.00	$8.00	256K	256K	--	--	--
Jamba Large 1.6	AI21	$2.00	$8.00	256K	256K	--	--	--
Jamba Large 1.7	AI21	$2.00	$8.00	256K	256K	--	--	--
GPT-5.4	OpenAI	$2.50	$15.00	1.1M	128K	94 tok/s	57	22.7
GPT-4o	OpenAI	$2.50	$10.00	128K	16K	167 tok/s	17	6.9
Command A	Cohere	$2.50	$10.00	128K	4K	36 tok/s	14	5.4
Command A 03 2025	Cohere	$2.50	$10.00	256K	8K	--	--	--
Command R Plus	Cohere	$2.50	$10.00	128K	4K	--	--	--
Command R Plus 08 2024	Cohere	$2.50	$10.00	128K	4K	--	--	--
Claude Sonnet 4.6 Adaptive	Anthropic	$3.00	$15.00	200K	64K	79 tok/s	52	17.2
Claude Sonnet 4.6	Anthropic	$3.00	$15.00	200K	64K	53 tok/s	44	14.8
Claude Sonnet 4.5	Anthropic	$3.00	$15.00	200K	64K	--	--	--
Claude Sonnet 4	Anthropic	$3.00	$15.00	200K	64K	50 tok/s	43	14.2
Claude 3.7 Sonnet	Anthropic	$3.00	$15.00	200K	64K	--	--	--
Grok 4	xAI	$3.00	$15.00	2.0M	16K	44 tok/s	42	13.8
Grok 3	xAI	$3.00	$15.00	131K	16K	--	--	--
Sonar Pro	Perplexity	$3.00	$15.00	128K	4K	--	15	5.1
Claude 3 7 Sonnet 20250219	Anthropic	$3.00	$15.00	200K	64K	--	--	--
Claude 4 Sonnet 20250514	Anthropic	$3.00	$15.00	1.0M	64K	--	--	--
Claude Sonnet 4 5	Anthropic	$3.00	$15.00	200K	64K	--	--	--
Claude Sonnet 4 5 20250929	Anthropic	$3.00	$15.00	200K	64K	--	--	--
Claude Sonnet 4 6	Anthropic	$3.00	$15.00	1.0M	64K	--	--	--
Claude Sonnet 4 20250514	Anthropic	$3.00	$15.00	1.0M	64K	--	--	--
GPT-5.5	OpenAI	$5.00	$30.00	1.1M	128K	--	--	--
Claude Opus 4.7	Anthropic	$5.00	$25.00	1.0M	128K	--	--	--
Claude Opus 4.6 Adaptive	Anthropic	$5.00	$25.00	200K	32K	52 tok/s	53	10.6
Claude Opus 4.6	Anthropic	$5.00	$25.00	200K	32K	46 tok/s	47	9.3
Claude Opus 4.5	Anthropic	$5.00	$25.00	200K	32K	58 tok/s	43	8.6
Grok 3 Fast	xAI	$5.00	$25.00	131K	16K	--	--	--
MAI-Image-2	Microsoft	$5.00	$33.00	32K	1K	--	--	--
Claude Opus 4 5 20251101	Anthropic	$5.00	$25.00	200K	64K	--	--	--
Claude Opus 4 5	Anthropic	$5.00	$25.00	200K	64K	--	--	--
Claude Opus 4 6	Anthropic	$5.00	$25.00	1.0M	128K	--	--	--
Claude Opus 4 6 20260205	Anthropic	$5.00	$25.00	1.0M	128K	--	--	--
Claude Opus 4 7	Anthropic	$5.00	$25.00	1.0M	128K	--	--	--
Claude Opus 4 7 20260416	Anthropic	$5.00	$25.00	1.0M	128K	--	--	--
o3 Deep Research	OpenAI	$10.00	$40.00	200K	100K	--	--	--
o1	OpenAI	$15.00	$60.00	200K	100K	108 tok/s	31	2.1
Claude Opus 4.1	Anthropic	$15.00	$75.00	200K	32K	--	--	--
Claude 3 Opus 20240229	Anthropic	$15.00	$75.00	200K	4K	--	--	--
Claude 4 Opus 20250514	Anthropic	$15.00	$75.00	200K	32K	--	--	--
Claude Opus 4 1	Anthropic	$15.00	$75.00	200K	32K	--	--	--
Claude Opus 4 1 20250805	Anthropic	$15.00	$75.00	200K	32K	--	--	--
Claude Opus 4 20250514	Anthropic	$15.00	$75.00	200K	32K	--	--	--
Voxtral TTS	Mistral	$16.00	< $0.01	128K	0K	--	--	--
o3-pro	OpenAI	$20.00	$80.00	200K	100K	--	--	--
Claude Mythos Preview	Anthropic	$25.00	$125	1.0M	32K	--	--	--
GPT-5.5 Pro	OpenAI	$30.00	$180	1.1M	128K	--	--	--
GPT-5.4 Pro	OpenAI	$30.00	$180	1.1M	128K	--	--	--
o1 Pro	OpenAI	$150	$600	200K	100K	--	--	--

Pricing as of March 2026. Prices for open-source models reflect hosted API providers. Always verify prices against each provider's official pricing page. Value = quality index / input cost per 1M tokens (higher is better).
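As a minimal sketch of the Value formula above, the snippet below recomputes the score for three rows copied from the table. The function name `value_score` is a hypothetical helper, not part of any tool on this page. Recomputed numbers land close to the listed ones but may not match exactly, presumably because the quality index shown in the table is rounded.

```python
def value_score(quality_index: float, input_price_per_1m: float) -> float:
    """Value = quality index / input cost per 1M tokens (higher is better)."""
    if input_price_per_1m <= 0:
        raise ValueError("input price must be positive")
    return quality_index / input_price_per_1m


# Rows copied from the table: (model, input $ per 1M, quality, listed value)
rows = [
    ("GPT-5 Nano", 0.050, 27, 536.0),
    ("Claude Haiku 4.5", 1.00, 31, 31.1),
    ("o1", 15.00, 31, 2.1),
]

for model, price, quality, listed in rows:
    recomputed = value_score(quality, price)
    print(f"{model}: recomputed {recomputed:.1f}, listed {listed}")
```

Because the formula uses only the input price, two models with the same Value can still differ a lot in total cost when their output prices differ.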

Speed & quality data by Artificial Analysis

How to Compare LLM API Pricing

1. Browse the pricing table
400+ models are listed with input and output pricing per million tokens, plus context window sizes and benchmark scores.
2. Sort and filter
Click any column header to sort by price, context window, or benchmark score. Use the provider filter to focus on specific vendors.
3. Evaluate price vs. performance
Quality and speed metrics from Artificial Analysis help you weigh cost against actual model capability.
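The same sort and filter steps can be reproduced offline on rows copied from the table. The sketch below assumes tab-separated rows in the column order shown in the table header. `ModelRow`, `parse_row`, and `filter_and_sort` are hypothetical names introduced for illustration, and the price parser handles only plain values such as `$0.050`.

```python
from dataclasses import dataclass
from typing import Callable, List, Optional


@dataclass
class ModelRow:
    model: str
    provider: str
    input_price: float      # $ per 1M input tokens
    output_price: float     # $ per 1M output tokens
    context_tokens: int
    quality: Optional[float]  # None when the table shows "--"


def parse_price(text: str) -> float:
    """Turn a cell like "$0.050" into 0.05."""
    return float(text.replace("$", "").replace(",", ""))


def parse_tokens(text: str) -> int:
    """Turn a cell like "131K" or "1.0M" into a token count."""
    units = {"K": 1_000, "M": 1_000_000}
    return round(float(text[:-1]) * units[text[-1]])


def parse_row(line: str) -> ModelRow:
    cols = line.split("\t")
    quality = None if cols[7] == "--" else float(cols[7])
    return ModelRow(cols[0], cols[1], parse_price(cols[2]),
                    parse_price(cols[3]), parse_tokens(cols[4]), quality)


def filter_and_sort(rows: List[ModelRow], provider: Optional[str],
                    key: Callable[[ModelRow], float],
                    reverse: bool = False) -> List[ModelRow]:
    """Keep rows for one provider (or all if None), then sort by key."""
    kept = [r for r in rows if provider is None or r.provider == provider]
    return sorted(kept, key=key, reverse=reverse)


# Four rows copied verbatim from the table above.
SAMPLE = [
    "GPT-5 Nano\tOpenAI\t$0.050\t$0.400\t128K\t16K\t148 tok/s\t27\t536.0",
    "Gemini 2.5 Flash-Lite\tGoogle\t$0.100\t$0.400\t1.0M\t66K\t268 tok/s\t13\t127.0",
    "GPT-4.1 Nano\tOpenAI\t$0.100\t$0.400\t1.0M\t33K\t126 tok/s\t13\t130.0",
    "DeepSeek V4-Flash\tDeepSeek\t$0.140\t$0.280\t1.0M\t384K\t--\t--\t--",
]

rows = [parse_row(line) for line in SAMPLE]
# OpenAI models only, largest context window first.
for r in filter_and_sort(rows, "OpenAI", key=lambda r: r.context_tokens, reverse=True):
    print(r.model, r.context_tokens, r.input_price)
```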

Why Use This Pricing Comparison

400+ models from 15+ providers in a single sortable table — no tab switching
Enriched with quality index and speed benchmarks from Artificial Analysis
Provider-colored badges for quick visual scanning across vendors
Context window and max output token data alongside pricing
Data sourced from official provider docs and the LiteLLM open-source project

Common Use Cases

Vendor selection

Compare pricing across all major providers before committing to an API integration. Sort by cost to find the cheapest option.
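Sorting by a single price column can hide differences, since input and output tokens are billed at separate rates. As a rough sketch, the snippet below estimates the cost of one request for three models that share the same input price in the table. The token counts (2,000 in, 500 out) are made-up workload assumptions, and `request_cost` is a hypothetical helper.

```python
def request_cost(input_tokens: int, output_tokens: int,
                 input_price_per_1m: float, output_price_per_1m: float) -> float:
    """Dollar cost of one request, given prices per 1 million tokens."""
    return (input_tokens * input_price_per_1m
            + output_tokens * output_price_per_1m) / 1_000_000


# (model, input $ per 1M, output $ per 1M), copied from the table.
candidates = [
    ("Ministral 3 8B", 0.150, 0.150),
    ("GPT-4o Mini", 0.150, 0.600),
    ("Command R7b 12 2024", 0.150, 0.038),
]

# Assumed workload: 2,000 input tokens and 500 output tokens per request.
INPUT_TOKENS, OUTPUT_TOKENS = 2_000, 500

ranked = sorted(
    candidates,
    key=lambda c: request_cost(INPUT_TOKENS, OUTPUT_TOKENS, c[1], c[2]),
)
for name, in_price, out_price in ranked:
    cost = request_cost(INPUT_TOKENS, OUTPUT_TOKENS, in_price, out_price)
    print(f"{name}: ${cost:.6f} per request")
```

All three models tie on input price, so the ranking here is driven entirely by the output rate and by how output-heavy the workload is.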

Cost optimization

Find cheaper alternatives to your current model by sorting on price and checking if the quality benchmark is close enough.
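One way to make "close enough" concrete is to set a maximum acceptable drop in the quality index and keep only cheaper models above that floor. The sketch below does this for rows copied from the table. The 2-point tolerance, the use of input price as the cost measure, and the names `Model` and `cheaper_alternatives` are all assumptions for illustration.

```python
from typing import List, NamedTuple, Optional


class Model(NamedTuple):
    name: str
    input_price: float        # $ per 1M input tokens
    quality: Optional[float]  # None when the table shows "--"


def cheaper_alternatives(current: Model, candidates: List[Model],
                         max_quality_drop: float) -> List[Model]:
    """Models cheaper than `current` whose quality is within the allowed drop."""
    if current.quality is None:
        raise ValueError("current model needs a quality score")
    floor = current.quality - max_quality_drop
    matches = [
        m for m in candidates
        if m.quality is not None          # skip models without benchmark data
        and m.input_price < current.input_price
        and m.quality >= floor
    ]
    return sorted(matches, key=lambda m: m.input_price)


# Values copied from the table above.
current = Model("GPT-4o", 2.50, 17)
candidates = [
    Model("Gemini 2.0 Flash", 0.100, 19),
    Model("GPT-4o Mini", 0.150, 13),
    Model("Llama 4 Maverick", 0.270, 18),
    Model("Devstral Small", 0.100, None),
    Model("Gemini 3.1 Pro", 2.00, 57),
    Model("Claude Sonnet 4.6", 3.00, 44),
]

for m in cheaper_alternatives(current, candidates, max_quality_drop=2):
    print(f"{m.name}: ${m.input_price:.3f} input, quality {m.quality}")
```

Models with no quality score are skipped rather than treated as zero, since a missing benchmark says nothing about capability.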

Technical evaluation

Use the quality index and speed metrics to shortlist models, then compare their pricing to make a final decision.

Model comparison

Use the compare tool to see side-by-side pricing and specs for any two models.

Related Tools

Token Counter

Count tokens and see per-request costs for any text.

Cost Calculator

Estimate monthly API spend based on usage.

Compare Models

Side-by-side pricing and specs for two models.

LLM Leaderboard

Rankings by quality, speed, and value.

2026 LLM API Price Comparison: Free Online Tool | TokenCost