API Pricing
Every major LLM provider in one table, sortable and filterable. Prices are per 1 million tokens.
| Model | Provider | Input / 1M | Output / 1M | Context | Max Output | Speed | Quality | Value |
|---|---|---|---|---|---|---|---|---|
| GPT-OSS 120B | OpenAI | $0.039 | $0.190 | 131K | 33K | -- | -- | -- |
| GPT-5 Nano | OpenAI | $0.050 | $0.400 | 128K | 16K | 141 tok/s | 27 | 536.0 |
| Qwen3.5-9B | Alibaba | $0.050 | $0.150 | 262K | 33K | -- | -- | -- |
| Qwen3.5-Omni Flash | Alibaba | $0.065 | $0.260 | 262K | 33K | -- | -- | -- |
| Hunyuan HY3 Preview | Tencent | $0.066 | $0.260 | 262K | 33K | -- | -- | -- |
| GPT-OSS 20B (Bedrock) | OpenAI | $0.070 | $0.300 | 16K | 16K | -- | -- | -- |
| GLM-4.7-flash | Zhipu | $0.070 | $0.400 | 200K | 8K | -- | -- | -- |
| GPT-OSS 20B | OpenAI | $0.075 | $0.300 | 131K | 33K | -- | -- | -- |
| Gemini 2.0 Flash-Lite | Google | $0.075 | $0.300 | 1.0M | 8K | -- | 15 | 193.3 |
| Llama 4 Scout | Meta | $0.080 | $0.300 | 1.0M | 16K | 136 tok/s | 14 | 168.8 |
| Qwen3-Next-80B-A3B-Thinking | Alibaba | $0.098 | $0.780 | 262K | 33K | -- | -- | -- |
| GPT-4.1 Nano | OpenAI | $0.100 | $0.400 | 1.0M | 33K | 127 tok/s | 13 | 130.0 |
| Gemini 2.5 Flash-Lite | Google | $0.100 | $0.400 | 1.0M | 66K | 263 tok/s | 13 | 127.0 |
| Gemini 2.0 Flash | Google | $0.100 | $0.400 | 1.0M | 8K | -- | 19 | 185.0 |
| Devstral Small | Mistral | $0.100 | $0.300 | 256K | 33K | -- | -- | -- |
| Mistral Small 3.2 | Mistral | $0.100 | $0.300 | 128K | 4K | 147 tok/s | 10 | 102.0 |
| Gemma 4 26B A4B | Google | $0.130 | $0.400 | 262K | 33K | -- | -- | -- |
| Gemma 4 31B | Google | $0.140 | $0.400 | 262K | 33K | -- | -- | -- |
| DeepSeek V4-Flash | DeepSeek | $0.140 | $0.280 | 1.0M | 384K | -- | -- | -- |
| GPT-OSS 120B (Bedrock) | OpenAI | $0.150 | $0.600 | 16K | 16K | -- | -- | -- |
| GPT-4o Mini | OpenAI | $0.150 | $0.600 | 128K | 16K | 80 tok/s | 13 | 84.0 |
| Mistral Small 4 | Mistral | $0.150 | $0.600 | 256K | 33K | -- | -- | -- |
| Ministral 3 8B | Mistral | $0.150 | $0.150 | 262K | 33K | -- | -- | -- |
| Pixtral 12B | Mistral | $0.150 | $0.150 | 128K | 4K | -- | -- | -- |
| Command R | Cohere | $0.150 | $0.600 | 128K | 4K | -- | -- | -- |
| Command R 08 2024 | Cohere | $0.150 | $0.600 | 128K | 4K | -- | -- | -- |
| Command R7b 12 2024 | Cohere | $0.150 | $0.038 | 128K | 4K | -- | -- | -- |
| Llama 3.3 70B | Meta | $0.180 | $0.180 | 131K | 4K | -- | -- | -- |
| GPT-5.4 Nano | OpenAI | $0.200 | $1.25 | 400K | 128K | -- | -- | -- |
| Grok 4.1 Fast | xAI | $0.200 | $0.500 | 2.0M | 16K | 78 tok/s | 24 | 118.0 |
| Grok 4.1 Fast Reasoning | xAI | $0.200 | $0.500 | 2.0M | 16K | 95 tok/s | 39 | 193.0 |
| Grok Code Fast | xAI | $0.200 | $1.50 | 256K | 16K | -- | -- | -- |
| Ministral 3 14B | Mistral | $0.200 | $0.200 | 262K | 33K | -- | -- | -- |
| Jamba 1.5 | AI21 | $0.200 | $0.400 | 256K | 256K | -- | -- | -- |
| Jamba 1.5 Mini | AI21 | $0.200 | $0.400 | 256K | 256K | -- | -- | -- |
| Jamba 1.5 Mini@001 | AI21 | $0.200 | $0.400 | 256K | 256K | -- | -- | -- |
| Jamba Mini 1.6 | AI21 | $0.200 | $0.400 | 256K | 256K | -- | -- | -- |
| Jamba Mini 1.7 | AI21 | $0.200 | $0.400 | 256K | 256K | -- | -- | -- |
| GPT-5 Mini | OpenAI | $0.250 | $2.00 | 400K | 16K | 103 tok/s | 41 | 164.8 |
| Gemini 3.1 Flash-Lite | Google | $0.250 | $1.50 | 1.0M | 66K | 317 tok/s | 34 | 134.0 |
| Claude 3 Haiku 20240307 | Anthropic | $0.250 | $1.25 | 200K | 4K | -- | -- | -- |
| Llama 4 Maverick | Meta | $0.270 | $0.850 | 1.0M | 16K | 113 tok/s | 18 | 68.1 |
| Qwen3.6-Plus | Alibaba | $0.276 | $1.65 | 1.0M | 33K | -- | -- | -- |
| DeepSeek V3.2 (Chat) | DeepSeek | $0.280 | $0.420 | 128K | 8K | -- | 32 | 114.6 |
| DeepSeek V3.2 (Reasoner) | DeepSeek | $0.280 | $0.420 | 128K | 8K | -- | -- | -- |
| DeepSeek Reasoner | DeepSeek | $0.280 | $0.420 | 131K | 66K | -- | -- | -- |
| Gemini 2.5 Flash | Google | $0.300 | $2.50 | 1.0M | 66K | 215 tok/s | 21 | 68.7 |
| Grok 3 Mini | xAI | $0.300 | $0.500 | 131K | 16K | -- | -- | -- |
| Codestral | Mistral | $0.300 | $0.900 | 256K | 33K | -- | -- | -- |
| Qwen 3.5 27B | Alibaba | $0.300 | $2.40 | 128K | 33K | -- | -- | -- |
| Nova 2.0 Lite | Amazon | $0.300 | $2.50 | 1.0M | 64K | 228 tok/s | 18 | 60.0 |
| Nemotron 3 Super 120B | NVIDIA | $0.300 | $0.800 | 1.0M | 33K | 157 tok/s | 36 | 120.0 |
| MiniMax M2.5 | MiniMax | $0.300 | $1.20 | 128K | 33K | 94 tok/s | 42 | 139.7 |
| Command Light | Cohere | $0.300 | $0.600 | 4K | 4K | -- | -- | -- |
| GPT-4.1 Mini | OpenAI | $0.400 | $1.60 | 1.0M | 33K | 79 tok/s | 23 | 57.2 |
| Mistral Medium | Mistral | $0.400 | $2.00 | 131K | 16K | -- | -- | -- |
| Devstral | Mistral | $0.400 | $2.00 | 256K | 33K | -- | -- | -- |
| Qwen3.5-Omni Plus | Alibaba | $0.400 | $4.80 | 262K | 33K | -- | -- | -- |
| Gemini 3 Flash | Google | $0.500 | $3.00 | 1.0M | 66K | 198 tok/s | 35 | 70.0 |
| Gemini 3 Flash Reasoning | Google | $0.500 | $3.00 | 1.0M | 66K | 205 tok/s | 46 | 92.8 |
| Mistral Large 3 | Mistral | $0.500 | $1.50 | 262K | 8K | 56 tok/s | 23 | 45.6 |
| Magistral Small | Mistral | $0.500 | $1.50 | 40K | 16K | -- | -- | -- |
| DeepSeek R1 | DeepSeek | $0.550 | $2.19 | 128K | 33K | -- | 27 | 49.3 |
| Grok 3 Mini Fast | xAI | $0.600 | $4.00 | 131K | 16K | -- | -- | -- |
| Qwen 3.5 397B | Alibaba | $0.600 | $3.60 | 128K | 33K | -- | -- | -- |
| Kimi K2.5 | Moonshot | $0.600 | $3.00 | 262K | 33K | 47 tok/s | 47 | 78.0 |
| Kimi K2 Thinking | Moonshot | $0.600 | $2.50 | 262K | 33K | -- | -- | -- |
| GLM-4.7 | Zhipu | $0.600 | $2.20 | 200K | 128K | -- | -- | -- |
| GPT-5.4 Mini | OpenAI | $0.750 | $4.50 | 400K | 128K | -- | -- | -- |
| Gemini 3.1 Flash Live | Google | $0.750 | $4.50 | 1.0M | 66K | -- | -- | -- |
| Claude Haiku 3.5 | Anthropic | $0.800 | $4.00 | 200K | 8K | -- | 19 | 23.4 |
| QwQ-Plus | Alibaba | $0.800 | $2.40 | 131K | 16K | -- | -- | -- |
| Kimi K2.6 | Moonshot | $0.950 | $4.00 | 262K | 33K | -- | -- | -- |
| Claude Haiku 4.5 | Anthropic | $1.00 | $5.00 | 200K | 8K | 103 tok/s | 31 | 31.1 |
| Claude 4.5 Haiku Reasoning | Anthropic | $1.00 | $5.00 | 200K | 8K | 139 tok/s | 37 | 37.1 |
| Gemini 3.1 Flash TTS | Google | $1.00 | $20.00 | 32K | 32K | -- | -- | -- |
| GLM-5 | Zhipu | $1.00 | $3.20 | 128K | 33K | 72 tok/s | 50 | 49.8 |
| MiMo-V2-Pro | Xiaomi | $1.00 | $3.00 | 1.0M | 131K | -- | -- | -- |
| Claude Haiku 4 5 20251001 | Anthropic | $1.00 | $5.00 | 200K | 64K | -- | -- | -- |
| Claude Haiku 4 5 | Anthropic | $1.00 | $5.00 | 200K | 64K | -- | -- | -- |
| o4 Mini | OpenAI | $1.10 | $4.40 | 200K | 100K | 168 tok/s | 33 | 30.1 |
| o3 Mini | OpenAI | $1.10 | $4.40 | 200K | 100K | 158 tok/s | 26 | 23.5 |
| Kimi K2 Thinking Turbo | Moonshot | $1.15 | $8.00 | 262K | 33K | -- | -- | -- |
| GLM-5 Turbo | Zhipu | $1.20 | $4.00 | 200K | 128K | -- | -- | -- |
| GPT-5.1 | OpenAI | $1.25 | $10.00 | 400K | 16K | 150 tok/s | 48 | 38.2 |
| GPT-5 | OpenAI | $1.25 | $10.00 | 400K | 16K | 95 tok/s | 45 | 35.7 |
| GPT-5 Medium | OpenAI | $1.25 | $10.00 | 400K | 16K | 85 tok/s | 42 | 33.6 |
| Gemini 2.5 Pro | Google | $1.25 | $10.00 | 1.0M | 66K | 136 tok/s | 35 | 27.7 |
| Grok 4.3 | xAI | $1.25 | $2.50 | 1.0M | 16K | -- | -- | -- |
| Nova 2.0 Pro Reasoning | Amazon | $1.25 | $10.00 | 128K | 33K | 155 tok/s | 36 | 28.6 |
| GLM-5.1 | Zhipu | $1.40 | $4.40 | 200K | 128K | -- | -- | -- |
| Mistral Medium 3.5 | Mistral | $1.50 | $7.50 | 256K | 33K | -- | -- | -- |
| DeepSeek V4-Pro | DeepSeek | $1.74 | $3.48 | 1.0M | 384K | -- | -- | -- |
| GPT-5.2 | OpenAI | $1.75 | $14.00 | 400K | 16K | 79 tok/s | 51 | 29.3 |
| GPT-5.3 Codex | OpenAI | $1.75 | $14.00 | 400K | 33K | 98 tok/s | 54 | 30.6 |
| GPT-4.1 | OpenAI | $2.00 | $8.00 | 1.0M | 33K | 123 tok/s | 26 | 13.2 |
| o3 | OpenAI | $2.00 | $8.00 | 200K | 100K | 113 tok/s | 38 | 19.2 |
| o4 Mini Deep Research | OpenAI | $2.00 | $8.00 | 200K | 100K | -- | -- | -- |
| Gemini 3.1 Pro | Google | $2.00 | $12.00 | 1.0M | 66K | 147 tok/s | 57 | 28.6 |
| Gemini 3 Pro | Google | $2.00 | $12.00 | 1.0M | 66K | 141 tok/s | 48 | 24.2 |
| Grok 4.20 | xAI | $2.00 | $6.00 | 2.0M | 16K | 94 tok/s | 49 | 24.6 |
| Grok 2 | xAI | $2.00 | $10.00 | 131K | 16K | -- | -- | -- |
| Magistral Medium | Mistral | $2.00 | $5.00 | 40K | 16K | -- | -- | -- |
| Pixtral Large | Mistral | $2.00 | $6.00 | 128K | 4K | -- | -- | -- |
| Jamba 1.5 Large | AI21 | $2.00 | $8.00 | 256K | 256K | -- | -- | -- |
| Jamba 1.5 Large@001 | AI21 | $2.00 | $8.00 | 256K | 256K | -- | -- | -- |
| Jamba Large 1.6 | AI21 | $2.00 | $8.00 | 256K | 256K | -- | -- | -- |
| Jamba Large 1.7 | AI21 | $2.00 | $8.00 | 256K | 256K | -- | -- | -- |
| GPT-5.4 | OpenAI | $2.50 | $15.00 | 1.1M | 128K | 95 tok/s | 57 | 22.7 |
| GPT-4o | OpenAI | $2.50 | $10.00 | 128K | 16K | 149 tok/s | 17 | 6.9 |
| Command A | Cohere | $2.50 | $10.00 | 128K | 4K | 34 tok/s | 14 | 5.4 |
| Command A 03 2025 | Cohere | $2.50 | $10.00 | 256K | 8K | -- | -- | -- |
| Command R Plus | Cohere | $2.50 | $10.00 | 128K | 4K | -- | -- | -- |
| Command R Plus 08 2024 | Cohere | $2.50 | $10.00 | 128K | 4K | -- | -- | -- |
| Claude Sonnet 4.6 Adaptive | Anthropic | $3.00 | $15.00 | 200K | 64K | 69 tok/s | 52 | 17.2 |
| Claude Sonnet 4.6 | Anthropic | $3.00 | $15.00 | 200K | 64K | 51 tok/s | 44 | 14.8 |
| Claude Sonnet 4.5 | Anthropic | $3.00 | $15.00 | 200K | 64K | -- | -- | -- |
| Claude Sonnet 4 | Anthropic | $3.00 | $15.00 | 200K | 64K | 57 tok/s | 43 | 14.2 |
| Claude 3.7 Sonnet | Anthropic | $3.00 | $15.00 | 200K | 64K | -- | -- | -- |
| Grok 4 | xAI | $3.00 | $15.00 | 2.0M | 16K | 46 tok/s | 42 | 13.8 |
| Grok 3 | xAI | $3.00 | $15.00 | 131K | 16K | -- | -- | -- |
| Sonar Pro | Perplexity | $3.00 | $15.00 | 128K | 4K | -- | 15 | 5.1 |
| Claude 3 7 Sonnet 20250219 | Anthropic | $3.00 | $15.00 | 200K | 64K | -- | -- | -- |
| Claude 4 Sonnet 20250514 | Anthropic | $3.00 | $15.00 | 1.0M | 64K | -- | -- | -- |
| Claude Sonnet 4 5 | Anthropic | $3.00 | $15.00 | 200K | 64K | -- | -- | -- |
| Claude Sonnet 4 5 20250929 | Anthropic | $3.00 | $15.00 | 200K | 64K | -- | -- | -- |
| Claude Sonnet 4 6 | Anthropic | $3.00 | $15.00 | 1.0M | 64K | -- | -- | -- |
| Claude Sonnet 4 20250514 | Anthropic | $3.00 | $15.00 | 1.0M | 64K | -- | -- | -- |
| GPT-5.5 | OpenAI | $5.00 | $30.00 | 1.1M | 128K | -- | -- | -- |
| Claude Opus 4.7 | Anthropic | $5.00 | $25.00 | 1.0M | 128K | -- | -- | -- |
| Claude Opus 4.6 Adaptive | Anthropic | $5.00 | $25.00 | 200K | 32K | 53 tok/s | 53 | 10.6 |
| Claude Opus 4.6 | Anthropic | $5.00 | $25.00 | 200K | 32K | 44 tok/s | 47 | 9.3 |
| Claude Opus 4.5 | Anthropic | $5.00 | $25.00 | 200K | 32K | 59 tok/s | 43 | 8.6 |
| Grok 3 Fast | xAI | $5.00 | $25.00 | 131K | 16K | -- | -- | -- |
| MAI-Image-2 | Microsoft | $5.00 | $33.00 | 32K | 1K | -- | -- | -- |
| Claude Opus 4 5 20251101 | Anthropic | $5.00 | $25.00 | 200K | 64K | -- | -- | -- |
| Claude Opus 4 5 | Anthropic | $5.00 | $25.00 | 200K | 64K | -- | -- | -- |
| Claude Opus 4 6 | Anthropic | $5.00 | $25.00 | 1.0M | 128K | -- | -- | -- |
| Claude Opus 4 6 20260205 | Anthropic | $5.00 | $25.00 | 1.0M | 128K | -- | -- | -- |
| Claude Opus 4 7 | Anthropic | $5.00 | $25.00 | 1.0M | 128K | -- | -- | -- |
| Claude Opus 4 7 20260416 | Anthropic | $5.00 | $25.00 | 1.0M | 128K | -- | -- | -- |
| o3 Deep Research | OpenAI | $10.00 | $40.00 | 200K | 100K | -- | -- | -- |
| o1 | OpenAI | $15.00 | $60.00 | 200K | 100K | 102 tok/s | 31 | 2.1 |
| Claude Opus 4.1 | Anthropic | $15.00 | $75.00 | 200K | 32K | -- | -- | -- |
| Claude 3 Opus 20240229 | Anthropic | $15.00 | $75.00 | 200K | 4K | -- | -- | -- |
| Claude 4 Opus 20250514 | Anthropic | $15.00 | $75.00 | 200K | 32K | -- | -- | -- |
| Claude Opus 4 1 | Anthropic | $15.00 | $75.00 | 200K | 32K | -- | -- | -- |
| Claude Opus 4 1 20250805 | Anthropic | $15.00 | $75.00 | 200K | 32K | -- | -- | -- |
| Claude Opus 4 20250514 | Anthropic | $15.00 | $75.00 | 200K | 32K | -- | -- | -- |
| Voxtral TTS | Mistral | $16.00 | < $0.01 | 128K | 0K | -- | -- | -- |
| o3-pro | OpenAI | $20.00 | $80.00 | 200K | 100K | -- | -- | -- |
| Claude Mythos Preview | Anthropic | $25.00 | $125 | 1.0M | 32K | -- | -- | -- |
| GPT-5.5 Pro | OpenAI | $30.00 | $180 | 1.1M | 128K | -- | -- | -- |
| GPT-5.4 Pro | OpenAI | $30.00 | $180 | 1.1M | 128K | -- | -- | -- |
| o1 Pro | OpenAI | $150 | $600 | 200K | 100K | -- | -- | -- |
Pricing as of March 2026. Open-source model prices reflect hosted API providers. Always verify with official pages. Value = quality index / input cost per 1M tokens (higher is better).
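As a quick check on the Value column, the formula in the note above can be sketched in a few lines. The function name `value_score` is ours, not part of the site; the inputs come from the GPT-4.1 Nano and o1 rows.

```python
def value_score(quality_index: float, input_price_per_1m: float) -> float:
    """Value = quality index / input cost per 1M tokens (higher is better)."""
    if input_price_per_1m <= 0:
        raise ValueError("input price per 1M tokens must be positive")
    return quality_index / input_price_per_1m


# GPT-4.1 Nano row: quality 13, input $0.100 per 1M tokens.
print(round(value_score(13, 0.100), 1))  # 130.0

# o1 row: quality 31, input $15.00 per 1M tokens.
print(round(value_score(31, 15.00), 1))  # 2.1
```

Some rows differ slightly from this calculation (GPT-5 Nano lists 536.0, while 27 / 0.050 gives 540), presumably because the Quality column shows a rounded index.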
Speed & quality data by Artificial Analysis
How to Compare LLM API Pricing
1. Browse the pricing table
   400+ models are listed with input and output pricing per million tokens, plus context window sizes and benchmark scores.
2. Sort and filter
   Click any column header to sort by price, context window, or benchmark score. Use the provider filter to focus on specific vendors.
3. Evaluate price vs. performance
   Quality and speed metrics from Artificial Analysis help you weigh cost against actual model capability.
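If you copy rows out of the table, the sort-and-filter step can be reproduced offline. The sketch below is one possible shape, not the site's implementation: the `Model` class and `filter_and_sort` helper are hypothetical names, and the context sizes are the table's rounded figures (128K, 1.0M) written out as integers.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Model:
    name: str
    provider: str
    input_per_1m: float   # USD per 1M input tokens
    output_per_1m: float  # USD per 1M output tokens
    context_tokens: int   # rounded context window from the table


# A few rows copied from the table above.
ROWS = [
    Model("GPT-5 Nano", "OpenAI", 0.050, 0.400, 128_000),
    Model("Llama 4 Scout", "Meta", 0.080, 0.300, 1_000_000),
    Model("GPT-4.1 Nano", "OpenAI", 0.100, 0.400, 1_000_000),
    Model("Mistral Small 3.2", "Mistral", 0.100, 0.300, 128_000),
]


def filter_and_sort(rows, provider=None, key="input_per_1m", descending=False):
    """Keep rows for one provider (or all if None), then sort by one column."""
    selected = [r for r in rows if provider is None or r.provider == provider]
    return sorted(selected, key=lambda r: getattr(r, key), reverse=descending)


# OpenAI models only, largest context window first.
by_context = filter_and_sort(ROWS, provider="OpenAI", key="context_tokens", descending=True)
print([m.name for m in by_context])  # ['GPT-4.1 Nano', 'GPT-5 Nano']

# All providers, cheapest output price first (ties keep table order).
by_output = filter_and_sort(ROWS, key="output_per_1m")
print(by_output[0].name)  # Llama 4 Scout
```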
Why Use This Pricing Comparison
- 400+ models from 15+ providers in a single sortable table, with no tab switching
- Enriched with quality index and speed benchmarks from Artificial Analysis
- Provider-colored badges for quick visual scanning across vendors
- Context window and max output token data alongside pricing
- Data sourced from official provider docs and the LiteLLM open-source project
Common Use Cases
Vendor selection
Compare pricing across all major providers before committing to an API integration. Sort by cost to find the cheapest option.
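Because prices are quoted per 1 million tokens, comparing vendors for a real workload means scaling by your own token counts. The following is a minimal sketch; `request_cost` is a hypothetical helper, and the 2,000-input / 500-output request size is an assumed workload, not a figure from the table.

```python
def request_cost(input_tokens: int, output_tokens: int,
                 input_per_1m: float, output_per_1m: float) -> float:
    """Cost in USD of one request, given prices per 1M tokens."""
    return (input_tokens * input_per_1m + output_tokens * output_per_1m) / 1_000_000


# (input price, output price) per 1M tokens, copied from the table.
PRICES = {
    "GPT-4o Mini": (0.150, 0.600),
    "Claude Haiku 4.5": (1.00, 5.00),
}

# Assumed workload: 2,000 input tokens and 500 output tokens per request.
for name, (inp, out) in PRICES.items():
    cost = request_cost(2_000, 500, inp, out)
    print(f"{name}: ${cost:.6f} per request")
# GPT-4o Mini: $0.000600 per request
# Claude Haiku 4.5: $0.004500 per request
```

Output tokens are priced several times higher than input tokens for most models in the table, so workloads that generate long responses shift the comparison toward the output column.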
Cost optimization
Find cheaper alternatives to your current model by sorting on price and checking if the quality benchmark is close enough.
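One way to make "close enough" concrete is to set a maximum acceptable drop in the quality index and keep only cheaper models within it. This is an illustrative sketch: `cheaper_alternatives` and the 3-point tolerance are assumptions, and it compares input price only.

```python
def cheaper_alternatives(current, candidates, max_quality_drop=3):
    """Return candidates with a lower input price whose quality index is
    at most `max_quality_drop` points below the current model's."""
    floor = current["quality"] - max_quality_drop
    keep = [
        c for c in candidates
        if c["input"] < current["input"] and c["quality"] >= floor
    ]
    return sorted(keep, key=lambda c: c["input"])


# Rows copied from the table: input price per 1M tokens and quality index.
current = {"name": "GPT-4.1", "input": 2.00, "quality": 26}
candidates = [
    {"name": "GPT-5 Nano", "input": 0.050, "quality": 27},
    {"name": "GPT-4o Mini", "input": 0.150, "quality": 13},
    {"name": "GPT-4.1 Mini", "input": 0.400, "quality": 23},
    {"name": "o3 Mini", "input": 1.10, "quality": 26},
    {"name": "GPT-4o", "input": 2.50, "quality": 17},
]

print([c["name"] for c in cheaper_alternatives(current, candidates)])
# ['GPT-5 Nano', 'GPT-4.1 Mini', 'o3 Mini']
```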
Technical evaluation
Use the quality index and speed metrics to shortlist models, then compare their pricing to make a final decision.
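The shortlist-then-price workflow can be sketched as a threshold filter followed by a sort. The `Row` type, the `shortlist` helper, and the thresholds below are hypothetical; rows that show "--" for speed or quality are represented as `None` and excluded, since they cannot be compared.

```python
from typing import NamedTuple, Optional


class Row(NamedTuple):
    name: str
    input_per_1m: float            # USD per 1M input tokens
    speed_tok_s: Optional[float]   # None where the table shows "--"
    quality: Optional[float]       # None where the table shows "--"


# Rows copied from the table above.
ROWS = [
    Row("Gemini 3.1 Pro", 2.00, 147, 57),
    Row("GPT-5.3 Codex", 1.75, 98, 54),
    Row("GLM-5", 1.00, 72, 50),
    Row("Kimi K2.5", 0.600, 47, 47),
    Row("DeepSeek V4-Pro", 1.74, None, None),
]


def shortlist(rows, min_quality, min_speed):
    """Keep rows that meet both thresholds, cheapest input price first."""
    kept = [
        r for r in rows
        if r.quality is not None and r.speed_tok_s is not None
        and r.quality >= min_quality and r.speed_tok_s >= min_speed
    ]
    return sorted(kept, key=lambda r: r.input_per_1m)


print([r.name for r in shortlist(ROWS, min_quality=50, min_speed=70)])
# ['GLM-5', 'GPT-5.3 Codex', 'Gemini 3.1 Pro']
```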
Model comparison
Use the compare tool to see side-by-side pricing and specs for any two models.