API Pricing
Every major LLM provider in one table. Sortable, filterable. Prices per 1 million tokens.
| Model | Provider | Input / 1M | Output / 1M | Context | Max Output | Speed | Quality | Value |
|---|---|---|---|---|---|---|---|---|
| GPT-OSS 120B | OpenAI | $0.039 | $0.190 | 131K | 33K | -- | -- | -- |
| GPT-5 Nano | OpenAI | $0.050 | $0.400 | 128K | 16K | 148 tok/s | 27 | 536.0 |
| Qwen3.5-9B | Alibaba | $0.050 | $0.150 | 262K | 33K | -- | -- | -- |
| Qwen3.5-Omni Flash | Alibaba | $0.065 | $0.260 | 262K | 33K | -- | -- | -- |
| Hunyuan HY3 Preview | Tencent | $0.066 | $0.260 | 262K | 33K | -- | -- | -- |
| GPT-OSS 20B (Bedrock) | OpenAI | $0.070 | $0.300 | 16K | 16K | -- | -- | -- |
| GLM-4.7-flash | Zhipu | $0.070 | $0.400 | 200K | 8K | -- | -- | -- |
| GPT-OSS 20B | OpenAI | $0.075 | $0.300 | 131K | 33K | -- | -- | -- |
| Gemini 2.0 Flash-Lite | Google | $0.075 | $0.300 | 1.0M | 8K | -- | 15 | 193.3 |
| Llama 4 Scout | Meta | $0.080 | $0.300 | 1.0M | 16K | 134 tok/s | 14 | 168.8 |
| Qwen3-Next-80B-A3B-Thinking | Alibaba | $0.098 | $0.780 | 262K | 33K | -- | -- | -- |
| GPT-4.1 Nano | OpenAI | $0.100 | $0.400 | 1.0M | 33K | 126 tok/s | 13 | 130.0 |
| Gemini 2.5 Flash-Lite | Google | $0.100 | $0.400 | 1.0M | 66K | 268 tok/s | 13 | 127.0 |
| Gemini 2.0 Flash | Google | $0.100 | $0.400 | 1.0M | 8K | -- | 19 | 185.0 |
| Devstral Small | Mistral | $0.100 | $0.300 | 256K | 33K | -- | -- | -- |
| Mistral Small 3.2 | Mistral | $0.100 | $0.300 | 128K | 4K | 143 tok/s | 10 | 102.0 |
| Gemma 4 26B A4B | Google | $0.130 | $0.400 | 262K | 33K | -- | -- | -- |
| Gemma 4 31B | Google | $0.140 | $0.400 | 262K | 33K | -- | -- | -- |
| DeepSeek V4-Flash | DeepSeek | $0.140 | $0.280 | 1.0M | 384K | -- | -- | -- |
| GPT-OSS 120B (Bedrock) | OpenAI | $0.150 | $0.600 | 16K | 16K | -- | -- | -- |
| GPT-4o Mini | OpenAI | $0.150 | $0.600 | 128K | 16K | 75 tok/s | 13 | 84.0 |
| Mistral Small 4 | Mistral | $0.150 | $0.600 | 256K | 33K | -- | -- | -- |
| Ministral 3 8B | Mistral | $0.150 | $0.150 | 262K | 33K | -- | -- | -- |
| Pixtral 12B | Mistral | $0.150 | $0.150 | 128K | 4K | -- | -- | -- |
| Command R | Cohere | $0.150 | $0.600 | 128K | 4K | -- | -- | -- |
| Command R 08 2024 | Cohere | $0.150 | $0.600 | 128K | 4K | -- | -- | -- |
| Command R7b 12 2024 | Cohere | $0.150 | $0.038 | 128K | 4K | -- | -- | -- |
| Llama 3.3 70B | Meta | $0.180 | $0.180 | 131K | 4K | -- | -- | -- |
| GPT-5.4 Nano | OpenAI | $0.200 | $1.25 | 400K | 128K | -- | -- | -- |
| Grok 4.1 Fast | xAI | $0.200 | $0.500 | 2.0M | 16K | 75 tok/s | 24 | 118.0 |
| Grok 4.1 Fast Reasoning | xAI | $0.200 | $0.500 | 2.0M | 16K | 88 tok/s | 39 | 193.0 |
| Grok Code Fast | xAI | $0.200 | $1.50 | 256K | 16K | -- | -- | -- |
| Ministral 3 14B | Mistral | $0.200 | $0.200 | 262K | 33K | -- | -- | -- |
| Jamba 1.5 | AI21 | $0.200 | $0.400 | 256K | 256K | -- | -- | -- |
| Jamba 1.5 Mini | AI21 | $0.200 | $0.400 | 256K | 256K | -- | -- | -- |
| Jamba 1.5 Mini@001 | AI21 | $0.200 | $0.400 | 256K | 256K | -- | -- | -- |
| Jamba Mini 1.6 | AI21 | $0.200 | $0.400 | 256K | 256K | -- | -- | -- |
| Jamba Mini 1.7 | AI21 | $0.200 | $0.400 | 256K | 256K | -- | -- | -- |
| GPT-5 Mini | OpenAI | $0.250 | $2.00 | 400K | 16K | 85 tok/s | 41 | 164.8 |
| Gemini 3.1 Flash-Lite | Google | $0.250 | $1.50 | 1.0M | 66K | 350 tok/s | 34 | 134.0 |
| Claude 3 Haiku 20240307 | Anthropic | $0.250 | $1.25 | 200K | 4K | -- | -- | -- |
| Llama 4 Maverick | Meta | $0.270 | $0.850 | 1.0M | 16K | 113 tok/s | 18 | 68.1 |
| Qwen3.6-Plus | Alibaba | $0.276 | $1.65 | 1.0M | 33K | -- | -- | -- |
| DeepSeek V3.2 (Chat) | DeepSeek | $0.280 | $0.420 | 128K | 8K | -- | 32 | 114.6 |
| DeepSeek V3.2 (Reasoner) | DeepSeek | $0.280 | $0.420 | 128K | 8K | -- | -- | -- |
| DeepSeek Reasoner | DeepSeek | $0.280 | $0.420 | 131K | 66K | -- | -- | -- |
| Gemini 2.5 Flash | Google | $0.300 | $2.50 | 1.0M | 66K | 215 tok/s | 21 | 68.7 |
| Grok 3 Mini | xAI | $0.300 | $0.500 | 131K | 16K | -- | -- | -- |
| Codestral | Mistral | $0.300 | $0.900 | 256K | 33K | -- | -- | -- |
| Qwen 3.5 27B | Alibaba | $0.300 | $2.40 | 128K | 33K | -- | -- | -- |
| Nova 2.0 Lite | Amazon | $0.300 | $2.50 | 1.0M | 64K | 229 tok/s | 18 | 60.0 |
| Nemotron 3 Super 120B | NVIDIA | $0.300 | $0.800 | 1.0M | 33K | 159 tok/s | 36 | 120.0 |
| MiniMax M2.5 | MiniMax | $0.300 | $1.20 | 128K | 33K | 86 tok/s | 42 | 139.7 |
| Command Light | Cohere | $0.300 | $0.600 | 4K | 4K | -- | -- | -- |
| GPT-4.1 Mini | OpenAI | $0.400 | $1.60 | 1.0M | 33K | 79 tok/s | 23 | 57.2 |
| Mistral Medium | Mistral | $0.400 | $2.00 | 131K | 16K | -- | -- | -- |
| Devstral | Mistral | $0.400 | $2.00 | 256K | 33K | -- | -- | -- |
| Qwen3.5-Omni Plus | Alibaba | $0.400 | $4.80 | 262K | 33K | -- | -- | -- |
| Gemini 3 Flash | Google | $0.500 | $3.00 | 1.0M | 66K | 204 tok/s | 35 | 70.0 |
| Gemini 3 Flash Reasoning | Google | $0.500 | $3.00 | 1.0M | 66K | 203 tok/s | 46 | 92.8 |
| Mistral Large 3 | Mistral | $0.500 | $1.50 | 262K | 8K | 55 tok/s | 23 | 45.6 |
| Magistral Small | Mistral | $0.500 | $1.50 | 40K | 16K | -- | -- | -- |
| DeepSeek R1 | DeepSeek | $0.550 | $2.19 | 128K | 33K | -- | 27 | 49.3 |
| Grok 3 Mini Fast | xAI | $0.600 | $4.00 | 131K | 16K | -- | -- | -- |
| Qwen 3.5 397B | Alibaba | $0.600 | $3.60 | 128K | 33K | -- | -- | -- |
| Kimi K2.5 | Moonshot | $0.600 | $3.00 | 262K | 33K | 42 tok/s | 47 | 78.0 |
| Kimi K2 Thinking | Moonshot | $0.600 | $2.50 | 262K | 33K | -- | -- | -- |
| GLM-4.7 | Zhipu | $0.600 | $2.20 | 200K | 128K | -- | -- | -- |
| GPT-5.4 Mini | OpenAI | $0.750 | $4.50 | 400K | 128K | -- | -- | -- |
| Gemini 3.1 Flash Live | Google | $0.750 | $4.50 | 1.0M | 66K | -- | -- | -- |
| Claude Haiku 3.5 | Anthropic | $0.800 | $4.00 | 200K | 8K | -- | 19 | 23.4 |
| QwQ-Plus | Alibaba | $0.800 | $2.40 | 131K | 16K | -- | -- | -- |
| Kimi K2.6 | Moonshot | $0.950 | $4.00 | 262K | 33K | -- | -- | -- |
| Claude Haiku 4.5 | Anthropic | $1.00 | $5.00 | 200K | 8K | 100 tok/s | 31 | 31.1 |
| Claude 4.5 Haiku Reasoning | Anthropic | $1.00 | $5.00 | 200K | 8K | 139 tok/s | 37 | 37.1 |
| Gemini 3.1 Flash TTS | Google | $1.00 | $20.00 | 32K | 32K | -- | -- | -- |
| GLM-5 | Zhipu | $1.00 | $3.20 | 128K | 33K | 68 tok/s | 50 | 49.8 |
| MiMo-V2-Pro | Xiaomi | $1.00 | $3.00 | 1.0M | 131K | -- | -- | -- |
| Claude Haiku 4 5 20251001 | Anthropic | $1.00 | $5.00 | 200K | 64K | -- | -- | -- |
| Claude Haiku 4 5 | Anthropic | $1.00 | $5.00 | 200K | 64K | -- | -- | -- |
| o4 Mini | OpenAI | $1.10 | $4.40 | 200K | 100K | 171 tok/s | 33 | 30.1 |
| o3 Mini | OpenAI | $1.10 | $4.40 | 200K | 100K | 162 tok/s | 26 | 23.5 |
| Kimi K2 Thinking Turbo | Moonshot | $1.15 | $8.00 | 262K | 33K | -- | -- | -- |
| GLM-5 Turbo | Zhipu | $1.20 | $4.00 | 200K | 128K | -- | -- | -- |
| GPT-5.1 | OpenAI | $1.25 | $10.00 | 400K | 16K | 168 tok/s | 48 | 38.2 |
| GPT-5 | OpenAI | $1.25 | $10.00 | 400K | 16K | 88 tok/s | 45 | 35.7 |
| GPT-5 Medium | OpenAI | $1.25 | $10.00 | 400K | 16K | 84 tok/s | 42 | 33.6 |
| Gemini 2.5 Pro | Google | $1.25 | $10.00 | 1.0M | 66K | 135 tok/s | 35 | 27.7 |
| Grok 4.3 | xAI | $1.25 | $2.50 | 1.0M | 16K | -- | -- | -- |
| Nova 2.0 Pro Reasoning | Amazon | $1.25 | $10.00 | 128K | 33K | 162 tok/s | 32 | 25.5 |
| GLM-5.1 | Zhipu | $1.40 | $4.40 | 200K | 128K | -- | -- | -- |
| Mistral Medium 3.5 | Mistral | $1.50 | $7.50 | 256K | 33K | -- | -- | -- |
| DeepSeek V4-Pro | DeepSeek | $1.74 | $3.48 | 1.0M | 384K | -- | -- | -- |
| GPT-5.2 | OpenAI | $1.75 | $14.00 | 400K | 16K | 80 tok/s | 51 | 29.3 |
| GPT-5.3 Codex | OpenAI | $1.75 | $14.00 | 400K | 33K | 96 tok/s | 54 | 30.6 |
| GPT-4.1 | OpenAI | $2.00 | $8.00 | 1.0M | 33K | 127 tok/s | 26 | 13.2 |
| o3 | OpenAI | $2.00 | $8.00 | 200K | 100K | 111 tok/s | 38 | 19.2 |
| o4 Mini Deep Research | OpenAI | $2.00 | $8.00 | 200K | 100K | -- | -- | -- |
| Gemini 3.1 Pro | Google | $2.00 | $12.00 | 1.0M | 66K | 130 tok/s | 57 | 28.6 |
| Gemini 3 Pro | Google | $2.00 | $12.00 | 1.0M | 66K | 141 tok/s | 48 | 24.2 |
| Grok 4.20 | xAI | $2.00 | $6.00 | 2.0M | 16K | 97 tok/s | 49 | 24.6 |
| Grok 2 | xAI | $2.00 | $10.00 | 131K | 16K | -- | -- | -- |
| Magistral Medium | Mistral | $2.00 | $5.00 | 40K | 16K | -- | -- | -- |
| Pixtral Large | Mistral | $2.00 | $6.00 | 128K | 4K | -- | -- | -- |
| Jamba 1.5 Large | AI21 | $2.00 | $8.00 | 256K | 256K | -- | -- | -- |
| Jamba 1.5 Large@001 | AI21 | $2.00 | $8.00 | 256K | 256K | -- | -- | -- |
| Jamba Large 1.6 | AI21 | $2.00 | $8.00 | 256K | 256K | -- | -- | -- |
| Jamba Large 1.7 | AI21 | $2.00 | $8.00 | 256K | 256K | -- | -- | -- |
| GPT-5.4 | OpenAI | $2.50 | $15.00 | 1.1M | 128K | 94 tok/s | 57 | 22.7 |
| GPT-4o | OpenAI | $2.50 | $10.00 | 128K | 16K | 167 tok/s | 17 | 6.9 |
| Command A | Cohere | $2.50 | $10.00 | 128K | 4K | 36 tok/s | 14 | 5.4 |
| Command A 03 2025 | Cohere | $2.50 | $10.00 | 256K | 8K | -- | -- | -- |
| Command R Plus | Cohere | $2.50 | $10.00 | 128K | 4K | -- | -- | -- |
| Command R Plus 08 2024 | Cohere | $2.50 | $10.00 | 128K | 4K | -- | -- | -- |
| Claude Sonnet 4.6 Adaptive | Anthropic | $3.00 | $15.00 | 200K | 64K | 79 tok/s | 52 | 17.2 |
| Claude Sonnet 4.6 | Anthropic | $3.00 | $15.00 | 200K | 64K | 53 tok/s | 44 | 14.8 |
| Claude Sonnet 4.5 | Anthropic | $3.00 | $15.00 | 200K | 64K | -- | -- | -- |
| Claude Sonnet 4 | Anthropic | $3.00 | $15.00 | 200K | 64K | 50 tok/s | 43 | 14.2 |
| Claude 3.7 Sonnet | Anthropic | $3.00 | $15.00 | 200K | 64K | -- | -- | -- |
| Grok 4 | xAI | $3.00 | $15.00 | 2.0M | 16K | 44 tok/s | 42 | 13.8 |
| Grok 3 | xAI | $3.00 | $15.00 | 131K | 16K | -- | -- | -- |
| Sonar Pro | Perplexity | $3.00 | $15.00 | 128K | 4K | -- | 15 | 5.1 |
| Claude 3 7 Sonnet 20250219 | Anthropic | $3.00 | $15.00 | 200K | 64K | -- | -- | -- |
| Claude 4 Sonnet 20250514 | Anthropic | $3.00 | $15.00 | 1.0M | 64K | -- | -- | -- |
| Claude Sonnet 4 5 | Anthropic | $3.00 | $15.00 | 200K | 64K | -- | -- | -- |
| Claude Sonnet 4 5 20250929 | Anthropic | $3.00 | $15.00 | 200K | 64K | -- | -- | -- |
| Claude Sonnet 4 6 | Anthropic | $3.00 | $15.00 | 1.0M | 64K | -- | -- | -- |
| Claude Sonnet 4 20250514 | Anthropic | $3.00 | $15.00 | 1.0M | 64K | -- | -- | -- |
| GPT-5.5 | OpenAI | $5.00 | $30.00 | 1.1M | 128K | -- | -- | -- |
| Claude Opus 4.7 | Anthropic | $5.00 | $25.00 | 1.0M | 128K | -- | -- | -- |
| Claude Opus 4.6 Adaptive | Anthropic | $5.00 | $25.00 | 200K | 32K | 52 tok/s | 53 | 10.6 |
| Claude Opus 4.6 | Anthropic | $5.00 | $25.00 | 200K | 32K | 46 tok/s | 47 | 9.3 |
| Claude Opus 4.5 | Anthropic | $5.00 | $25.00 | 200K | 32K | 58 tok/s | 43 | 8.6 |
| Grok 3 Fast | xAI | $5.00 | $25.00 | 131K | 16K | -- | -- | -- |
| MAI-Image-2 | Microsoft | $5.00 | $33.00 | 32K | 1K | -- | -- | -- |
| Claude Opus 4 5 20251101 | Anthropic | $5.00 | $25.00 | 200K | 64K | -- | -- | -- |
| Claude Opus 4 5 | Anthropic | $5.00 | $25.00 | 200K | 64K | -- | -- | -- |
| Claude Opus 4 6 | Anthropic | $5.00 | $25.00 | 1.0M | 128K | -- | -- | -- |
| Claude Opus 4 6 20260205 | Anthropic | $5.00 | $25.00 | 1.0M | 128K | -- | -- | -- |
| Claude Opus 4 7 | Anthropic | $5.00 | $25.00 | 1.0M | 128K | -- | -- | -- |
| Claude Opus 4 7 20260416 | Anthropic | $5.00 | $25.00 | 1.0M | 128K | -- | -- | -- |
| o3 Deep Research | OpenAI | $10.00 | $40.00 | 200K | 100K | -- | -- | -- |
| o1 | OpenAI | $15.00 | $60.00 | 200K | 100K | 108 tok/s | 31 | 2.1 |
| Claude Opus 4.1 | Anthropic | $15.00 | $75.00 | 200K | 32K | -- | -- | -- |
| Claude 3 Opus 20240229 | Anthropic | $15.00 | $75.00 | 200K | 4K | -- | -- | -- |
| Claude 4 Opus 20250514 | Anthropic | $15.00 | $75.00 | 200K | 32K | -- | -- | -- |
| Claude Opus 4 1 | Anthropic | $15.00 | $75.00 | 200K | 32K | -- | -- | -- |
| Claude Opus 4 1 20250805 | Anthropic | $15.00 | $75.00 | 200K | 32K | -- | -- | -- |
| Claude Opus 4 20250514 | Anthropic | $15.00 | $75.00 | 200K | 32K | -- | -- | -- |
| Voxtral TTS | Mistral | $16.00 | < $0.01 | 128K | 0K | -- | -- | -- |
| o3-pro | OpenAI | $20.00 | $80.00 | 200K | 100K | -- | -- | -- |
| Claude Mythos Preview | Anthropic | $25.00 | $125 | 1.0M | 32K | -- | -- | -- |
| GPT-5.5 Pro | OpenAI | $30.00 | $180 | 1.1M | 128K | -- | -- | -- |
| GPT-5.4 Pro | OpenAI | $30.00 | $180 | 1.1M | 128K | -- | -- | -- |
| o1 Pro | OpenAI | $150 | $600 | 200K | 100K | -- | -- | -- |
Pricing as of March 2026. Open-source model prices reflect hosted API providers. Always verify with official pages. Value = quality index / input cost per 1M tokens (higher is better).
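The Value formula in the note above can be sketched in a few lines of Python. This is a minimal illustration, not the site's own code: the function name `value_score` is hypothetical, and the sample rows are copied from the table.

```python
def value_score(quality_index: float, input_price_per_1m: float) -> float:
    """Value = quality index / input cost per 1M tokens (higher is better)."""
    if input_price_per_1m <= 0:
        raise ValueError("input price per 1M tokens must be positive")
    return quality_index / input_price_per_1m


# (model, input price in USD per 1M tokens, quality index as displayed)
rows = [
    ("GPT-4.1 Nano", 0.100, 13),
    ("Grok 4.1 Fast Reasoning", 0.200, 39),
    ("o1", 15.00, 31),
]

for model, price, quality in rows:
    print(f"{model}: {value_score(quality, price):.1f}")
```

For some rows the result differs slightly from the Value column. Grok 4.1 Fast Reasoning, for example, works out to 195 from the displayed quality of 39, while the table shows 193.0. The gap presumably comes from the quality index being rounded for display.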
Speed & quality data by Artificial Analysis
How to Compare LLM API Pricing
1. Browse the pricing table
   400+ models are listed with input and output pricing per million tokens, plus context window sizes and benchmark scores.
2. Sort and filter
   Click any column header to sort by price, context window, or benchmark score. Use the provider filter to focus on specific vendors.
3. Evaluate price vs. performance
   Quality and speed metrics from Artificial Analysis help you weigh cost against actual model capability.
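If you export a few rows and want to reproduce the sort-and-filter step offline, a rough sketch could look like the following. The `ModelRow` class and `filter_and_sort` helper are hypothetical names introduced for illustration. The sample values come from the table, with `None` standing in for `--`.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class ModelRow:
    model: str
    provider: str
    input_per_1m: float              # USD per 1M input tokens
    output_per_1m: float             # USD per 1M output tokens
    quality: Optional[float] = None  # None stands for "--" in the table


ROWS = [
    ModelRow("GPT-5 Nano", "OpenAI", 0.050, 0.400, 27),
    ModelRow("Llama 4 Scout", "Meta", 0.080, 0.300, 14),
    ModelRow("GPT-OSS 120B", "OpenAI", 0.039, 0.190),
    ModelRow("GPT-5 Mini", "OpenAI", 0.250, 2.00, 41),
]


def filter_and_sort(rows, provider=None, key="input_per_1m", descending=False):
    """Keep rows for one provider (or all), then sort by the given column."""
    selected = [r for r in rows if provider is None or r.provider == provider]
    # Rows with a missing value go last regardless of sort direction.
    present = [r for r in selected if getattr(r, key) is not None]
    missing = [r for r in selected if getattr(r, key) is None]
    ordered = sorted(present, key=lambda r: getattr(r, key), reverse=descending)
    return ordered + missing


cheapest_openai = filter_and_sort(ROWS, provider="OpenAI")
best_quality = filter_and_sort(ROWS, key="quality", descending=True)
print([r.model for r in cheapest_openai])
print([r.model for r in best_quality])
```

Placing rows without a benchmark score at the end is a design choice made here so that unscored models never outrank scored ones. The page itself may order them differently.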
Why Use This Pricing Comparison
- 400+ models from 15+ providers in a single sortable table — no tab switching
- Enriched with quality index and speed benchmarks from Artificial Analysis
- Provider-colored badges for quick visual scanning across vendors
- Context window and max output token data alongside pricing
- Data sourced from official provider docs and LiteLLM open-source project
Common Use Cases
Vendor selection
Compare pricing across all major providers before committing to an API integration. Sort by cost to find the cheapest option.
Cost optimization
Find cheaper alternatives to your current model by sorting on price and checking if the quality benchmark is close enough.
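One way to make "close enough" concrete is to set an explicit tolerance on the quality index. The sketch below assumes a hypothetical `max_quality_drop` parameter and compares on input price only, which is the same price the Value column uses. The rows are copied from the table.

```python
from typing import NamedTuple


class Model(NamedTuple):
    name: str
    input_per_1m: float  # USD per 1M input tokens
    quality: float       # quality index as displayed in the table


def cheaper_alternatives(current, candidates, max_quality_drop=2.0):
    """Return candidates with a lower input price whose quality index is
    at most `max_quality_drop` points below the current model's."""
    floor = current.quality - max_quality_drop
    matches = [
        m for m in candidates
        if m.input_per_1m < current.input_per_1m and m.quality >= floor
    ]
    # Cheapest first.
    return sorted(matches, key=lambda m: m.input_per_1m)


current = Model("GPT-4o", 2.50, 17)
candidates = [
    Model("GPT-4.1 Mini", 0.400, 23),
    Model("Mistral Small 3.2", 0.100, 10),
    Model("Llama 4 Maverick", 0.270, 18),
    Model("Claude Sonnet 4.6", 3.00, 44),
]

alternatives = cheaper_alternatives(current, candidates)
for m in alternatives:
    print(f"{m.name}: ${m.input_per_1m:.3f} / 1M input, quality {m.quality}")
```

If your workload is output-heavy, the same filter could compare output price or a blended cost instead.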
Technical evaluation
Use the quality index and speed metrics to shortlist models, then compare their pricing to make a final decision.
Model comparison
Use the compare tool to see side-by-side pricing and specs for any two models.
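For a side-by-side comparison on your own workload, per-1M prices can be converted into a per-request cost. The following is a small sketch rather than the compare tool itself. The function names and the example token counts (10,000 input, 2,000 output) are assumptions, and the prices are taken from the table.

```python
def request_cost(input_tokens, output_tokens, input_per_1m, output_per_1m):
    """Cost in USD of one request, given prices per 1 million tokens."""
    return (input_tokens * input_per_1m + output_tokens * output_per_1m) / 1_000_000


# (input USD per 1M, output USD per 1M), copied from the table
PRICES = {
    "Claude Sonnet 4.6": (3.00, 15.00),
    "GPT-5.4": (2.50, 15.00),
}


def compare(model_a, model_b, input_tokens, output_tokens):
    """Return the estimated per-request cost for two models."""
    return {
        name: request_cost(input_tokens, output_tokens, *PRICES[name])
        for name in (model_a, model_b)
    }


costs = compare("Claude Sonnet 4.6", "GPT-5.4", 10_000, 2_000)
for name, cost in costs.items():
    print(f"{name}: ${cost:.4f} per request")
```

Multiplying the per-request figure by expected monthly request volume gives a rough budget estimate. It ignores caching discounts, batch pricing, and other provider-specific adjustments.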