API Pricing
Every major LLM provider in one table. Sortable, filterable. Prices per 1 million tokens.
| Model | Provider | Input / 1M | Output / 1M | Context | Max Output | Speed | Quality | Value |
|---|---|---|---|---|---|---|---|---|
| GPT-OSS 120B | OpenAI | $0.039 | $0.190 | 131K | 33K | -- | -- | -- |
| GPT-5 Nano | OpenAI | $0.050 | $0.400 | 128K | 16K | 141 tok/s | 27 | 536.0 |
| Qwen3.5-9B | Alibaba | $0.050 | $0.150 | 262K | 33K | -- | -- | -- |
| Qwen3.5-Omni Flash | Alibaba | $0.065 | $0.260 | 262K | 33K | -- | -- | -- |
| Hunyuan HY3 Preview | Tencent | $0.066 | $0.260 | 262K | 33K | -- | -- | -- |
| GPT-OSS 20B (Bedrock) | OpenAI | $0.070 | $0.300 | 16K | 16K | -- | -- | -- |
| GLM-4.7-flash | Zhipu | $0.070 | $0.400 | 200K | 8K | -- | -- | -- |
| GPT-OSS 20B | OpenAI | $0.075 | $0.300 | 131K | 33K | -- | -- | -- |
| Gemini 2.0 Flash-Lite | Google | $0.075 | $0.300 | 1.0M | 8K | -- | 15 | 193.3 |
| Llama 4 Scout | Meta | $0.080 | $0.300 | 1.0M | 16K | 134 tok/s | 14 | 168.8 |
| Qwen3-Next-80B-A3B-Thinking | Alibaba | $0.098 | $0.780 | 262K | 33K | -- | -- | -- |
| GPT-4.1 Nano | OpenAI | $0.100 | $0.400 | 1.0M | 33K | 114 tok/s | 13 | 130.0 |
| Gemini 2.5 Flash-Lite | Google | $0.100 | $0.400 | 1.0M | 66K | 250 tok/s | 13 | 127.0 |
| Gemini 2.0 Flash | Google | $0.100 | $0.400 | 1.0M | 8K | -- | 19 | 185.0 |
| Devstral Small | Mistral | $0.100 | $0.300 | 256K | 33K | -- | -- | -- |
| Mistral Small 3.2 | Mistral | $0.100 | $0.300 | 128K | 4K | 149 tok/s | 10 | 102.0 |
| Gemma 4 26B A4B | Google | $0.130 | $0.400 | 262K | 33K | -- | -- | -- |
| Gemma 4 31B | Google | $0.140 | $0.400 | 262K | 33K | -- | -- | -- |
| DeepSeek V4-Flash | DeepSeek | $0.140 | $0.280 | 1.0M | 384K | -- | -- | -- |
| GPT-OSS 120B (Bedrock) | OpenAI | $0.150 | $0.600 | 16K | 16K | -- | -- | -- |
| GPT-4o Mini | OpenAI | $0.150 | $0.600 | 128K | 16K | 76 tok/s | 13 | 84.0 |
| Mistral Small 4 | Mistral | $0.150 | $0.600 | 256K | 33K | -- | -- | -- |
| Ministral 3 8B | Mistral | $0.150 | $0.150 | 262K | 33K | -- | -- | -- |
| Pixtral 12B | Mistral | $0.150 | $0.150 | 128K | 4K | -- | -- | -- |
| Command R | Cohere | $0.150 | $0.600 | 128K | 4K | -- | -- | -- |
| Command R 08 2024 | Cohere | $0.150 | $0.600 | 128K | 4K | -- | -- | -- |
| Command R7b 12 2024 | Cohere | $0.150 | $0.038 | 128K | 4K | -- | -- | -- |
| Llama 3.3 70B | Meta | $0.180 | $0.180 | 131K | 4K | -- | -- | -- |
| GPT-5.4 Nano | OpenAI | $0.200 | $1.25 | 400K | 128K | -- | -- | -- |
| Grok 4.1 Fast | xAI | $0.200 | $0.500 | 2.0M | 16K | 81 tok/s | 24 | 118.0 |
| Grok 4.1 Fast Reasoning | xAI | $0.200 | $0.500 | 2.0M | 16K | 97 tok/s | 39 | 193.0 |
| Grok Code Fast | xAI | $0.200 | $1.50 | 256K | 16K | -- | -- | -- |
| Ministral 3 14B | Mistral | $0.200 | $0.200 | 262K | 33K | -- | -- | -- |
| Jamba 1.5 | AI21 | $0.200 | $0.400 | 256K | 256K | -- | -- | -- |
| Jamba 1.5 Mini | AI21 | $0.200 | $0.400 | 256K | 256K | -- | -- | -- |
| Jamba 1.5 Mini@001 | AI21 | $0.200 | $0.400 | 256K | 256K | -- | -- | -- |
| Jamba Mini 1.6 | AI21 | $0.200 | $0.400 | 256K | 256K | -- | -- | -- |
| Jamba Mini 1.7 | AI21 | $0.200 | $0.400 | 256K | 256K | -- | -- | -- |
| GPT-5 Mini | OpenAI | $0.250 | $2.00 | 400K | 16K | 105 tok/s | 41 | 164.8 |
| Gemini 3.1 Flash-Lite | Google | $0.250 | $1.50 | 1.0M | 66K | 292 tok/s | 34 | 134.0 |
| Claude 3 Haiku 20240307 | Anthropic | $0.250 | $1.25 | 200K | 4K | -- | -- | -- |
| Llama 4 Maverick | Meta | $0.270 | $0.850 | 1.0M | 16K | 109 tok/s | 18 | 68.1 |
| Qwen3.6-Plus | Alibaba | $0.276 | $1.65 | 1.0M | 33K | -- | -- | -- |
| DeepSeek V3.2 (Chat) | DeepSeek | $0.280 | $0.420 | 128K | 8K | -- | 32 | 114.6 |
| DeepSeek V3.2 (Reasoner) | DeepSeek | $0.280 | $0.420 | 128K | 8K | -- | -- | -- |
| DeepSeek Reasoner | DeepSeek | $0.280 | $0.420 | 131K | 66K | -- | -- | -- |
| Gemini 2.5 Flash | Google | $0.300 | $2.50 | 1.0M | 66K | 209 tok/s | 21 | 68.7 |
| Grok 3 Mini | xAI | $0.300 | $0.500 | 131K | 16K | -- | -- | -- |
| Codestral | Mistral | $0.300 | $0.900 | 256K | 33K | -- | -- | -- |
| Qwen 3.5 27B | Alibaba | $0.300 | $2.40 | 128K | 33K | -- | -- | -- |
| Nova 2.0 Lite | Amazon | $0.300 | $2.50 | 1.0M | 64K | 234 tok/s | 18 | 60.0 |
| Nemotron 3 Super 120B | NVIDIA | $0.300 | $0.800 | 1.0M | 33K | 158 tok/s | 36 | 120.0 |
| MiniMax M2.5 | MiniMax | $0.300 | $1.20 | 128K | 33K | 94 tok/s | 42 | 139.7 |
| Command Light | Cohere | $0.300 | $0.600 | 4K | 4K | -- | -- | -- |
| GPT-4.1 Mini | OpenAI | $0.400 | $1.60 | 1.0M | 33K | 82 tok/s | 23 | 57.2 |
| Mistral Medium | Mistral | $0.400 | $2.00 | 131K | 16K | -- | -- | -- |
| Devstral | Mistral | $0.400 | $2.00 | 256K | 33K | -- | -- | -- |
| Qwen3.5-Omni Plus | Alibaba | $0.400 | $4.80 | 262K | 33K | -- | -- | -- |
| Gemini 3 Flash | Google | $0.500 | $3.00 | 1.0M | 66K | 200 tok/s | 35 | 70.0 |
| Gemini 3 Flash Reasoning | Google | $0.500 | $3.00 | 1.0M | 66K | 206 tok/s | 46 | 92.8 |
| Mistral Large 3 | Mistral | $0.500 | $1.50 | 262K | 8K | 55 tok/s | 23 | 45.6 |
| Magistral Small | Mistral | $0.500 | $1.50 | 40K | 16K | -- | -- | -- |
| DeepSeek R1 | DeepSeek | $0.550 | $2.19 | 128K | 33K | -- | 27 | 49.3 |
| Grok 3 Mini Fast | xAI | $0.600 | $4.00 | 131K | 16K | -- | -- | -- |
| Qwen 3.5 397B | Alibaba | $0.600 | $3.60 | 128K | 33K | -- | -- | -- |
| Kimi K2.5 | Moonshot | $0.600 | $3.00 | 262K | 33K | 52 tok/s | 47 | 78.0 |
| Kimi K2 Thinking | Moonshot | $0.600 | $2.50 | 262K | 33K | -- | -- | -- |
| GLM-4.7 | Zhipu | $0.600 | $2.20 | 200K | 128K | -- | -- | -- |
| GPT-5.4 Mini | OpenAI | $0.750 | $4.50 | 400K | 128K | -- | -- | -- |
| Gemini 3.1 Flash Live | Google | $0.750 | $4.50 | 1.0M | 66K | -- | -- | -- |
| Claude Haiku 3.5 | Anthropic | $0.800 | $4.00 | 200K | 8K | -- | 19 | 23.4 |
| QwQ-Plus | Alibaba | $0.800 | $2.40 | 131K | 16K | -- | -- | -- |
| Kimi K2.6 | Moonshot | $0.950 | $4.00 | 262K | 33K | -- | -- | -- |
| Claude Haiku 4.5 | Anthropic | $1.00 | $5.00 | 200K | 8K | 114 tok/s | 31 | 31.1 |
| Claude 4.5 Haiku Reasoning | Anthropic | $1.00 | $5.00 | 200K | 8K | 139 tok/s | 37 | 37.1 |
| Gemini 3.1 Flash TTS | Google | $1.00 | $20.00 | 32K | 32K | -- | -- | -- |
| GLM-5 | Zhipu | $1.00 | $3.20 | 128K | 33K | 71 tok/s | 50 | 49.8 |
| MiMo-V2-Pro | Xiaomi | $1.00 | $3.00 | 1.0M | 131K | -- | -- | -- |
| Claude Haiku 4 5 20251001 | Anthropic | $1.00 | $5.00 | 200K | 64K | -- | -- | -- |
| Claude Haiku 4 5 | Anthropic | $1.00 | $5.00 | 200K | 64K | -- | -- | -- |
| o4 Mini | OpenAI | $1.10 | $4.40 | 200K | 100K | 154 tok/s | 33 | 30.1 |
| o3 Mini | OpenAI | $1.10 | $4.40 | 200K | 100K | 152 tok/s | 26 | 23.5 |
| Kimi K2 Thinking Turbo | Moonshot | $1.15 | $8.00 | 262K | 33K | -- | -- | -- |
| GLM-5 Turbo | Zhipu | $1.20 | $4.00 | 200K | 128K | -- | -- | -- |
| GPT-5.1 | OpenAI | $1.25 | $10.00 | 400K | 16K | 128 tok/s | 48 | 38.2 |
| GPT-5 | OpenAI | $1.25 | $10.00 | 400K | 16K | 86 tok/s | 45 | 35.7 |
| GPT-5 Medium | OpenAI | $1.25 | $10.00 | 400K | 16K | 84 tok/s | 42 | 33.6 |
| Gemini 2.5 Pro | Google | $1.25 | $10.00 | 1.0M | 66K | 133 tok/s | 35 | 27.7 |
| Grok 4.3 | xAI | $1.25 | $2.50 | 1.0M | 16K | -- | -- | -- |
| Nova 2.0 Pro Reasoning | Amazon | $1.25 | $10.00 | 128K | 33K | 158 tok/s | 32 | 25.5 |
| GLM-5.1 | Zhipu | $1.40 | $4.40 | 200K | 128K | -- | -- | -- |
| Mistral Medium 3.5 | Mistral | $1.50 | $7.50 | 256K | 33K | -- | -- | -- |
| DeepSeek V4-Pro | DeepSeek | $1.74 | $3.48 | 1.0M | 384K | -- | -- | -- |
| GPT-5.2 | OpenAI | $1.75 | $14.00 | 400K | 16K | 74 tok/s | 51 | 29.3 |
| GPT-5.3 Codex | OpenAI | $1.75 | $14.00 | 400K | 33K | 84 tok/s | 54 | 30.6 |
| GPT-4.1 | OpenAI | $2.00 | $8.00 | 1.0M | 33K | 121 tok/s | 26 | 13.2 |
| o3 | OpenAI | $2.00 | $8.00 | 200K | 100K | 113 tok/s | 38 | 19.2 |
| o4 Mini Deep Research | OpenAI | $2.00 | $8.00 | 200K | 100K | -- | -- | -- |
| Gemini 3.1 Pro | Google | $2.00 | $12.00 | 1.0M | 66K | 138 tok/s | 57 | 28.6 |
| Gemini 3 Pro | Google | $2.00 | $12.00 | 1.0M | 66K | 129 tok/s | 48 | 24.2 |
| Grok 4.20 | xAI | $2.00 | $6.00 | 2.0M | 16K | 91 tok/s | 49 | 24.6 |
| Grok 2 | xAI | $2.00 | $10.00 | 131K | 16K | -- | -- | -- |
| Magistral Medium | Mistral | $2.00 | $5.00 | 40K | 16K | -- | -- | -- |
| Pixtral Large | Mistral | $2.00 | $6.00 | 128K | 4K | -- | -- | -- |
| Jamba 1.5 Large | AI21 | $2.00 | $8.00 | 256K | 256K | -- | -- | -- |
| Jamba 1.5 Large@001 | AI21 | $2.00 | $8.00 | 256K | 256K | -- | -- | -- |
| Jamba Large 1.6 | AI21 | $2.00 | $8.00 | 256K | 256K | -- | -- | -- |
| Jamba Large 1.7 | AI21 | $2.00 | $8.00 | 256K | 256K | -- | -- | -- |
| GPT-5.4 | OpenAI | $2.50 | $15.00 | 1.1M | 128K | 90 tok/s | 57 | 22.7 |
| GPT-4o | OpenAI | $2.50 | $10.00 | 128K | 16K | 139 tok/s | 17 | 6.9 |
| Command A | Cohere | $2.50 | $10.00 | 128K | 4K | 46 tok/s | 14 | 5.4 |
| Command A 03 2025 | Cohere | $2.50 | $10.00 | 256K | 8K | -- | -- | -- |
| Command R Plus | Cohere | $2.50 | $10.00 | 128K | 4K | -- | -- | -- |
| Command R Plus 08 2024 | Cohere | $2.50 | $10.00 | 128K | 4K | -- | -- | -- |
| Claude Sonnet 4.6 Adaptive | Anthropic | $3.00 | $15.00 | 200K | 64K | 69 tok/s | 52 | 17.2 |
| Claude Sonnet 4.6 | Anthropic | $3.00 | $15.00 | 200K | 64K | 54 tok/s | 44 | 14.8 |
| Claude Sonnet 4.5 | Anthropic | $3.00 | $15.00 | 200K | 64K | -- | -- | -- |
| Claude Sonnet 4 | Anthropic | $3.00 | $15.00 | 200K | 64K | 54 tok/s | 44 | 14.8 |
| Claude 3.7 Sonnet | Anthropic | $3.00 | $15.00 | 200K | 64K | -- | -- | -- |
| Grok 4 | xAI | $3.00 | $15.00 | 2.0M | 16K | 47 tok/s | 42 | 13.8 |
| Grok 3 | xAI | $3.00 | $15.00 | 131K | 16K | -- | -- | -- |
| Sonar Pro | Perplexity | $3.00 | $15.00 | 128K | 4K | -- | 15 | 5.1 |
| Claude 3 7 Sonnet 20250219 | Anthropic | $3.00 | $15.00 | 200K | 64K | -- | -- | -- |
| Claude 4 Sonnet 20250514 | Anthropic | $3.00 | $15.00 | 1.0M | 64K | -- | -- | -- |
| Claude Sonnet 4 5 | Anthropic | $3.00 | $15.00 | 200K | 64K | -- | -- | -- |
| Claude Sonnet 4 5 20250929 | Anthropic | $3.00 | $15.00 | 200K | 64K | -- | -- | -- |
| Claude Sonnet 4 6 | Anthropic | $3.00 | $15.00 | 1.0M | 64K | -- | -- | -- |
| Claude Sonnet 4 20250514 | Anthropic | $3.00 | $15.00 | 1.0M | 64K | -- | -- | -- |
| GPT-5.5 | OpenAI | $5.00 | $30.00 | 1.1M | 128K | -- | -- | -- |
| Claude Opus 4.7 | Anthropic | $5.00 | $25.00 | 1.0M | 128K | -- | -- | -- |
| Claude Opus 4.6 Adaptive | Anthropic | $5.00 | $25.00 | 200K | 32K | 50 tok/s | 53 | 10.6 |
| Claude Opus 4.6 | Anthropic | $5.00 | $25.00 | 200K | 32K | 47 tok/s | 47 | 9.3 |
| Claude Opus 4.5 | Anthropic | $5.00 | $25.00 | 200K | 32K | 55 tok/s | 43 | 8.6 |
| Grok 3 Fast | xAI | $5.00 | $25.00 | 131K | 16K | -- | -- | -- |
| MAI-Image-2 | Microsoft | $5.00 | $33.00 | 32K | 1K | -- | -- | -- |
| Claude Opus 4 5 20251101 | Anthropic | $5.00 | $25.00 | 200K | 64K | -- | -- | -- |
| Claude Opus 4 5 | Anthropic | $5.00 | $25.00 | 200K | 64K | -- | -- | -- |
| Claude Opus 4 6 | Anthropic | $5.00 | $25.00 | 1.0M | 128K | -- | -- | -- |
| Claude Opus 4 6 20260205 | Anthropic | $5.00 | $25.00 | 1.0M | 128K | -- | -- | -- |
| Claude Opus 4 7 | Anthropic | $5.00 | $25.00 | 1.0M | 128K | -- | -- | -- |
| Claude Opus 4 7 20260416 | Anthropic | $5.00 | $25.00 | 1.0M | 128K | -- | -- | -- |
| o3 Deep Research | OpenAI | $10.00 | $40.00 | 200K | 100K | -- | -- | -- |
| o1 | OpenAI | $15.00 | $60.00 | 200K | 100K | 95 tok/s | 31 | 2.1 |
| Claude Opus 4.1 | Anthropic | $15.00 | $75.00 | 200K | 32K | -- | -- | -- |
| Claude 3 Opus 20240229 | Anthropic | $15.00 | $75.00 | 200K | 4K | -- | -- | -- |
| Claude 4 Opus 20250514 | Anthropic | $15.00 | $75.00 | 200K | 32K | -- | -- | -- |
| Claude Opus 4 1 | Anthropic | $15.00 | $75.00 | 200K | 32K | -- | -- | -- |
| Claude Opus 4 1 20250805 | Anthropic | $15.00 | $75.00 | 200K | 32K | -- | -- | -- |
| Claude Opus 4 20250514 | Anthropic | $15.00 | $75.00 | 200K | 32K | -- | -- | -- |
| Voxtral TTS | Mistral | $16.00 | < $0.01 | 128K | 0K | -- | -- | -- |
| o3-pro | OpenAI | $20.00 | $80.00 | 200K | 100K | -- | -- | -- |
| Claude Mythos Preview | Anthropic | $25.00 | $125 | 1.0M | 32K | -- | -- | -- |
| GPT-5.5 Pro | OpenAI | $30.00 | $180 | 1.1M | 128K | -- | -- | -- |
| GPT-5.4 Pro | OpenAI | $30.00 | $180 | 1.1M | 128K | -- | -- | -- |
| o1 Pro | OpenAI | $150 | $600 | 200K | 100K | -- | -- | -- |
Pricing as of March 2026. Prices for open-source models reflect hosted API providers. Always verify against the providers' official pricing pages. Value = quality index / input cost per 1M tokens (higher is better).
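As a minimal sketch of the Value formula stated above, the snippet below recomputes it for three rows copied from the table. The function name `value_score` is hypothetical. Because the Quality column is shown rounded, recomputed values can differ slightly from the listed ones (for example, 27 / 0.050 gives 540 for GPT-5 Nano against the listed 536.0), presumably because the site divides an unrounded index.

```python
def value_score(quality_index: float, input_price_per_1m: float) -> float:
    """Value = quality index / input cost per 1M tokens (higher is better)."""
    if input_price_per_1m <= 0:
        raise ValueError("input price must be positive")
    return quality_index / input_price_per_1m


# (model, quality index, input price in $ per 1M tokens), copied from the table.
rows = [
    ("GPT-4.1 Nano", 13, 0.100),
    ("Grok 4.1 Fast Reasoning", 39, 0.200),
    ("o1", 31, 15.00),
]

# Round to one decimal place, matching the table's Value column.
scores = {name: round(value_score(q, p), 1) for name, q, p in rows}

for name, score in scores.items():
    print(f"{name}: {score}")
```

Note that the formula uses only the input price, so a model with cheap input but expensive output can score higher than its real-world cost would suggest.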
Speed & quality data by Artificial Analysis
How to Compare LLM API Pricing
1. Browse the pricing table. 400+ models are listed with input and output pricing per million tokens, plus context window sizes and benchmark scores.
2. Sort and filter. Click any column header to sort by price, context window, or benchmark score. Use the provider filter to focus on specific vendors.
3. Evaluate price vs. performance. Quality and speed metrics from Artificial Analysis help you weigh cost against actual model capability.
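The same sort and filter steps can be reproduced offline on a copy of the table. The sketch below is illustrative only: `ModelRow`, `parse_tokens`, and `parse_row` are hypothetical helpers, and the sample rows are copied from the table above.

```python
from dataclasses import dataclass

# A few rows copied from the pricing table.
SAMPLE = """
| GPT-OSS 120B | OpenAI | $0.039 | $0.190 | 131K | 33K | -- | -- | -- |
| Llama 4 Scout | Meta | $0.080 | $0.300 | 1.0M | 16K | 134 tok/s | 14 | 168.8 |
| GPT-4.1 Nano | OpenAI | $0.100 | $0.400 | 1.0M | 33K | 114 tok/s | 13 | 130.0 |
| Grok 4.1 Fast | xAI | $0.200 | $0.500 | 2.0M | 16K | 81 tok/s | 24 | 118.0 |
"""


@dataclass
class ModelRow:
    model: str
    provider: str
    input_price: float   # $ per 1M input tokens
    output_price: float  # $ per 1M output tokens
    context: int         # context window in tokens


def parse_tokens(text: str) -> int:
    """Convert a size such as '131K' or '1.0M' to a token count."""
    units = {"K": 1_000, "M": 1_000_000}
    return int(float(text[:-1]) * units[text[-1]])


def parse_row(line: str) -> ModelRow:
    """Parse one pipe-delimited table row into a ModelRow."""
    cells = [c.strip() for c in line.strip().strip("|").split("|")]
    return ModelRow(
        model=cells[0],
        provider=cells[1],
        input_price=float(cells[2].lstrip("$")),
        output_price=float(cells[3].lstrip("$")),
        context=parse_tokens(cells[4]),
    )


rows = [parse_row(line) for line in SAMPLE.strip().splitlines()]

# Filter to one provider, then sort by output price (ascending).
openai_by_output = sorted(
    (r for r in rows if r.provider == "OpenAI"), key=lambda r: r.output_price
)

# Find the largest context window across all providers.
largest_context = max(rows, key=lambda r: r.context)
```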
Why Use This Pricing Comparison
- 400+ models from 15+ providers in a single sortable table, with no tab switching
- Enriched with quality index and speed benchmarks from Artificial Analysis
- Provider-colored badges for quick visual scanning across vendors
- Context window and max output token data alongside pricing
- Data sourced from official provider docs and the LiteLLM open-source project
Common Use Cases
Vendor selection
Compare pricing across all major providers before committing to an API integration. Sort by cost to find the cheapest option.
Cost optimization
Find cheaper alternatives to your current model by sorting on price and checking if the quality benchmark is close enough.
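One way to make "close enough" concrete is to set an explicit tolerance on the quality index. The sketch below assumes a hypothetical `max_quality_drop` threshold and treats a model as cheaper only if neither its input nor its output price is higher. Both are illustrative choices, not the site's method. Prices and quality scores are copied from the table.

```python
from typing import NamedTuple


class Model(NamedTuple):
    name: str
    input_price: float   # $ per 1M input tokens
    output_price: float  # $ per 1M output tokens
    quality: int         # quality index from the table


MODELS = [
    Model("GPT-5", 1.25, 10.00, 45),
    Model("GPT-5 Mini", 0.25, 2.00, 41),
    Model("MiniMax M2.5", 0.30, 1.20, 42),
    Model("Kimi K2.5", 0.60, 3.00, 47),
    Model("Grok 4.1 Fast Reasoning", 0.20, 0.50, 39),
    Model("GPT-4o", 2.50, 10.00, 17),
]


def cheaper_alternatives(current: Model, candidates: list[Model],
                         max_quality_drop: int) -> list[Model]:
    """Return models no more expensive than `current` on input and output,
    whose quality is at most `max_quality_drop` points lower, cheapest first."""
    floor = current.quality - max_quality_drop
    picks = [
        m for m in candidates
        if m.name != current.name
        and m.input_price <= current.input_price
        and m.output_price <= current.output_price
        and m.quality >= floor
    ]
    return sorted(picks, key=lambda m: m.input_price)


current = MODELS[0]  # GPT-5
alternatives = cheaper_alternatives(current, MODELS, max_quality_drop=5)
for m in alternatives:
    print(f"{m.name}: ${m.input_price}/1M in, quality {m.quality}")
```

Loosening the tolerance to 6 points would also admit Grok 4.1 Fast Reasoning, which shows how sensitive the shortlist is to the threshold you pick.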
Technical evaluation
Use the quality index and speed metrics to shortlist models, then compare their pricing to make a final decision.
Model comparison
Use the compare tool to see side-by-side pricing and specs for any two models.
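For a rough side-by-side estimate on your own workload, per-1M prices can be converted into a dollar cost. The sketch below is not the compare tool itself: the `workload_cost` function and the monthly token volumes are hypothetical, while the prices are copied from the table.

```python
def workload_cost(input_tokens: int, output_tokens: int,
                  input_price_per_1m: float, output_price_per_1m: float) -> float:
    """Dollar cost of a workload, given prices quoted per 1 million tokens."""
    return (input_tokens / 1_000_000 * input_price_per_1m
            + output_tokens / 1_000_000 * output_price_per_1m)


# Hypothetical monthly volume: 2M input tokens and 0.5M output tokens.
INPUT_TOKENS = 2_000_000
OUTPUT_TOKENS = 500_000

# (input $ per 1M, output $ per 1M), copied from the table.
prices = {
    "Claude Sonnet 4.6": (3.00, 15.00),
    "GPT-5.4": (2.50, 15.00),
}

costs = {
    name: workload_cost(INPUT_TOKENS, OUTPUT_TOKENS, inp, out)
    for name, (inp, out) in prices.items()
}

for name, cost in costs.items():
    print(f"{name}: ${cost:.2f}")
```

Because output tokens are typically priced several times higher than input tokens, the input-to-output mix of the workload can matter as much as the headline input price.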