Compare AI Models
Filter and sort 49 AI models across 11 providers side-by-side โ pricing, context window, type, and what each is best for.
Covers OpenAI, Anthropic, Google, Meta, DeepSeek, Microsoft, Amazon, Mistral, xAI, Cohere, and Qwen. Via-API pricing per 1M tokens as of May 2026. Always verify current pricing at the official provider pricing page before production use. Subscription plan pricing differs from API pricing.
Comparison Table
Showing 49 of 49 models
| Model | Provider | Type | Modalities | Params | Context โ | Input /1M โ | Output /1M โ | Tier | OW | Best for |
|---|---|---|---|---|---|---|---|---|---|---|
| Gemma 4 1B | General | TxtImgAud | 1B | 128K | $0.010 | $0.020 | Budget | โ | Extreme edge, mobile inference | |
| Llama 3.2 1B | Meta | General | Txt | 1B | 128K | $0.010 | $0.010 | Budget | โ | Smallest open-weight, edge & mobile |
| Mistral Nemo | Mistral | General | Txt | 12B | 128K | $0.020 | $0.040 | Budget | โ | Apache 2.0, ultra-budget open-weight |
| Gemma 4 4B | General | TxtImgAud | 4B | 128K | $0.030 | $0.060 | Budget | โ | On-device and edge deployments | |
| Phi-4-mini | Microsoft | General | Txt | 3.8B | 16K | $0.030 | $0.070 | Budget | โ | 3.8B ultra-small, on-device deployment |
| Nova Micro | Amazon | General | Txt | โ | 128K | $0.035 | $0.14 | Budget | โ | Text-only, lowest latency on AWS |
| Gemma 4 12B | General | TxtImgAud | 12B | 128K | $0.040 | $0.13 | Budget | โ | Efficient open-weight text, vision & audio | |
| Llama 3.2 11B Vision | Meta | General | TxtImg | 11B | 128K | $0.050 | $0.050 | Budget | โ | Open-weight vision & image understanding |
| Qwen3 8B | Qwen | General | Txt | 8B | 128K | $0.050 | $0.20 | Budget | โ | Small open-weight, local inference |
| Nova Lite | Amazon | General | TxtImgVid | โ | 300K | $0.060 | $0.24 | Budget | โ | Fast multimodal, very low cost on AWS |
| Phi-4 | Microsoft | General | Txt | 14B | 16K | $0.065 | $0.14 | Budget | โ | 14B โ outperforms many larger text models |
| Phi-4-multimodal | Microsoft | General | TxtImgAud | 5.6B | 128K | $0.070 | $0.14 | Budget | โ | Audio + image + text, small multimodal model |
| Gemma 4 27B | General | TxtImgAud | 27B | 256K | $0.080 | $0.16 | Budget | โ | Open-weight multimodal + audio, self-hostable | |
| Phi-4 Mini Reasoning | Microsoft | Reasoning | Txt | 3.8B | 131K | $0.080 | $0.32 | Budget | โ | Small open-weight reasoning, 131K context |
| GPT-4.1 nano | OpenAI | General | TxtImg | โ | 1M | $0.10 | $0.40 | Budget | โ | Cheapest OpenAI model with 1M context |
| Gemini 2.5 Flash-Lite | General | TxtImg | โ | 1M | $0.10 | $0.40 | Budget | โ | Cheapest capable model per token | |
| Llama 4 Scout | Meta | General | TxtImg | 109B (MoE) | 10M | $0.10 | $0.35 | Budget | โ | Ultra-long 10M context window, multimodal |
| Llama 3.3 70B | Meta | General | Txt | 70B | 128K | $0.10 | $0.32 | Budget | โ | Open-weight text workhorse, widely deployed |
| Mistral Small 3.1 | Mistral | General | TxtImg | 24B | 128K | $0.10 | $0.30 | Budget | โ | SOTA small model with vision, multilingual |
| DeepSeek V4 Flash | DeepSeek | General | Txt | โ | 163K | $0.14 | $0.28 | Budget | โ | Ultra-low cost open-weight inference |
| GPT-4o mini | OpenAI | General | TxtImg | โ | 128K | $0.15 | $0.60 | Budget | โ | Fast, affordable chat and extraction |
| Command R | Cohere | General | Txt | 35B | 128K | $0.15 | $0.60 | Budget | โ | Efficient RAG for lighter workloads |
| Qwen3 32B | Qwen | General | Txt | 32B | 128K | $0.15 | $0.60 | Budget | โ | Strong open-weight, self-hostable |
| Llama 4 Maverick | Meta | General | TxtImg | 400B (MoE) | 1M | $0.20 | $0.85 | Budget | โ | Open-weight multimodal quality at low cost |
| Qwen3 Coder | Qwen | General | Txt | 480B (MoE) | 128K | $0.22 | $0.90 | Budget | โ | Alibaba coding specialist, open-weight |
| DeepSeek V4 Pro | DeepSeek | General | Txt | 1.6T (MoE) | 163K | $0.27 | $1.10 | Budget | โ | MIT license, frontier open-weight |
| Gemini 2.5 Flash | General | TxtImgVidAud | โ | 1M | $0.30 | $2.50 | Budget | โ | Great value, tunable thinking budget | |
| Codestral | Mistral | General | Txt | 22B | 256K | $0.30 | $0.90 | Budget | โ | Code specialist โ 80+ programming languages |
| Grok 3 Mini | xAI | Reasoning | Txt | โ | 131K | $0.30 | $0.50 | Budget | โ | Cost-effective Grok reasoning |
| GPT-4.1 mini | OpenAI | General | TxtImg | โ | 1M | $0.40 | $1.60 | Budget | โ | Budget-friendly, 1M context tasks |
| DeepSeek R1 | DeepSeek | Reasoning | Txt | 671B | 163K | $0.55 | $2.19 | Budget | โ | Open-weight reasoning, matches o1 |
| Qwen3 Max | Qwen | Both | Txt | 235B (MoE) | 128K | $0.78 | $3.90 | Budget | โ | Alibaba flagship, extended thinking mode |
| Nova Pro | Amazon | General | TxtImgVid | โ | 300K | $0.80 | $3.20 | Budget | โ | Multimodal agentic workflows on AWS |
| Claude Haiku 4.5 | Anthropic | General | TxtImgDoc | โ | 200K | $1.00 | $5.00 | Budget | โ | High-volume extraction & classification |
| o4-mini | OpenAI | Reasoning | TxtImg | โ | 200K | $1.10 | $4.40 | Mid | โ | Cost-effective reasoning at scale |
| Gemini 2.5 Pro | Both | TxtImgVidAudDoc | โ | 1M | $2.00 | $12.00 | Mid | โ | Top coding benchmarks, 1M context | |
| Mistral Large 3 | Mistral | General | Txt | โ | 128K | $2.00 | $6.00 | Mid | โ | Frontier-class, multilingual tasks |
| Pixtral Large | Mistral | General | TxtImg | โ | 128K | $2.00 | $6.00 | Mid | โ | Multimodal flagship โ vision + text |
| GPT-4o | OpenAI | General | TxtImgAud | โ | 128K | $2.50 | $10.00 | Mid | โ | Multimodal โ real-time audio, video, images |
| Nova Premier | Amazon | Both | TxtImgVidAud | โ | 1M | $2.50 | $12.50 | Mid | โ | Amazon flagship, extended thinking, 1M context |
| Command A | Cohere | General | Txt | โ | 256K | $2.50 | $10.00 | Mid | โ | Enterprise agentic workflows |
| Command R+ | Cohere | General | Txt | 104B | 128K | $2.50 | $10.00 | Mid | โ | Enterprise RAG, grounded tool use |
| Claude Sonnet 4.6 | Anthropic | Both | TxtImgDoc | โ | 200K | $3.00 | $15.00 | Mid | โ | Production balance + extended thinking |
| Grok 3 | xAI | Both | TxtImg | โ | 131K | $3.00 | $15.00 | Mid | โ | Frontier reasoning, X/Twitter knowledge |
| GPT-5.5 | OpenAI | Both | TxtImg | โ | 1M | $5.00 | $30.00 | Mid | โ | Frontier quality, complex multi-step tasks |
| GPT-4.1 | OpenAI | General | TxtImgDoc | โ | 1M | $5.00 | $15.00 | Mid | โ | Coding, instruction-following, 1M context |
| Claude Opus 4.7 | Anthropic | Both | TxtImgDoc | โ | 200K | $5.00 | $25.00 | Mid | โ | Agentic pipelines, deep reasoning |
| o3 | OpenAI | Reasoning | TxtImg | โ | 200K | $10.00 | $40.00 | Frontier | โ | Hard math, science, competition-level reasoning |
| o1 | OpenAI | Reasoning | TxtImg | โ | 200K | $15.00 | $60.00 | Frontier | โ | Deep deliberate problem-solving, research |
OW = Open-weight (publicly released model weights). Click column headers to sort. Prices approximate via API (May 2026); always verify at the official provider pricing page before production use.
Reading This Table
Budget (โค $1/1M input)
Best for high-volume, cost-sensitive workloads: classification, extraction, summarization at scale.
Mid ($1โ$5/1M input)
Production workhorses. Good quality/cost ratio for most use cases: coding, analysis, chat.
Frontier (> $5/1M input)
Reserve for hard reasoning, complex agentic tasks, or where output quality is the primary constraint.
General vs Reasoning vs Both
General = standard chat/completion. Reasoning = uses internal thinking steps (o3, R1). Both = supports both modes โ e.g., Sonnet 4.6 with extended thinking on/off.
Open-weight
Open-weight models have publicly released weights. You can self-host them or access via third-party APIs (Groq, Together, Fireworks). Prices shown are typical API rates โ self-hosting costs vary by GPU.