Compare AI Models

Filter and sort 49 AI models across 11 providers side-by-side: pricing, context window, type, and what each is best for.

Covers OpenAI, Anthropic, Google, Meta, DeepSeek, Microsoft, Amazon, Mistral, xAI, Cohere, and Qwen. Prices are API rates per 1M tokens as of May 2026. Always verify current pricing at the official provider pricing page before production use. Subscription plan pricing differs from API pricing.

Comparison Table

| Model | Provider | Type | Modalities | Params | Context | Input /1M | Output /1M | Tier | OW | Best for |
|---|---|---|---|---|---|---|---|---|---|---|
| Gemma 4 1B | Google | General | Txt, Img, Aud | 1B | 128K | $0.010 | $0.020 | Budget | ✓ | Extreme edge, mobile inference |
| Llama 3.2 1B | Meta | General | Txt | 1B | 128K | $0.010 | $0.010 | Budget | ✓ | Smallest open-weight, edge & mobile |
| Mistral Nemo | Mistral | General | Txt | 12B | 128K | $0.020 | $0.040 | Budget | ✓ | Apache 2.0, ultra-budget open-weight |
| Gemma 4 4B | Google | General | Txt, Img, Aud | 4B | 128K | $0.030 | $0.060 | Budget | ✓ | On-device and edge deployments |
| Phi-4-mini | Microsoft | General | Txt | 3.8B | 16K | $0.030 | $0.070 | Budget | ✓ | 3.8B ultra-small, on-device deployment |
| Nova Micro | Amazon | General | Txt | - | 128K | $0.035 | $0.14 | Budget | - | Text-only, lowest latency on AWS |
| Gemma 4 12B | Google | General | Txt, Img, Aud | 12B | 128K | $0.040 | $0.13 | Budget | ✓ | Efficient open-weight text, vision & audio |
| Llama 3.2 11B Vision | Meta | General | Txt, Img | 11B | 128K | $0.050 | $0.050 | Budget | ✓ | Open-weight vision & image understanding |
| Qwen3 8B | Qwen | General | Txt | 8B | 128K | $0.050 | $0.20 | Budget | ✓ | Small open-weight, local inference |
| Nova Lite | Amazon | General | Txt, Img, Vid | - | 300K | $0.060 | $0.24 | Budget | - | Fast multimodal, very low cost on AWS |
| Phi-4 | Microsoft | General | Txt | 14B | 16K | $0.065 | $0.14 | Budget | ✓ | 14B, outperforms many larger text models |
| Phi-4-multimodal | Microsoft | General | Txt, Img, Aud | 5.6B | 128K | $0.070 | $0.14 | Budget | ✓ | Audio + image + text, small multimodal model |
| Gemma 4 27B | Google | General | Txt, Img, Aud | 27B | 256K | $0.080 | $0.16 | Budget | ✓ | Open-weight multimodal + audio, self-hostable |
| Phi-4 Mini Reasoning | Microsoft | Reasoning | Txt | 3.8B | 131K | $0.080 | $0.32 | Budget | ✓ | Small open-weight reasoning, 131K context |
| GPT-4.1 nano | OpenAI | General | Txt, Img | - | 1M | $0.10 | $0.40 | Budget | - | Cheapest OpenAI model with 1M context |
| Gemini 2.5 Flash-Lite | Google | General | Txt, Img | - | 1M | $0.10 | $0.40 | Budget | - | Cheapest capable model per token |
| Llama 4 Scout | Meta | General | Txt, Img | 109B (MoE) | 10M | $0.10 | $0.35 | Budget | ✓ | Ultra-long 10M context window, multimodal |
| Llama 3.3 70B | Meta | General | Txt | 70B | 128K | $0.10 | $0.32 | Budget | ✓ | Open-weight text workhorse, widely deployed |
| Mistral Small 3.1 | Mistral | General | Txt, Img | 24B | 128K | $0.10 | $0.30 | Budget | - | SOTA small model with vision, multilingual |
| DeepSeek V4 Flash | DeepSeek | General | Txt | - | 163K | $0.14 | $0.28 | Budget | ✓ | Ultra-low cost open-weight inference |
| GPT-4o mini | OpenAI | General | Txt, Img | - | 128K | $0.15 | $0.60 | Budget | - | Fast, affordable chat and extraction |
| Command R | Cohere | General | Txt | 35B | 128K | $0.15 | $0.60 | Budget | - | Efficient RAG for lighter workloads |
| Qwen3 32B | Qwen | General | Txt | 32B | 128K | $0.15 | $0.60 | Budget | ✓ | Strong open-weight, self-hostable |
| Llama 4 Maverick | Meta | General | Txt, Img | 400B (MoE) | 1M | $0.20 | $0.85 | Budget | ✓ | Open-weight multimodal quality at low cost |
| Qwen3 Coder | Qwen | General | Txt | 480B (MoE) | 128K | $0.22 | $0.90 | Budget | ✓ | Alibaba coding specialist, open-weight |
| DeepSeek V4 Pro | DeepSeek | General | Txt | 1.6T (MoE) | 163K | $0.27 | $1.10 | Budget | ✓ | MIT license, frontier open-weight |
| Gemini 2.5 Flash | Google | General | Txt, Img, Vid, Aud | - | 1M | $0.30 | $2.50 | Budget | - | Great value, tunable thinking budget |
| Codestral | Mistral | General | Txt | 22B | 256K | $0.30 | $0.90 | Budget | - | Code specialist, 80+ programming languages |
| Grok 3 Mini | xAI | Reasoning | Txt | - | 131K | $0.30 | $0.50 | Budget | - | Cost-effective Grok reasoning |
| GPT-4.1 mini | OpenAI | General | Txt, Img | - | 1M | $0.40 | $1.60 | Budget | - | Budget-friendly, 1M context tasks |
| DeepSeek R1 | DeepSeek | Reasoning | Txt | 671B | 163K | $0.55 | $2.19 | Budget | ✓ | Open-weight reasoning, matches o1 |
| Qwen3 Max | Qwen | Both | Txt | 235B (MoE) | 128K | $0.78 | $3.90 | Budget | ✓ | Alibaba flagship, extended thinking mode |
| Nova Pro | Amazon | General | Txt, Img, Vid | - | 300K | $0.80 | $3.20 | Budget | - | Multimodal agentic workflows on AWS |
| Claude Haiku 4.5 | Anthropic | General | Txt, Img, Doc | - | 200K | $1.00 | $5.00 | Budget | - | High-volume extraction & classification |
| o4-mini | OpenAI | Reasoning | Txt, Img | - | 200K | $1.10 | $4.40 | Mid | - | Cost-effective reasoning at scale |
| Gemini 2.5 Pro | Google | Both | Txt, Img, Vid, Aud, Doc | - | 1M | $2.00 | $12.00 | Mid | - | Top coding benchmarks, 1M context |
| Mistral Large 3 | Mistral | General | Txt | - | 128K | $2.00 | $6.00 | Mid | - | Frontier-class, multilingual tasks |
| Pixtral Large | Mistral | General | Txt, Img | - | 128K | $2.00 | $6.00 | Mid | - | Multimodal flagship: vision + text |
| GPT-4o | OpenAI | General | Txt, Img, Aud | - | 128K | $2.50 | $10.00 | Mid | - | Multimodal: real-time audio, video, images |
| Nova Premier | Amazon | Both | Txt, Img, Vid, Aud | - | 1M | $2.50 | $12.50 | Mid | - | Amazon flagship, extended thinking, 1M context |
| Command A | Cohere | General | Txt | - | 256K | $2.50 | $10.00 | Mid | - | Enterprise agentic workflows |
| Command R+ | Cohere | General | Txt | 104B | 128K | $2.50 | $10.00 | Mid | - | Enterprise RAG, grounded tool use |
| Claude Sonnet 4.6 | Anthropic | Both | Txt, Img, Doc | - | 200K | $3.00 | $15.00 | Mid | - | Production balance + extended thinking |
| Grok 3 | xAI | Both | Txt, Img | - | 131K | $3.00 | $15.00 | Mid | - | Frontier reasoning, X/Twitter knowledge |
| GPT-5.5 | OpenAI | Both | Txt, Img | - | 1M | $5.00 | $30.00 | Mid | - | Frontier quality, complex multi-step tasks |
| GPT-4.1 | OpenAI | General | Txt, Img, Doc | - | 1M | $5.00 | $15.00 | Mid | - | Coding, instruction-following, 1M context |
| Claude Opus 4.7 | Anthropic | Both | Txt, Img, Doc | - | 200K | $5.00 | $25.00 | Mid | - | Agentic pipelines, deep reasoning |
| o3 | OpenAI | Reasoning | Txt, Img | - | 200K | $10.00 | $40.00 | Frontier | - | Hard math, science, competition-level reasoning |
| o1 | OpenAI | Reasoning | Txt, Img | - | 200K | $15.00 | $60.00 | Frontier | - | Deep deliberate problem-solving, research |
Modalities: Txt = text input, Img = image input, Vid = video input, Aud = audio input, Doc = file input. Params: a dash means undisclosed; (MoE) = Mixture-of-Experts total parameter count.

OW = Open-weight (publicly released model weights). Rows are listed by input price, lowest first. Prices are approximate API rates as of May 2026; always verify at the official provider pricing page before production use.
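Because every price in the table is quoted per 1M tokens, the cost of a single call follows from simple proportional arithmetic. The sketch below illustrates that calculation; the function name `request_cost_usd` and the token counts are hypothetical, and the prices are the approximate Claude Haiku 4.5 figures from the table, so treat the result as an estimate rather than a billing figure.

```python
def request_cost_usd(input_tokens, output_tokens, input_per_1m, output_per_1m):
    """Estimate the cost of one API call from per-1M-token prices (USD)."""
    # Each side is billed in proportion to its token count.
    return (input_tokens * input_per_1m + output_tokens * output_per_1m) / 1_000_000


# Claude Haiku 4.5 row: $1.00 input and $5.00 output per 1M tokens.
# The 2,000 / 500 token counts are made up for illustration.
cost = request_cost_usd(2_000, 500, 1.00, 5.00)
print(f"${cost:.4f}")  # prints $0.0045
```

Output tokens are priced higher than input tokens for most rows, so workloads that generate long responses can cost noticeably more than the input price alone suggests.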

Reading This Table

Budget (≤ $1/1M input)

Best for high-volume, cost-sensitive workloads: classification, extraction, summarization at scale.

Mid (> $1 to $5/1M input)

Production workhorses. Good quality/cost ratio for most use cases: coding, analysis, chat.

Frontier (> $5/1M input)

Reserve for hard reasoning, complex agentic tasks, or where output quality is the primary constraint.
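The three tiers can be expressed as a small classification rule on input price. This is a sketch only: the function name `price_tier` is hypothetical, and the handling of the exact $1 and $5 boundaries is inferred from the table, where Claude Haiku 4.5 at $1.00 is listed as Budget and GPT-5.5 at $5.00 is listed as Mid.

```python
def price_tier(input_per_1m: float) -> str:
    """Map an input price (USD per 1M tokens) to the page's tier label."""
    if input_per_1m <= 1.00:   # boundary assumed inclusive: $1.00 is Budget in the table
        return "Budget"
    if input_per_1m <= 5.00:   # boundary assumed inclusive: $5.00 is Mid in the table
        return "Mid"
    return "Frontier"


for name, price in [("Claude Haiku 4.5", 1.00), ("o4-mini", 1.10), ("o3", 10.00)]:
    print(name, price_tier(price))
```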

General vs Reasoning vs Both

General = standard chat/completion. Reasoning = uses internal thinking steps (o3, R1). Both = supports both modes, e.g., Sonnet 4.6 with extended thinking on or off.

Open-weight

Open-weight models have publicly released weights. You can self-host them or access via third-party APIs (Groq, Together, Fireworks). Prices shown are typical API rates; self-hosting costs vary by GPU.
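For readers who want to reproduce the page's filtering offline, the following sketch selects open-weight models under a price ceiling and sorts them by cost. The `Model` dataclass, the `open_weight_under` helper, and the five-row `SAMPLE` list are illustrative assumptions; the values are copied from the table above and are not a complete dataset.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Model:
    name: str
    provider: str
    input_per_1m: float   # USD per 1M input tokens
    output_per_1m: float  # USD per 1M output tokens
    open_weight: bool     # the OW column


# A handful of rows taken from the comparison table.
SAMPLE = [
    Model("Llama 3.3 70B", "Meta", 0.10, 0.32, True),
    Model("GPT-4o mini", "OpenAI", 0.15, 0.60, False),
    Model("Qwen3 32B", "Qwen", 0.15, 0.60, True),
    Model("DeepSeek R1", "DeepSeek", 0.55, 2.19, True),
    Model("Claude Haiku 4.5", "Anthropic", 1.00, 5.00, False),
]


def open_weight_under(models, max_input_per_1m):
    """Return open-weight models at or below the input price, cheapest first."""
    hits = [m for m in models if m.open_weight and m.input_per_1m <= max_input_per_1m]
    return sorted(hits, key=lambda m: (m.input_per_1m, m.output_per_1m))


names = [m.name for m in open_weight_under(SAMPLE, 0.20)]
print(names)  # prints ['Llama 3.3 70B', 'Qwen3 32B']
```

Sorting on the (input, output) pair breaks ties between models that share an input price, which happens often in the Budget tier.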

Page built: 01 Jun 2026