Claude Model Family
Anthropic's Claude models are organised into three tiers — Haiku (fast and cheap), Sonnet (the balanced workhorse), and Opus (maximum capability). Each generation raises the ceiling on all three. Understanding the tier system and current model IDs is the first step to using Claude effectively.
A Note on Parameters
Anthropic does not publicly disclose parameter counts for any Claude model. No official figure has ever been published for Haiku, Sonnet, or Opus. Estimates that circulate online come from third-party inference-speed analysis and should not be treated as authoritative. The dimensions that matter for practical use are context window, speed, cost, and benchmark performance — all of which Anthropic does publish.
The Naming Convention
Claude model names follow a consistent pattern: family tier + version number. The tier names are inspired by Japanese poetry forms — reflecting their positioning from concise-and-fast to expansive-and-powerful:
- Haiku — Fastest, cheapest. Designed for high-throughput tasks where latency and cost per token are the priority: classification, routing, extraction, simple Q&A.
- Sonnet — Balanced. The default choice for most production workloads: coding, analysis, writing, agentic tasks. Best value in the family.
- Opus — Most capable. Used for the hardest tasks: complex research, multi-step agentic reasoning, extended thinking, and tasks where output quality matters more than cost or speed.
Version numbers (3, 3.5, 3.7, 4, 4.5, 4.6) indicate the generation. A higher version is almost always strictly better than a lower version within the same tier. The API model ID (e.g. claude-sonnet-4-6) is what you use in code; the display name is what appears in Claude.ai.
Current Models (as of Mar 2026)
| Model | API Model ID | Context | Best For |
|---|---|---|---|
| Claude Haiku 4.5 | claude-haiku-4-5-20251001 | 200K | Classification, routing, extraction, high-volume pipelines |
| Claude Sonnet 4.6 | claude-sonnet-4-6 | 200K | Coding, analysis, writing, agents — the default production choice |
| Claude Opus 4.6 | claude-opus-4-6 | 200K | Complex research, extended thinking, hardest agentic tasks |
Previous Generation (still available)
| Model | Context | Notes |
|---|---|---|
| Claude 3.7 Sonnet | 200K (128K with extended thinking) | Hybrid reasoning model; extended thinking mode for hard problems |
| Claude 3.5 Sonnet | 200K | Strong coding + instruction following; widely adopted before 3.7 |
| Claude 3 Opus | 200K | Previous top-tier; superseded by 4.x Opus on most tasks |
| Claude 3 Haiku | 200K | Previous fast-tier; still lowest cost option on some platforms |
The 200K Context Window
All current Claude models share a 200,000-token context window — one of the largest in the industry. In practical terms:
- ~150,000 words of text (~500 pages)
- Entire medium-sized codebases
- Hours of meeting transcripts
- Multiple long documents simultaneously
Claude is particularly strong at accurately using its full context — it doesn't lose track of information near the middle or start of a long document (a known failure mode for some competing models, sometimes called the "lost in the middle" problem). This makes it a leading choice for long-document analysis tasks.
Extended Thinking
Starting with Claude 3.7 Sonnet and continued in the 4.x Opus tier, Claude supports extended thinking mode — where the model spends additional tokens on internal reasoning before producing its final response. This is Claude's equivalent of OpenAI's o-series reasoning.
- Activated via the API by setting a
thinkingparameter with a token budget - The internal thinking is returned as a separate
thinkingblock in the response - Most useful for: complex maths, formal logic, multi-step agentic tasks, hard coding problems
- Adds cost (thinking tokens are billed) and latency — use selectively, not by default
Multimodal Input
All Claude models accept text and images as input. Claude can read and reason about PDFs, screenshots, charts, diagrams, and mixed text+image documents. File upload limits:
- Claude.ai Free/Plus: up to 50MB per file
- Claude.ai Pro/Enterprise: up to 1GB per upload session
- API: image input via base64 or URL; file analysis via the Files API
Claude does not generate images or audio — it is text-in, text-out (with image/document input support).
Pricing Tier Logic
Anthropic pricing follows the tier hierarchy: Haiku is cheapest, Sonnet is mid, Opus is most expensive. For the current generation (check anthropic.com/pricing for live rates):
Haiku 4.5
Lowest cost per million tokens. Best for pipelines running millions of calls per day.
Sonnet 4.6
Mid-tier pricing. The best balance of cost and capability for most production workloads.
Opus 4.6
Highest cost per token. Reserve for tasks where output quality directly drives business value.
Input tokens are cheaper than output tokens across all models. Extended thinking tokens are billed at the input token rate but consumed before the response — factor this into cost estimates for Opus-heavy agentic workflows.
Checklist: Do You Understand This?
- Anthropic does not disclose parameter counts — context window and benchmarks are the public specs
- Three tiers: Haiku (fast/cheap) → Sonnet (balanced, default) → Opus (most capable)
- All current Claude models have a 200K token context window
- Extended thinking is available in Claude 3.7 Sonnet and 4.x Opus — adds reasoning depth at extra cost
- Claude accepts text + image/document input; does not generate images or audio
- Use the API model ID (e.g.
claude-sonnet-4-6) in code, not the display name