Beginner

Claude Model Family

Anthropic's Claude models are organised into three tiers — Haiku (fast and cheap), Sonnet (the balanced workhorse), and Opus (maximum capability). Each generation raises the ceiling on all three. Understanding the tier system and current model IDs is the first step to using Claude effectively.

A Note on Parameters

Anthropic does not publicly disclose parameter counts for any Claude model. No official figure has ever been published for Haiku, Sonnet, or Opus. Estimates that circulate online come from third-party inference-speed analysis and should not be treated as authoritative. The dimensions that matter for practical use are context window, speed, cost, and benchmark performance — all of which Anthropic does publish.

The Naming Convention

Claude model names follow a consistent pattern: family tier + version number. The tier names are inspired by Japanese poetry forms — reflecting their positioning from concise-and-fast to expansive-and-powerful:

  • Haiku — Fastest, cheapest. Designed for high-throughput tasks where latency and cost per token are the priority: classification, routing, extraction, simple Q&A.
  • Sonnet — Balanced. The default choice for most production workloads: coding, analysis, writing, agentic tasks. Best value in the family.
  • Opus — Most capable. Used for the hardest tasks: complex research, multi-step agentic reasoning, extended thinking, and tasks where output quality matters more than cost or speed.

Version numbers (3, 3.5, 3.7, 4, 4.5, 4.6) indicate the generation. A higher version is almost always strictly better than a lower version within the same tier. The API model ID (e.g. claude-sonnet-4-6) is what you use in code; the display name is what appears in Claude.ai.

Current Models (as of Mar 2026)

ModelAPI Model IDContextBest For
Claude Haiku 4.5claude-haiku-4-5-20251001200KClassification, routing, extraction, high-volume pipelines
Claude Sonnet 4.6claude-sonnet-4-6200KCoding, analysis, writing, agents — the default production choice
Claude Opus 4.6claude-opus-4-6200KComplex research, extended thinking, hardest agentic tasks

Previous Generation (still available)

ModelContextNotes
Claude 3.7 Sonnet200K (128K with extended thinking)Hybrid reasoning model; extended thinking mode for hard problems
Claude 3.5 Sonnet200KStrong coding + instruction following; widely adopted before 3.7
Claude 3 Opus200KPrevious top-tier; superseded by 4.x Opus on most tasks
Claude 3 Haiku200KPrevious fast-tier; still lowest cost option on some platforms

The 200K Context Window

All current Claude models share a 200,000-token context window — one of the largest in the industry. In practical terms:

  • ~150,000 words of text (~500 pages)
  • Entire medium-sized codebases
  • Hours of meeting transcripts
  • Multiple long documents simultaneously

Claude is particularly strong at accurately using its full context — it doesn't lose track of information near the middle or start of a long document (a known failure mode for some competing models, sometimes called the "lost in the middle" problem). This makes it a leading choice for long-document analysis tasks.

Extended Thinking

Starting with Claude 3.7 Sonnet and continued in the 4.x Opus tier, Claude supports extended thinking mode — where the model spends additional tokens on internal reasoning before producing its final response. This is Claude's equivalent of OpenAI's o-series reasoning.

  • Activated via the API by setting a thinking parameter with a token budget
  • The internal thinking is returned as a separate thinking block in the response
  • Most useful for: complex maths, formal logic, multi-step agentic tasks, hard coding problems
  • Adds cost (thinking tokens are billed) and latency — use selectively, not by default

Multimodal Input

All Claude models accept text and images as input. Claude can read and reason about PDFs, screenshots, charts, diagrams, and mixed text+image documents. File upload limits:

  • Claude.ai Free/Plus: up to 50MB per file
  • Claude.ai Pro/Enterprise: up to 1GB per upload session
  • API: image input via base64 or URL; file analysis via the Files API

Claude does not generate images or audio — it is text-in, text-out (with image/document input support).

Pricing Tier Logic

Anthropic pricing follows the tier hierarchy: Haiku is cheapest, Sonnet is mid, Opus is most expensive. For the current generation (check anthropic.com/pricing for live rates):

Haiku 4.5

Lowest cost per million tokens. Best for pipelines running millions of calls per day.

Sonnet 4.6

Mid-tier pricing. The best balance of cost and capability for most production workloads.

Opus 4.6

Highest cost per token. Reserve for tasks where output quality directly drives business value.

Input tokens are cheaper than output tokens across all models. Extended thinking tokens are billed at the input token rate but consumed before the response — factor this into cost estimates for Opus-heavy agentic workflows.

Checklist: Do You Understand This?

  • Anthropic does not disclose parameter counts — context window and benchmarks are the public specs
  • Three tiers: Haiku (fast/cheap) → Sonnet (balanced, default) → Opus (most capable)
  • All current Claude models have a 200K token context window
  • Extended thinking is available in Claude 3.7 Sonnet and 4.x Opus — adds reasoning depth at extra cost
  • Claude accepts text + image/document input; does not generate images or audio
  • Use the API model ID (e.g. claude-sonnet-4-6) in code, not the display name

Page built: 01 Jun 2026