Anthropic
Claude Haiku 4.5
Best ValueAnthropic's fastest and most affordable model in the Claude 4 generation, released October 2025. Claude Haiku 4.5 runs at 108.8 tokens/second — fast enough for real-time streaming — at $1/$5 per 1M tokens. Despite the low price, it scores an AA Intelligence Index of 31, placing it #13 of 60 proprietary models. It outperforms Claude Sonnet 4 on computer-use benchmarks (50.7% vs 42.2%) while costing three times less. Supports extended thinking mode (billed at $5/1M for thinking tokens), image input, and the full 200K context window shared across the Claude 4 generation.
Context window
200K tokens
API (blended)
$2.00/1M
Consumer access
Free (limited) / $20/mo
Multimodal
Yes
Strengths
- +Fastest Anthropic model: 108.8 t/s — suitable for real-time streaming and high-throughput pipelines
- +AA Intelligence Index 31 — top 25% of proprietary models at a budget price point
- +Outperforms Claude Sonnet 4 on computer-use benchmarks (50.7% vs 42.2%) at 3× lower cost
- +200K context window — same as Opus and Sonnet 4.6
- +Extended thinking mode available for complex tasks
- +Image input supported — describe, analyze, and reason about uploaded images
Weaknesses
- -Intelligence Index 31 vs Claude Opus 4.6's 53 — substantial capability gap for complex tasks
- -API output is $5/1M — more expensive per output token than comparable budget models (GPT-5 mini: $2/1M, Gemini 3 Flash: ~$1/1M)
- -Extended thinking tokens add cost quickly — heavy thinking tasks can approach Sonnet pricing
- -No native web search; relies on tool use for real-time information
Best for
Not ideal for
Pricing details
Subscription plans
API pricing
Prices verified February 2026. LLM pricing changes frequently — verify at the provider's site before budgeting.
Last updated: February 2026