
claude-haiku-4-5-20251001
The fastest model with near-frontier intelligence
2025-10-16
Input: $1/1M tokens
Output: $5/1M tokens
API Overview
Basic Information
- Claude Haiku 4.5 is a compact yet highly efficient language model launched by Anthropic in October 2025.
- It is the lightest, fastest, and most cost-effective model in the Claude family, while still delivering robust intelligence capabilities.
- It can be used across Anthropic’s development platform (API), Amazon Bedrock, Google Cloud Vertex AI, and other environments.
- Pricing is set at $1 per million input tokens and $5 per million output tokens.
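As a rough illustration of the pricing above, the following sketch estimates the cost of a request from its token counts. The function name and the example workload are hypothetical; only the two rates come from this page.

```python
# Rates quoted on this page, in US dollars per one million tokens.
INPUT_USD_PER_MTOK = 1.0
OUTPUT_USD_PER_MTOK = 5.0


def estimate_cost_usd(input_tokens: int, output_tokens: int) -> float:
    """Return the estimated request cost in US dollars at the listed rates."""
    input_cost = input_tokens / 1_000_000 * INPUT_USD_PER_MTOK
    output_cost = output_tokens / 1_000_000 * OUTPUT_USD_PER_MTOK
    return input_cost + output_cost


if __name__ == "__main__":
    # Hypothetical workload: 200,000 input tokens and 50,000 output tokens.
    print(f"${estimate_cost_usd(200_000, 50_000):.2f}")
```

Output tokens cost five times as much as input tokens at these rates, so long generations tend to dominate the bill.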
Key Features
- In coding tasks, computer use, and agentic workflows, its performance is comparable to that of Claude Sonnet 4.
- It runs more than twice as fast as Sonnet 4.
- It is cost-efficient, operating at about one-third the cost of Sonnet 4.
- It performs well on safety: according to Anthropic’s safety evaluations, Haiku 4.5 exhibits minimal bias and a significantly lower overall rate of misaligned behavior than both Sonnet 4.5 and Opus 4.1. It is released under the AI Safety Level 2 (ASL-2) standard.
Technical Highlights
- On SWE-bench Verified, a benchmark of real-world software engineering tasks, Haiku 4.5 scored 73.3%, placing it among the top-performing coding models.
- It supports parallel execution: Anthropic recommends using Sonnet 4.5 for high-level planning, then deploying multiple instances of Haiku 4.5 in parallel to handle subtasks, ensuring maximum efficiency.
- Optimized for latency-sensitive scenarios: Its low-latency design delivers smoother, more responsive experiences in conversational applications, customer service, and real-time agent workflows.
- Safety testing: Anthropic published a dedicated system card that reports how the model was evaluated for harmful behaviors, including chemical, biological, radiological, and nuclear (CBRN) risks and other hazardous activities.
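The planner-plus-parallel-workers pattern mentioned in the highlights above can be sketched roughly as follows. This is a minimal illustration, not Anthropic's implementation: `run_subtasks_in_parallel` and `stub_worker` are hypothetical names, and the stub stands in for a real call to claude-haiku-4-5-20251001 so the example runs without network access.

```python
from concurrent.futures import ThreadPoolExecutor
from typing import Callable, List


def run_subtasks_in_parallel(
    subtasks: List[str],
    worker: Callable[[str], str],
    max_workers: int = 4,
) -> List[str]:
    """Run `worker` on every subtask concurrently; results keep input order."""
    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        # Executor.map yields results in the order of the inputs,
        # regardless of which call finishes first.
        return list(pool.map(worker, subtasks))


def stub_worker(subtask: str) -> str:
    """Hypothetical stand-in for a call to claude-haiku-4-5-20251001."""
    return f"done: {subtask}"


if __name__ == "__main__":
    # In a real system this plan would come from the planning model.
    plan = ["refactor module A", "migrate module B", "update tests"]
    for result in run_subtasks_in_parallel(plan, stub_worker):
        print(result)
```

Threads are a reasonable fit here because each worker would spend most of its time waiting on a network response. Passing the worker in as a parameter keeps the fan-out logic independent of any particular client library.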
Application Scenarios
- Real-Time Chat and Customer Service Agents: Haiku 4.5’s rapid speed and low latency make it ideal for building real-time dialogue systems, customer service bots, and similar applications.
- Multi-Agent Systems (Agent Orchestration): Use Sonnet 4.5 to break a complex task into a plan, then delegate the resulting subtasks to multiple Haiku 4.5 sub-agents working in parallel.
- Cost-Effective Scalable Deployment: Perfect for budget-conscious applications, enabling Claude models to be deployed in free-tier environments or scaled to reach large user bases.
- Coding and Rapid Prototyping: In Claude Code, it significantly enhances responsiveness and efficiency in collaborative sub-agent tasks, such as code refactoring, migration, and other complex operations.
- Data Monitoring and Analysis: Suited to tracking large data streams, such as financial data and regulatory signals, to generate real-time analytical insights.
- Research Assistance: In research subtasks like literature reviews and data synthesis, Haiku 4.5 can simultaneously process information from multiple sources, accelerating the pace of scholarly work.
To use the model in Claude Code, change the API base URL to https://api.302.ai/cc or https://api.302ai.cn/cc, and use the key generated in the backend directly as your API key. The official API charges only 30% of the standard rate and also supports cache hits.
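One way to apply the base URL setting above is through environment variables when launching Claude Code. The sketch below assumes Claude Code reads `ANTHROPIC_BASE_URL` and `ANTHROPIC_AUTH_TOKEN`; this page does not say which variable the service expects the key in, so treat the choice as an assumption. The helper name `build_claude_code_env` is hypothetical.

```python
import os
from typing import Dict, Mapping, Optional

# Endpoint given on this page; https://api.302ai.cn/cc is the listed alternative.
BASE_URL = "https://api.302.ai/cc"


def build_claude_code_env(
    api_key: str,
    base_url: str = BASE_URL,
    base_env: Optional[Mapping[str, str]] = None,
) -> Dict[str, str]:
    """Return a copy of the environment with the base URL and key set."""
    env = dict(os.environ if base_env is None else base_env)
    # Assumption: Claude Code picks up these two variables at startup.
    env["ANTHROPIC_BASE_URL"] = base_url
    env["ANTHROPIC_AUTH_TOKEN"] = api_key
    return env


# Possible usage, assuming the `claude` command is installed and on PATH:
#   import subprocess
#   subprocess.run(["claude"], env=build_claude_code_env("YOUR_302_AI_KEY"))
```

Building a copy of the environment, rather than mutating `os.environ`, keeps the override scoped to the launched process.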