claude-sonnet-4-5-20250929-thinking

claude-sonnet-4-5-20250929-thinking

A model optimized for deep thinking, based on Claude-sonnet-4-5-20250929
2025-09-30
LLM
Model capability: imageModel capability: thinkingModel capability: function_call
Input:
$3/1M tokensstarting from
Output:
$15/1M tokensstarting from
Bulk order? Contact your manager for exclusive deals

API Overview

Basic Information

Claude Sonnet 4.5 is the latest-generation AI model officially released by Anthropic on September 29, 2025, built on Anthropic’s self-developed technical architecture. It represents a significant upgrade to the Claude series, with the model API ID claudesonnet-4-5-20250929, marking a major technological breakthrough by Anthropic in the fields of AI programming and complex agent development. Ordinary users can access it via the web interface, as well as through the Claude.ai app on iOS and Android. Developers can call it through the Claude Developer Platform, Amazon Bedrock, and Google Cloud’s Vertex AI. The knowledge cutoff date is January 2025, and the training data cutoff date is July 2025.

Core Features

Top-Tier Programming Capabilities: Positioned by the company as “the smartest model for complex agents and coding,” it excels in the SWE-bench Verified test, which measures real-world software engineering capabilities. Its single-model configuration accuracy reaches 77.2%, and further improves to 82.0% when parallel inference optimization is enabled—far surpassing similar competitors. It supports continuous autonomous work for over 30 hours and can generate approximately 11,000 lines of code at once, covering the entire workflow from project development and bug debugging to code refactoring, making it well-suited for enterprise-level software development needs.

Advantages in Complex Agent Development: It is “the most powerful tool for building complex agents,” boasting strong planning and coordination, memory management, and sub-agent scheduling capabilities. Paired with Anthropic’s open Claude Agent SDK, developers can easily build sophisticated AI agents with permission systems and task-splitting features, significantly lowering the barrier to entry for developing high-end agent applications and making it ideal for enterprise-level agent-building scenarios.

High-Efficiency Computer Operation Skills: In the OSWorld real-computer-task benchmark, it leads comparable models with a score of 61.4%, a substantial improvement over the previous version’s 42.2%. It can autonomously perform operating-system-level tasks such as browser navigation, spreadsheet processing, file management, and data entry, seamlessly interacting with various office and professional software tools, making it suitable for automation of office work and operations & maintenance scenarios.

Outstanding Reasoning and Knowledge Processing Abilities: It performs exceptionally well in specialized domain tests, achieving an 83.4% score on the GPQA Diamond graduate-level reasoning test, earning a perfect score on the 2025 AIME math competition, and attaining an 89.1% accuracy rate on multilingual question answering (MMMLU). In professional scenarios such as financial analysis, legal document processing, medical research, and STEM fields, its logical reasoning and knowledge application abilities are outstanding, making it an efficient auxiliary tool for specialized domains.

Multi-modal and Parameter Advantages: It supports both text and image inputs and has comprehensive multi-language processing capabilities. The maximum output per session is 64K tokens, with a standard context window of 200K tokens; the beta version can support up to 1M tokens via specific headers, meeting the demands of large-scale tasks such as long-form content generation, integrated reviews of multiple documents, and large-scale code development.

Technical Highlights

Developer Tool Upgrade: Equipped with Claude Code v2, it introduces a “checkpoint” feature that allows progress saving and rollback, preventing loss of progress due to operational errors. It provides a native VS Code extension and a brand-new terminal interface, enabling direct code execution and file creation within conversation contexts, greatly simplifying the development process and boosting efficiency.

Advanced Security Framework: Adopting the AI Safety Level 3 (ASL-3) release framework, it includes a high-precision classifier filter that proactively blocks high-risk content such as chemical, biological, and radioactive materials. Its ability to defend against prompt injection attacks has significantly improved, with a false-positive rate reduced by more than tenfold compared to the previous version. Additionally, it introduces for the first time a mechanism for interpretability, strengthening security assessment and control.

Cost and Deployment Optimization: It supports flexible deployment across multiple platforms, with an input pricing of $3 per million tokens and an output pricing of $15 per million tokens. An innovative prompt caching feature reduces costs by up to 90%, while batch-processing scenarios save up to 50% of costs. This achieves a balance between cost and efficiency on a high-performance basis, making it suitable for enterprises of all sizes.

Market Impact

The launch of Claude Sonnet 4.5 is seen by the industry as a major milestone in the fields of AI programming and complex agent development. Its three-dimensional core capabilities—top-tier programming, complex agent development, and high-efficiency computer operation—mark the entry of AI into a deeper stage of application in production-level scenarios. With its leading performance in specialized tests, it has become the preferred model for enterprise-level software development, complex agent construction, high-end automated office work, and specialized research, poised to drive efficiency innovations across related industries. It is particularly favored by sectors such as finance, high-end software R&D, and biopharmaceuticals, where performance and security are critical requirements.


Related Review: “Claude Sonnet 4.5 vs. GLM-4.6: A Battle of Chinese and Foreign Large Models in Programming—Has the Winner Been Decided Yet?”

Playground

Log in to explore more features! Click to Log In

API Analytics

API Reference (7)

API DescriptionAPI EndpointRequest MethodStabilityParameter Description
Chat(Talk)
POST
Stable
View Details
Chat(Analyze image)
POST
Stable
View Details
Chat(Function Call)
POST
Stable
View Details
Messages(Original format)
POST
Stable
View Details
Messages(Function Call)
POST
Stable
View Details
Messages(Thinking mode)
POST
Stable
View Details
Messages(128k output)
POST
Stable
View Details

API Pricing

$
ModelDescriptionContextOfficial Price302.AI Price

claude-sonnet-4-5-20250929-thinking

≤ 200K input tokens , Cache write: $3.75 /1M tokens, Cache Read: $0.3 /1M tokens
200000

Input$3 / 1M tokens
Output$15 / 1M tokens

Input$3/ 1M tokens
Output$15/ 1M tokens
Original Price

claude-sonnet-4-5-20250929-thinking

> 200K input tokens , Cache write: $3.75 /1M tokens, Cache Read: $0.3 /1M tokens
200000

Input$6 / 1M tokens
Output$22.5 / 1M tokens

Input$6/ 1M tokens
Output$22.5/ 1M tokens
Original Price