gpt-5.1

gpt-5.1

The best model for coding and agentic tasks with configurable reasoning effort
2025-11-14
LLM
Model capability: imageModel capability: function_call
Input:
$1.25/1M tokens
Output:
$10/1M tokens
Bulk order? Contact your manager for exclusive deals

API Overview

GPT-5.1 is a next-generation GPT-5 series model designed for agents and coding tasks, featuring enhanced instruction understanding, controllable tool invocation, and adaptive reasoning capabilities. Its key strengths lie in maintaining high reliability in complex tasks and delivering ultra-fast responses in simple tasks, while also boasting state-of-the-art code editing and long-term task execution abilities. It’s ideal for developers, automation engineers, and agent builders looking to achieve efficient coding, seamless toolchain execution, and low-latency applications.

───────────────────────────────────────────────────────────────────

  • Ultra-fast and energy-efficient: Equipped with a low-latency “no-inference” mode, it delivers lightning-fast response times, supports multi-turn interactions and efficient tool invocation, and is well-suited for high-performance computing needs.
  • Top-tier coding capabilities: Featuring the apply_patch tool and structured code editing, it supports Shell tool execution, achieving 76.3% SWE-bench Verified performance and adapting to a wide range of coding scenarios.
  • Agent optimization: Its adaptive reasoning capability ensures high reliability for complex tasks while providing ultra-fast responses for simple tasks, perfectly meeting the needs of developers, engineers, and agent system builders.

───────────────────────────────────────────────────────────────────

Core Capabilities

⚡ Adaptive Reasoning and Ultra-Fast Response

Automatically accelerates simple tasks while maintaining high reliability for complex ones; supports a low-latency no-inference mode for the fastest tool invocation.

🛠️ Top-Notch Coding and Controllable Editing

The apply_patch tool enables precise, structured code modifications, achieving 76.3% SWE-bench Verified performance and making it ideal for long-term, complex coding tasks.

📡 Powerful Toolchain Execution

Natively supports Shell tools, allowing you to build “plan—execute” agent loops through command execution.

💰 Lower Costs

A 24-hour prompt cache reduces input token costs by 90%, and token consumption for tool-intensive tasks is only half that of competitors.

🌐 Smooth Migration

Compatible with GPT-5 pricing and rate limits; the previous GPT-5 version remains available, making it easy to seamlessly transition to this new model without disrupting your existing workflows.

Playground

Log in to explore more features! Click to Log In

API Analytics

API Reference (5)

API DescriptionAPI EndpointRequest MethodStabilityParameter Description
Chat(Talk)
POST
Stable
View Details
Chat (Image Analysis)
POST
Stable
View Details
Chat (Structured Output)
POST
Stable
View Details
Chat (function call)
POST
Stable
View Details
Responses
POST
Stable
View Details

API Pricing

$
ModelDescriptionContextOfficial Price302.AI Price

gpt-5.1

-
400000

Input$1.25 / 1M tokens
Output$10 / 1M tokens

Input$1.25/ 1M tokens
Output$10/ 1M tokens
Original Price