gpt-5.4-nano-2026-03-17

gpt-5.4-nano-2026-03-17

A low-cost, low-latency model focused on handling high-frequency, small-scale tasks.
2026-03-19
LLM
Model capability: imageModel capability: function_call
Input:
$0.2/1M tokens
Output:
$1.25/1M tokens
Bulk order? Contact your manager for exclusive deals
稳定性
Stable

API Overview

GPT-5.4-Nano is the smallest and fastest “miniaturized” model in the GPT-5.4 series, optimized specifically for scenarios requiring AI logic to run with extremely low resource consumption. It sheds unnecessary complexity while retaining the core logical framework of the 5.4 series, enabling precise understanding of short text sequences, logical reasoning, and instruction execution in a remarkably short time. The Nano model is not only an excellent foundation for embedded devices, mobile applications, and lightweight APIs but also the ideal choice for building AI applications with “high-frequency perception” capabilities.───────────────────────────────────────────────────────────────────

Core Capabilities


Ultra-lightweight Deployment: With an extremely small parameter size, it significantly reduces reliance on computing resources such as CPUs and GPUs. It can run not only in the cloud but also be easily embedded into local devices, providing dual guarantees of privacy and efficiency.

Millisecond-level Real-time Response: It achieves truly “instantaneous response,” compressing latency to nearly imperceptible microsecond or millisecond levels in dialogue interactions, intent recognition, and fast-triggered tasks.

Extreme Cost-effectiveness: Offering exceptionally high cost performance per unit resource, it supports massive concurrent requests, making it a critical infrastructure for reducing operational costs in large-scale AI applications.

Superior Tool Compatibility: Optimized specifically for “instruction execution” and “function calls,” it can precisely trigger downstream API actions using the fewest possible tokens, serving as a “bridge model” that connects complex intelligent systems with the real world.

Playground

Log in to explore more features! Click to Log In

API Analytics

API Reference (5)

API DescriptionAPI EndpointRequest MethodStabilityParameter Description
Chat(Talk)
POST
Stable
View Details
Chat (Image Analysis)
POST
Stable
View Details
Chat (Structured Output)
POST
Stable
View Details
Chat (function call)
POST
Stable
View Details
Responses
POST
Stable
View Details

API Pricing

$
ModelDescriptionContextOfficial Price302.AI Price

gpt-5.4-nano-2026-03-17

-
400000

Input$0.2 / 1M tokens
Output$1.25 / 1M tokens

Input$0.2/ 1M tokens
Output$1.25/ 1M tokens
Original Price