gpt-5.4-mini-2026-03-17

gpt-5.4-mini-2026-03-17

A lightweight version of GPT-5.4 for high-frequency, simple scenarios that are sensitive to cost and speed.
2026-03-19
LLM
Model capability: imageModel capability: function_call
Input:
$0.75/1M tokens
Output:
$4.5/1M tokens
Bulk order? Contact your manager for exclusive deals
稳定性
Stable

API Overview

GPT-5.4-Mini is a lightweight model in OpenAI’s GPT-5.4 series, renowned for its exceptional energy efficiency ratio. While inheriting GPT-5.4’s powerful logical reasoning and instruction-following capabilities, it achieves faster inference speeds and extremely low per-unit operating costs through a streamlined architectural design. The Mini model is designed to provide developers with the optimal balance between high performance and cost-effectiveness, making it an ideal foundation for building high-frequency real-time applications, mobile AI features, and agents for large-scale production environments.

───────────────────────────────────────────────────────────────────

Core Capabilities


Ultimate Reasoning Cost-Effectiveness: Delivers GPT-5.4-level intelligent experiences while significantly optimizing token consumption and inference latency, dramatically reducing long-term operational costs for enterprise-level applications.

Low-Latency Interaction Experience: With ultra-short prefill time (Time to First Token) and outstanding generation rates, it ensures a smooth and seamless experience in dialogue, retrieval-augmented generation (RAG), and real-time recommendation scenarios.

High-Performance Optimization for Agents: Specifically trained to enhance performance in short-sequence tasks, function calls, and tool usage, making it the preferred choice for lightweight agents and automated task execution.

Multi-Environment Deployment Flexibility: Thanks to its lightweight nature, it’s not only suitable for cloud-based API calls but also ideally integrated into resource-constrained environments—such as edge devices and on-premises clients—meeting diverse business deployment needs.

Playground

Log in to explore more features! Click to Log In

API Analytics

API Reference (5)

API DescriptionAPI EndpointRequest MethodStabilityParameter Description
Chat(Talk)
POST
Stable
View Details
Chat (Image Analysis)
POST
Stable
View Details
Chat (Structured Output)
POST
Stable
View Details
Chat (function call)
POST
Stable
View Details
Responses
POST
Stable
View Details

API Pricing

$
ModelDescriptionContextOfficial Price302.AI Price

gpt-5.4-mini-2026-03-17

-
400000

Input$0.75 / 1M tokens
Output$4.5 / 1M tokens

Input$0.75/ 1M tokens
Output$4.5/ 1M tokens
Original Price