gpt-image-2

gpt-image-2

OpenAI's latest flagship image generation model
2026-04-22
Image Generations
Input:
$5/1M tokensstarting from
Output:
$30/1M tokensstarting from
Bulk order? Contact your manager for exclusive deals

API Overview

Compared to the previous V1.5 version, GPT-Image-2 demonstrates an intergenerational leap forward, with its core strengths centered on commercially viable performance, highly accurate text rendering, and exceptionally realistic photographic质感.

───────────────────────────────────────────────────────────────────

Core Capabilities

Ultra-high-level text rendering capability: This addresses the long-standing “garbled text” issue in the AI image generation field, enabling accurate rendering of long strings, UI labels, signboard texts, and more within images. It not only supports multilingual generation but also significantly improves text accuracy in complex environments—reaching an accuracy rate that is nearly 100%.

Photographic-level realism and material textures: This markedly reduces the “plastic-like” appearance commonly seen in earlier AI-generated images, delivering a level of realism in lighting, physical perspective, and material textures that closely mimics actual photographs taken by a camera. It achieves top-tier performance in areas such as game visual simulation, fine skin texture rendering, and creating a truly photographic feel.

Strong prompt control and complex layout capabilities: It exhibits superior logical restoration ability when handling long instructions and complex textual descriptions. It can precisely control composition and layout, excelling in scenarios requiring exact typography, data visualization, and the creation of information graphics with high information density.

Commercial-grade application capabilities: Beyond entertainment creation, it better aligns with real-world workflows, enabling the production of professional-quality UI interface screenshots, product posters, design sketches, and more. Character settings are more stable, making it easier to distinguish and maintain complex character designs.

Evolution and improvements over v1.5: It corrects common color deviation issues found in earlier versions. The human body structure is more anatomically accurate, reducing errors such as limb distortion.

API Console

Log in to explore more features! Click to Log In

API Analytics

API Reference (2)

API DescriptionAPI EndpointRequest MethodStabilityParameter Description
Generations(Generate image)
POST
Stable
View Details
Edit(Modify Image)
POST
Stable
View Details

API Pricing

$
ModelDescriptionOfficial Price302.AI Price

Generations(Generate image)

text input

Input$5 / 1M tokens
Output$30 / 1M tokens

Input$5/ 1M tokens
Output$30/ 1M tokens
Original Price

Generations(Generate image)

Image input

Input$8 / 1M tokens
Output$30 / 1M tokens

Input$8/ 1M tokens
Output$30/ 1M tokens
Original Price

Edit(Modify Image)

Text input

Input$5 / 1M tokens
Output$30 / 1M tokens

Input$5/ 1M tokens
Output$30/ 1M tokens
Original Price

Edit(Modify Image)

Image input

Input$8 / 1M tokens
Output$30 / 1M tokens

Input$8/ 1M tokens
Output$30/ 1M tokens
Original Price