gpt-5.5

OpenAI’s most powerful agent model to date
Model
LLM
Model capabilities: image, function_call
Input:
starting from $5/1M tokens
Output:
starting from $30/1M tokens

happyhorse-1.0-r2v

Alibaba Group’s next-generation cutting-edge AI video generation model
API
Video Generation
Pricing:
starting from $0.156/second

happyhorse-1.0-i2v

Alibaba Group’s next-generation cutting-edge AI video generation model
API
Video Generation
Pricing:
starting from $0.156/second

happyhorse-1.0-t2v

Alibaba Group’s next-generation cutting-edge AI video generation model
API
Video Generation
Pricing:
starting from $0.156/second

deepseek-v4-pro

The latest flagship model in the DeepSeek series, setting the current highest standard for both scale and performance among open-source models.
Model
LLM
Model capabilities: thinking, function_call
Input:
$1.72/1M tokens
Output:
$3.43/1M tokens

deepseek-v4-flash

DeepSeek’s newly released language model, designed for high-performance production scenarios and optimized for inference efficiency and response speed.
Model
LLM
Model capabilities: thinking, function_call
Input:
$0.143/1M tokens
Output:
$0.286/1M tokens

gpt-image-2

OpenAI's latest flagship image generation model
API
Image Generations
Input:
starting from $5/1M tokens
Output:
starting from $30/1M tokens

GPT Image 2 Canvas

Generate and edit images using the GPT Image 2 model, including annotation and image stitching.
App
Image Processing
Pricing:
Depends on the specific model used

kimi-k2.6

Kimi K2.6 is Kimi's newest and most intelligent model.
Model
LLM
Model capabilities: image, video, thinking, function_call
Input:
$0.95/1M tokens
Output:
$4/1M tokens

qwen3.6-flash

The Flash model of the Qwen3.6 native vision-language series delivers significantly improved performance over the 3.5-Flash model.
Model
LLM
Model capabilities: image, video, thinking, function_call
Input:
starting from $0.19/1M tokens
Output:
starting from $1.13/1M tokens

qwen3.6-35b-a3b

The Qwen3.6 series 35B-A3B native vision-language model uses a hybrid architecture that integrates linear attention with a sparse mixture-of-experts design.
Model
LLM
Model capabilities: image, video, thinking, function_call
Input:
$0.28/1M tokens
Output:
$1.7/1M tokens

wan2.7-videoedit

The video editing feature of the wan2.7 series reliably preserves details such as the image subject, style, and text.
API
Video Generation
Pricing:
starting from $0.1/second

wan2.7-r2v

The reference video feature of the wan2.7 series reliably preserves details such as the image subject, style, and text.
API
Video Generation
Pricing:
starting from $0.1/second

wan2.7-t2v

The text-to-video feature of the wan2.7 series offers smooth motion, cinematic aesthetic control, and precise instruction adherence.
API
Video Generation
Pricing:
starting from $0.1/second

wan2.7-i2v

The image-to-video feature of the wan2.7 series reliably preserves details such as the main subject, style, and text of the image.
API
Video Generation
Pricing:
starting from $0.1/second