SiliconFlow

Pro/zai-org/GLM-5

Zhipu AI's new-generation flagship foundation model, designed for complex systems engineering and long-horizon agent tasks, with a real-world programming experience described as approaching that of Claude Opus 4.5.
Model
LLM
Model capabilities: thinking, function_call
Input:
From $0.572/1M tokens
Output:
From $2.58/1M tokens

Pro/moonshotai/Kimi-K2.5

Kimi's most versatile and intelligent multimodal model to date, excelling at agent tasks, coding, and visual understanding.
Model
LLM
Model capabilities: image, video, thinking, function_call
Input:
$0.572/1M tokens
Output:
$3/1M tokens
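
The per-token rates above convert to a request cost by simple proportion. A minimal sketch, assuming a hypothetical helper `estimate_cost` and an illustrative workload of 200,000 input tokens and 50,000 output tokens (neither comes from the listing); only the two rates are taken from the Pro/moonshotai/Kimi-K2.5 entry:

```python
from decimal import Decimal

# Rates copied from the Pro/moonshotai/Kimi-K2.5 entry (USD per 1M tokens).
INPUT_RATE = Decimal("0.572")
OUTPUT_RATE = Decimal("3")
TOKENS_PER_UNIT = Decimal(1_000_000)


def estimate_cost(input_tokens: int, output_tokens: int,
                  input_rate: Decimal = INPUT_RATE,
                  output_rate: Decimal = OUTPUT_RATE) -> Decimal:
    """Estimate the USD cost of one workload.

    Hypothetical helper for illustration; not part of any SiliconFlow SDK.
    """
    total = Decimal(input_tokens) * input_rate + Decimal(output_tokens) * output_rate
    return total / TOKENS_PER_UNIT


# Illustrative workload, assumed for this example only.
cost = estimate_cost(200_000, 50_000)
print(f"${cost:.4f}")
```

`Decimal` is used rather than `float` so that small per-token amounts add up without binary rounding noise. Actual billing may differ, for example through tiered or cached-token pricing that this listing does not describe.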

Pro/zai-org/GLM-4.7

Zhipu’s flagship text model, covering complex task solving and efficient creation across multiple scenarios.
Model
LLM
Model capabilities: thinking, function_call
Input:
$0.572/1M tokens
Output:
$2.286/1M tokens

Pro/deepseek-ai/DeepSeek-V3.2

An open-source large language model from DeepSeek, built on a mixture-of-experts (MoE) architecture.
Model
LLM
Model capabilities: thinking, function_call
Input:
$0.286/1M tokens
Output:
$0.429/1M tokens

deepseek-ai/DeepSeek-V3.2

An open-source large language model from DeepSeek, built on a mixture-of-experts (MoE) architecture.
Model
LLM
Model capabilities: thinking, function_call
Input:
$0.286/1M tokens
Output:
$0.429/1M tokens

Qwen/Qwen3-VL-32B-Thinking

The reasoning (thinking) version of the largest dense model in the Qwen3-VL series, with multimodal reasoning capabilities second only to Qwen3-VL-235B-Thinking.
Model
LLM
Model capabilities: image, function_call
Input:
$0.143/1M tokens
Output:
$1.429/1M tokens

Qwen/Qwen3-VL-32B-Instruct

The non-reasoning (instruct) version of the largest dense model in the Qwen3-VL series, with overall performance second only to Qwen3-VL-235B-Instruct.
Model
LLM
Model capabilities: image, function_call
Input:
$0.143/1M tokens
Output:
$0.572/1M tokens

Kwaipilot/KAT-Dev

An open-source 32B-parameter model designed for software engineering tasks.
Model
LLM
Input:
$0.143/1M tokens
Output:
$0.572/1M tokens

inclusionAI/Ling-flash-2.0

A large language model from inclusionAI (Ant Group, an Alibaba affiliate), built on a mixture-of-experts (MoE) architecture.
Model
LLM
Model capabilities: thinking, function_call
Input:
$0.143/1M tokens
Output:
$0.572/1M tokens