Qwen/Qwen3-Omni-30B-A3B-Thinking

Qwen/Qwen3-Omni-30B-A3B-Thinking

A model from Alibaba focusing on complex multimodal reasoning
2025-09-23
LLM
Model capability: imageModel capability: thinkingModel capability: function_call
Input:
$0.1/1M tokens
Output:
$0.4/1M tokens
Bulk order? Contact your manager for exclusive deals

API Overview

Qwen3-Omni-30B-A3B-Thinking is the core "Thinker" component within the Qwen3-Omni multimodal model. It is specifically designed to handle multi-modal inputs—including text, audio, images, and video—and perform complex chain-of-thought reasoning. As the brain behind the reasoning process, this model unifies all inputs into a common representation space for understanding and analysis, though its output remains in text form. This design enables it to excel in tackling intricate problems that require deep thinking and cross-modal comprehension—such as solving math problems embedded in images—and serves as the key to unlocking the powerful cognitive capabilities of the entire Qwen3-Omni architecture.

Playground

Log in to explore more features! Click to Log In

API Analytics

API Reference (1)

API DescriptionAPI EndpointRequest MethodStabilityParameter Description
Chat(SiliconFlow)
POST
Stable
View Details

API Pricing

$
ModelDescriptionContextOfficial Price302.AI Price

Qwen/Qwen3-Omni-30B-A3B-Thinking

-
64000

Input$0.1 / 1M tokens
Output$0.4 / 1M tokens

Input$0.1/ 1M tokens
Output$0.4/ 1M tokens
Original Price