
Qwen/Qwen3-Omni-30B-A3B-Thinking
API Overview
Qwen3-Omni-30B-A3B-Thinking is the core "Thinker" component within the Qwen3-Omni multimodal model. It is specifically designed to handle multi-modal inputs—including text, audio, images, and video—and perform complex chain-of-thought reasoning. As the brain behind the reasoning process, this model unifies all inputs into a common representation space for understanding and analysis, though its output remains in text form. This design enables it to excel in tackling intricate problems that require deep thinking and cross-modal comprehension—such as solving math problems embedded in images—and serves as the key to unlocking the powerful cognitive capabilities of the entire Qwen3-Omni architecture.
Playground
Log in to explore more features! Click to Log In