
Qwen/Qwen3-VL-30B-A3B-Instruct
A visual language model from Alibaba
2025-09-23
Input:
$0.1/1M tokens
Output:
$0.4/1M tokens
Bulk order? Contact your manager for exclusive deals
API Overview
Qwen3-VL is the most powerful visual language model in the Qwen series to date. The model has undergone a comprehensive upgrade, featuring enhanced text understanding and generation, deeper visual perception and reasoning capabilities, extended context length, improved spatial and video dynamic understanding, and stronger agent interaction abilities. As an instruction-fine-tuned version based on a Mixture-of-Experts (MoE) architecture, it is specifically designed for flexible, on-demand deployment, offering robust visual agents, visual encoding, and video comprehension capabilities, while natively supporting up to 256K context length.
Playground
Log in to explore more features! Click to Log In
API Analytics
API Reference (1)
API Pricing
$¥ 円 ₽