
Qwen/Qwen3-VL-8B-Thinking
The Vision-Language Model of the Qwen3 Series
2025-10-15
Input:
$0.072/1M tokens
Output:
$0.715/1M tokens
Bulk order? Contact your manager for exclusive deals
API Overview
Qwen3-VL-8B-Thinking is a vision-language model from the Qwen3 series, optimized specifically for scenarios requiring complex reasoning. It is based on the Qwen3-8B-Instruct model and supports “Thinking Mode.” In this mode, the model performs step-by-step thinking and reasoning before providing the final answer, significantly enhancing its performance in complex visual question answering, multi-step instruction following, and logical reasoning tasks. This model also excels in general visual understanding, image-and-text dialogue, and multilingual text recognition, and supports high-definition images with resolutions up to 1024x1024.
Playground
Log in to explore more features! Click to Log In
API Analytics
API Reference (1)
API Pricing
$¥ 円 ₽