Qwen/Qwen3-VL-8B-Thinking

Qwen/Qwen3-VL-8B-Thinking

The Vision-Language Model of the Qwen3 Series
2025-10-15
LLM
Model capability: image
Input:
$0.072/1M tokens
Output:
$0.715/1M tokens
Bulk order? Contact your manager for exclusive deals

API Overview

Qwen3-VL-8B-Thinking is a vision-language model from the Qwen3 series, optimized specifically for scenarios requiring complex reasoning. It is based on the Qwen3-8B-Instruct model and supports “Thinking Mode.” In this mode, the model performs step-by-step thinking and reasoning before providing the final answer, significantly enhancing its performance in complex visual question answering, multi-step instruction following, and logical reasoning tasks. This model also excels in general visual understanding, image-and-text dialogue, and multilingual text recognition, and supports high-definition images with resolutions up to 1024x1024.

Playground

Log in to explore more features! Click to Log In

API Analytics

API Reference (1)

API DescriptionAPI EndpointRequest MethodStabilityParameter Description
Chat(SiliconFlow)
POST
Stable
View Details

API Pricing

$
ModelDescriptionContextOfficial Price302.AI Price

Qwen/Qwen3-VL-8B-Thinking

-
256000

Input$0.072 / 1M tokens
Output$0.715 / 1M tokens

Input$0.072/ 1M tokens
Output$0.715/ 1M tokens
Original Price