qwen-vl-max-latest

qwen-vl-max-latest

Image recognition model from Alibaba Qwen
2025-08-15
LLM
Model capability: image
Input:
$0.23/1M tokens
Output:
$0.572/1M tokens
Bulk order? Contact your manager for exclusive deals

API Overview

Qwen-VL-Max (qwen-vl-max), the ultra-large-scale vision-language model from Qwen. Compared to the enhanced version, it further improves visual reasoning and instruction-following capabilities, delivering higher levels of visual perception and cognition. It provides optimal performance across a wider range of complex tasks.

Application Scenarios

  • Image Question Answering: Describe the content in images or classify and label them, such as identifying people, locations, flowers, birds, fish, and other creatures.
  • Solving Math Problems: Answer math questions presented in images, suitable for primary, secondary, university, and adult education levels.
  • Video Understanding: Analyze video content, such as pinpointing specific events and retrieving timestamps, or generating summaries of key time periods.
  • Object Localization: Locate objects within images and return the coordinates of the top-left and bottom-right corners of their bounding boxes, or the coordinates of their center points.
  • Document Parsing: Convert image-based documents (such as scanned copies or image-based PDFs) into QwenVL's HTML format, which not only accurately identifies text but also captures the positional information of images, tables, and other elements.
  • Text Recognition and Information Extraction: Recognize text and formulas within images, or extract information from documents like bills, certificates, and forms, with support for formatted text output. Supported languages include Chinese, English, Japanese, Korean, Arabic, Vietnamese, French, German, Italian, Spanish, and Russian.

Playground

Log in to explore more features! Click to Log In

API Analytics

API Reference (1)

API DescriptionAPI EndpointRequest MethodStabilityParameter Description
Chat (Tongyi Qianwen-VL)
POST
Stable
View Details

API Pricing

$
ModelDescriptionContextOfficial Price302.AI Price

qwen-vl-max-latest

-
128000

Input$0.23 / 1M tokens
Output$0.572 / 1M tokens

Input$0.23/ 1M tokens
Output$0.572/ 1M tokens
Original Price