qwen3.5-flash

qwen3.5-flash

Alibaba’s Qwen3.5 series supports multimodal capabilities.
2026-02-25
LLM
Model capability: imageModel capability: videoModel capability: thinkingModel capability: function_call
Input:
$0.03/1M tokensstarting from
Output:
$0.29/1M tokensstarting from
Bulk order? Contact your manager for exclusive deals

API Overview

Qwen3.5-Flash is a lightweight model in the Tongyi Qianwen series that emphasizes “high cost-effectiveness.” Building on the powerful logical reasoning and multimodal capabilities of the Qwen3.5 series, it has been deeply optimized for high-concurrency, low-latency business scenarios. Qwen3.5-Flash is designed to provide developers with the fastest inference speed and extremely low invocation costs, making it an ideal high-performance foundation for handling large-scale, high-frequency tasks, building edge AI applications, and enabling real-time interactive scenarios. ───────────────────────────────────────────────────────────────────

Core Capabilities

Ultimate Inference Efficiency: Specifically designed for high-throughput scenarios, it significantly reduces response times, meeting the latency-sensitive requirements of real-time chat and automated task processing.

Outstanding Cost Efficiency: While maintaining high-quality outputs, it dramatically lowers computing and invocation costs, making it particularly suitable for enterprises that need to process large volumes of data in batches or build highly concurrent applications.

Multimodal Processing Capability: Although positioned as a lightweight model, it still boasts excellent text and visual semantic understanding capabilities, supporting rapid image-and-text question answering and basic visual tasks, striking a balance between lightness and intelligence.

Highly Friendly to Production Environments: Optimized for production environments, it features stable output performance and ease of use, seamlessly integrating with various mainstream development frameworks for rapid deployment of AI functionalities.

Playground

Log in to explore more features! Click to Log In

API Analytics

API Reference (1)

API DescriptionAPI EndpointRequest MethodStabilityParameter Description
qwen3.5-flash
POST
Stable
View Details

API Pricing

$
ModelDescriptionContextOfficial Price302.AI Price

qwen3.5-flash

Input <= 128K
992000

Input$0.03 / 1M tokens
Output$0.29 / 1M tokens

Input$0.03/ 1M tokens
Output$0.29/ 1M tokens
Original Price

qwen3.5-flash

128K-256K
992000

Input$0.12 / 1M tokens
Output$1.15 / 1M tokens

Input$0.12/ 1M tokens
Output$1.15/ 1M tokens
Original Price

qwen3.5-flash

256K-1M
992000

Input$0.18 / 1M tokens
Output$1.72 / 1M tokens

Input$0.18/ 1M tokens
Output$1.72/ 1M tokens
Original Price