glm-4.6v-flash

glm-4.6v-flash

Zhipu Lightweight Visual Understanding Model
2025-12-08
LLM
Model capability: imageModel capability: function_call
Pricing:
Free
Bulk order? Contact your manager for exclusive deals

API Overview

GLM-4.6V is the flagship multimodal visual understanding model launched by Zhipu AI. GLM-4.6V-Flash is its 9B lightweight version, designed for local deployment and low-latency scenarios. It is compatible with ordinary hardware and mainstream inference frameworks, and its overall performance surpasses that of Qwen3-VL-8B with the same parameters.

  • Clear Positioning: Supports multiple types of input and output, with a context window of up to 128K tokens, focusing on flagship-level visual reasoning.
  • Multiple Capabilities: Features core abilities including deep thinking, visual understanding, and streaming output.
  • Rich Scenarios: Suitable for complex tasks such as image understanding, video understanding, and document question answering.
  • Technologically Leading: Natively supports multimodal tool calls and achieves state-of-the-art performance on multimodal evaluation benchmarks.

───────────────────────────────────────────────────────────────────

Core Capabilities

🔍 Visual Understanding: Supports various types of input, accurately identifying content, attributes, and scenes, and can be used for tasks like invoice OCR.

🤔 Deep Thinking: The thinking mode can be flexibly enabled or disabled, enhancing the reasoning and analytical capabilities for handling complex tasks.

💬 Streaming Output: Generates responses in real time, optimizing the user interaction experience in scenarios such as dialogue systems.

🛠 Function Call: Natively supports tool calls and integrates external tools, enabling features like mixed-text-and-image output.

⚡ Context Caching: Smart caching optimizes performance for long conversations, supporting efficient processing of long texts and videos.

Playground

Log in to explore more features! Click to Log In

API Analytics

API Reference (1)

API DescriptionAPI EndpointRequest MethodStabilityParameter Description
Chat (Zhipu GLM-4V)
POST
Stable
View Details

API Pricing

$
ModelDescriptionContext302.AI Price

glm-4.6v-flash

glm-4.6v-flash
128000

Free