glm-4.6v

glm-4.6v

Zhipu High-Performance Visual Understanding Model
2025-12-08
LLM
Model capability: imageModel capability: function_call
Input:
$0.145/1M tokensstarting from
Output:
$0.43/1M tokensstarting from
Bulk order? Contact your manager for exclusive deals

API Overview

GLM-4.6V is a flagship multimodal visual understanding model launched by Zhipu AI, primarily designed for efficient visual reasoning and equipped with powerful capabilities such as deep thinking and streaming output.

  • Clear Positioning: Supports multiple types of input and output, with a context window of up to 128K tokens, focusing on flagship-level visual reasoning.
  • Multiple Capabilities: Features core abilities including deep thinking, visual understanding, and streaming output.
  • Rich Scenarios: Suitable for complex tasks such as image understanding, video understanding, and document question answering.
  • Technologically Leading: Natively supports multimodal tool calls and achieves state-of-the-art performance on multimodal evaluation benchmarks.

───────────────────────────────────────────────────────────────────

Core Capabilities

🔍 Visual Understanding: Supports various types of input, accurately identifying content, attributes, and scenes, and can be used for tasks like invoice OCR.

🤔 Deep Thinking: The thinking mode can be flexibly enabled or disabled, enhancing the reasoning and analytical capabilities for handling complex tasks.

💬 Streaming Output: Generates responses in real time, optimizing the user interaction experience in scenarios such as dialogue systems.

🛠 Function Call: Natively supports tool calls and integrates external tools, enabling features like mixed-text-and-image output.

⚡ Context Caching: Smart caching optimizes performance for long conversations, supporting efficient processing of long texts and videos.

Playground

Log in to explore more features! Click to Log In

API Analytics

API Reference (1)

API DescriptionAPI EndpointRequest MethodStabilityParameter Description
Chat (Zhipu GLM-4V)
POST
Stable
View Details

API Pricing

$
ModelDescriptionContextOfficial Price302.AI Price

glm-4.6v

Input length [0, 32k] price
128000

Input$0.145 / 1M tokens
Output$0.43 / 1M tokens

Input$0.145/ 1M tokens
Output$0.43/ 1M tokens
Original Price

glm-4.6v

Input length [32k, 128k] price
128000

Input$0.29 / 1M tokens
Output$0.86 / 1M tokens

Input$0.29/ 1M tokens
Output$0.86/ 1M tokens
Original Price