glm-4-0520


The high-performance Chinese foundation model launched by Zhipu AI
2024-05-20
LLM
Model capability: function_call
Input: $14 / 1M tokens
Output: $14 / 1M tokens

API Overview

GLM-4 is Zhipu AI's flagship language model, positioned as a high-performance Chinese foundation model. It offers a 128K context window, agent capabilities, and multimodal integration, which let it handle complex tasks autonomously and support enterprise-level deployment.

  • Chinese capabilities on par with GPT-4: In multiple evaluations, its Chinese understanding and generation abilities have reached GPT-4 levels, making it a benchmark for domestically developed models.
  • Ultra-long 128K context window: It can process up to 300 pages of text in a single session and is reported to reach 100% recall on “needle-in-a-haystack” tests, which suits deep analysis of long documents.
  • Autonomous agent execution: It can automatically break down complex instructions, call browsers, code interpreters, and text-to-image toolchains, and complete cross-modal tasks.
  • Structured output support: Outputs natively in JSON format, directly interfacing with business systems and reducing the need for secondary development.
  • Influence in the open-source ecosystem: The open-source version has been downloaded over 10 million times worldwide, with 13 million downloads on Hugging Face, ranking first among domestic models.
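
As a rough illustration of what the 128K window means in practice, the sketch below checks whether a request fits and derives the per-page token figure implied by the "300 pages" claim. It assumes that prompt and output tokens share the same 128,000-token window and that token counts come from some tokenizer you already have; the function names are hypothetical and not part of any Zhipu AI or 302.AI API.

```python
# Context size taken from the listing; everything else here is an assumption.
CONTEXT_WINDOW_TOKENS = 128_000
PAGES_CLAIMED = 300  # "up to 300 pages" from the overview


def tokens_per_page() -> float:
    """Average tokens per page implied by the 128K / 300-page claim."""
    return CONTEXT_WINDOW_TOKENS / PAGES_CLAIMED


def fits_in_context(prompt_tokens: int, max_output_tokens: int) -> bool:
    """True if the prompt plus the reserved output budget fits in the window.

    Assumes prompt and output tokens count against the same window.
    """
    if prompt_tokens < 0 or max_output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    return prompt_tokens + max_output_tokens <= CONTEXT_WINDOW_TOKENS


print(round(tokens_per_page()))         # 427
print(fits_in_context(120_000, 8_000))  # True
print(fits_in_context(120_000, 8_001))  # False
```

If a document does not fit, it has to be split or summarized before sending; how best to do that depends on the task and is not covered by the listing.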

───────────────────────────────────────────────────────────────────

Core Capabilities

⚡ Ultra-long text processing: A 128K context window plus an intelligent caching mechanism boosts the coherence of causal chain analysis in long documents by 50%.

🔧 Intelligent agent task chains: Autonomously breaks complex instructions into steps and calls multiple tools in sequence to complete multi-step tasks (such as report generation plus data visualization).
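
On the client side, function calling usually means mapping a tool name and JSON-encoded arguments returned by the model onto local code. The sketch below shows one minimal way to do that. The tool call shape (a name plus a JSON argument string), the `get_weather` stub, and the `TOOLS` registry are all hypothetical and chosen for illustration; the listing does not document the actual response format.

```python
import json
from typing import Callable


def get_weather(city: str) -> str:
    """Hypothetical local tool; a real one would query a weather service."""
    return f"(stub) weather for {city}"


# Registry of tools the application is willing to run (illustrative only).
TOOLS: dict[str, Callable[..., str]] = {"get_weather": get_weather}


def dispatch_tool_call(name: str, arguments_json: str) -> str:
    """Run the named tool with JSON-encoded keyword arguments."""
    if name not in TOOLS:
        raise KeyError(f"unknown tool: {name}")
    kwargs = json.loads(arguments_json)
    if not isinstance(kwargs, dict):
        raise ValueError("tool arguments must be a JSON object")
    return TOOLS[name](**kwargs)


print(dispatch_tool_call("get_weather", '{"city": "Beijing"}'))
```

Keeping an explicit registry means the model can only trigger functions the application has chosen to expose.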

🌐 Full multimodal integration: Integrates web retrieval, code execution, and multimodal understanding, enabling end-to-end processing from text to image to data with a single call.

🔑 Enterprise private customization: LoRA fine-tuning can be completed in just 3 steps, raising model usability from 60% to 89%.

🔍 Structured data output: Native JSON support plus coordinate localization (such as Grounding) allows direct generation of interactive code or structured data.
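
Even with native JSON output, it is prudent to validate a reply before passing it to a business system. The following is a minimal sketch that assumes the reply text has already been extracted from the API response; the `parse_structured_reply` helper and the example keys are hypothetical.

```python
import json


def parse_structured_reply(reply_text: str, required_keys: set[str]) -> dict:
    """Parse a model reply as a JSON object and check that required keys exist."""
    try:
        data = json.loads(reply_text)
    except json.JSONDecodeError as exc:
        raise ValueError(f"reply is not valid JSON: {exc}") from exc
    if not isinstance(data, dict):
        raise ValueError("expected a JSON object at the top level")
    missing = required_keys - set(data)
    if missing:
        raise ValueError(f"missing keys: {sorted(missing)}")
    return data


# Hypothetical reply text, for illustration only.
reply = '{"title": "Q2 report", "total": 1280}'
parsed = parse_structured_reply(reply, {"title", "total"})
print(parsed["total"])  # 1280
```

Raising on malformed or incomplete replies lets the caller decide whether to retry or fall back, rather than letting bad data reach downstream code.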


API Reference (1)

API Description: Chat (Zhipu GLM-4)
Request Method: POST
Stability: Stable

API Pricing

Model: glm-4-0520
Description: -
Context: 128000 tokens
Official Price: Input $14 / 1M tokens, Output $14 / 1M tokens
302.AI Price: Input $14 / 1M tokens, Output $14 / 1M tokens (original price)
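
To make the per-token pricing concrete, here is a small sketch that estimates the cost of a single request at the listed rates of $14 per 1M tokens for both input and output. The function name and the example token counts are illustrative assumptions; actual billing may round or meter differently.

```python
from decimal import Decimal

# Listed rates in USD per 1M tokens (from the pricing above).
PRICE_PER_MILLION_USD = {"input": Decimal("14"), "output": Decimal("14")}
ONE_MILLION = Decimal(1_000_000)


def estimate_cost_usd(input_tokens: int, output_tokens: int) -> Decimal:
    """Estimate the USD cost of one request at the listed rates."""
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    total = (PRICE_PER_MILLION_USD["input"] * input_tokens
             + PRICE_PER_MILLION_USD["output"] * output_tokens)
    return total / ONE_MILLION


# Example: a 100,000-token prompt with a 2,000-token reply.
print(estimate_cost_usd(100_000, 2_000))  # 1.428
```

Decimal is used instead of float so that small per-request amounts add up without binary rounding drift.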