glm-5

glm-5

Zhipu AI has launched a new-generation flagship foundation large model, specifically designed for complex system engineering and long-term agent tasks, offering a real programming experience that closely rivals Claude Opus 4.5.
2026-02-12
LLM
Model capability: thinkingModel capability: function_call
Input:
$0.6/1M tokensstarting from
Output:
$2.6/1M tokensstarting from
Bulk order? Contact your manager for exclusive deals

API Overview

GLM-5 is the next-generation flagship foundation model released by Zhipu AI. Its core positioning is as a “general-purpose agent foundation for agentic engineering,” specifically designed for complex systems engineering and long-term agent tasks. It achieves state-of-the-art performance in coding and agent capabilities among open-source models, with a real programming experience approaching that of Claude Opus 4.5.

  • Capability Leap: The model’s parameter scale has expanded from 35.5B (with 32B activated) to 74.4B (with 40B activated), and its pre-training data reaches 28.5T tokens. Combined with the brand-new “Slime” asynchronous reinforcement learning framework and DeepSeek’s sparse attention mechanism, it significantly reduces deployment costs while maintaining context processing performance around 200K tokens.
  • Programming Strength: GLM-5 achieves the highest scores among open-source models on SWE-bench-Verified (77.8) and Terminal Bench 2.0 (56.2), surpassing Gemini 3.0 Pro. It can independently handle system-level engineering tasks such as backend refactoring, deep debugging, and full-stack development.
  • Agent Capabilities: In multi-tool long-term task evaluations including BrowseComp, MCP-Atlas, and τ²-Bench, GLM-5 ranks first among open-source models, demonstrating strong goal consistency, resource scheduling, and multi-step dependency handling capabilities.
  • Office Integration: GLM-5 natively supports GLM in Excel and is compatible with Microsoft’s official AI plugin, enabling spreadsheet automation. It also supports structured outputs (JSON), function calls, MCP tool invocations, and context caching.
  • Recommended Scenarios: Agentic Coding, end-to-end agent execution, cross-phase office tasks, immersive role-playing, professional script/storyboard generation, precise translation, structured data extraction, and information quality inspection.

───────────────────────────────────────────────────────────────────

Core Capabilities

🧠 Agentic-Ready Architecture

Supports the leap from “writing code” to “writing engineering,” enabling autonomous planning, execution, verification, and delivery of complete system-level products.

💻 SOTA-Level Programming Capability

Provides a real development environment experience close to Claude Opus 4.5, significantly reducing manual intervention and covering the entire frontend, backend, and data processing pipeline.

🛠️ Powerful Tool Ecosystem Integration

Supports function calls and the MCP protocol, allowing flexible invocation of external tools and data sources to build complex agent workflows.

📊 Deep Adaptation for Office Productivity

Through GLM in Excel and long-context memory, it stably handles multi-step, highly logically connected table and document tasks.

🎭 Highly Consistent Role-Playing

Consistently maintains character settings, emotions, and narrative logic during long-text interactions, enabling evolving, immersive conversations.

📄 Professional Content Generation

Long-text outputs such as scripts and storyboards exhibit production-quality usability, with greatly enhanced character development and plot coherence.

🔍 Precise Information Processing

It can extract structured data from complex texts like contracts, financial reports, and customer service tickets, and automatically perform quality checks and risk identification.

Efficient Inference Optimization

With a 200K context window, a maximum output of 128K, and a sparse attention mechanism, it balances long-term task capabilities with token efficiency.

Playground

Log in to explore more features! Click to Log In

API Analytics

API Reference (1)

API DescriptionAPI EndpointRequest MethodStabilityParameter Description
Chat (Zhipu GLM Multimodal)
POST
Stable
View Details

API Pricing

$
ModelDescriptionContextOfficial Price302.AI Price

glm-5

Input length [0, 32k]
200000

Input$0.6 / 1M tokens
Output$2.6 / 1M tokens

Input$0.6/ 1M tokens
Output$2.6/ 1M tokens
Original Price

glm-5

Input length [32k, 200k]
200000

Input$0.9 / 1M tokens
Output$3.2 / 1M tokens

Input$0.9/ 1M tokens
Output$3.2/ 1M tokens
Original Price