sophnet/GLM-5

sophnet/GLM-5

Zhipu AI has launched a new-generation flagship foundation large model, specifically designed for complex system engineering and long-term agent tasks, offering a real programming experience that closely rivals Claude Opus 4.5.
2026-02-15
LLM
Model capability: thinkingModel capability: function_call
Input:
$0.572/1M tokensstarting from
Output:
$2.286/1M tokensstarting from
Bulk order? Contact your manager for exclusive deals

API Overview

GLM-5 is the next-generation flagship foundation model launched by Zhipu AI. Its core positioning is as a “general-purpose agent foundation for Agentic Engineering,” specifically designed for complex system engineering and long-term agent tasks. It achieves state-of-the-art (SOTA) performance in coding and agent capabilities among open-source models, with a real programming experience approaching that of Claude Opus 4.5.

  • Capability Leap: The model’s parameter scale has expanded from 35.5B (activated at 32B) to 74.4B (activated at 40B), with pre-training data reaching 28.5T tokens. Combined with the brand-new “Slime” asynchronous reinforcement learning framework and DeepSeek’s sparse attention mechanism, it significantly reduces deployment costs while maintaining context performance around 200K tokens.
  • Programming Strength: GLM-5 achieves the highest scores among open-source models on SWE-bench-Verified (77.8) and Terminal Bench 2.0 (56.2), surpassing Gemini 3.0 Pro. It can independently complete system-level engineering tasks such as backend refactoring, deep debugging, and full-stack development.
  • Agent Capabilities: GLM-5 ranks first among open-source models in multi-tool long-term task evaluations including BrowseComp, MCP-Atlas, and τ²-Bench. It demonstrates goal consistency, resource scheduling, and multi-step dependency handling capabilities.
  • Office Integration: GLM-5 natively supports GLM in Excel and is compatible with Microsoft’s official AI plugin, enabling spreadsheet automation. It also supports structured outputs (JSON), function calls, MCP tool invocations, and context caching.
  • Recommended Scenarios: Agentic Coding, end-to-end agent execution, cross-phase office tasks, immersive role-playing, professional script/storyboard generation, precise translation, structured data extraction, and information quality inspection.

───────────────────────────────────────────────────────────────────

Core Capabilities

🧠 Agentic-Ready Architecture

Supports the leap from “writing code” to “writing engineering,” enabling autonomous planning, execution, verification, and delivery of complete system-level products.

💻 SOTA-Level Programming Capability

Delivers an experience close to Claude Opus 4.5 in real development environments, significantly reducing manual intervention and covering the entire frontend, backend, and data processing pipeline.

🛠️ Powerful Tool Ecosystem Integration

Supports Function Calls and the MCP protocol, allowing flexible invocation of external tools and data sources to build complex agent workflows.

📊 Deep Adaptation for Office Productivity

Through GLM in Excel and long-context memory, it stably handles multi-step, highly logically connected table and document tasks.

🎭 Highly Consistent Role-Playing

Maintains consistent character settings, emotions, and narrative logic throughout long-text interactions, enabling evolving, immersive conversations.

📄 Professional Content Generation

Script and storyboard outputs are production-ready, with greatly enhanced character development and plot coherence.

🔍 Precise Information Processing

Can extract structured data from complex texts such as contracts, financial reports, and customer service tickets, and automatically perform quality checks and risk identification.

Efficient Inference Optimization

With a 200K context window, up to 128K maximum output, and a sparse attention mechanism, it balances long-term task capability with token efficiency.

Playground

Log in to explore more features! Click to Log In

API Analytics

API Reference (1)

API DescriptionAPI EndpointRequest MethodStabilityParameter Description
Chat(SophNet)
POST
Stable
View Details

API Pricing

$
ModelDescriptionContextOfficial Price302.AI Price

sophnet/GLM-5

Input < 32k
202000

Input$0.572 / 1M tokens
Output$2.286 / 1M tokens

Input$0.572/ 1M tokens
Output$2.286/ 1M tokens
Original Price

sophnet/GLM-5

Input: [32k,64k)
202000

Input$0.858 / 1M tokens
Output$3.556 / 1M tokens

Input$0.858/ 1M tokens
Output$3.556/ 1M tokens
Original Price

sophnet/GLM-5

64k <= input <= 202k
202000

Input$1.144 / 1M tokens
Output$4.064 / 1M tokens

Input$1.144/ 1M tokens
Output$4.064/ 1M tokens
Original Price