codegeex-4

codegeex-4

An open-source, professional code generation model continuously trained on GLM-4-9B, covering all development scenarios including code completion and repository-level question answering.
2024-07-05
LLM
Input:
$0.014/1M tokens
Output:
$0.014/1M tokens
Bulk order? Contact your manager for exclusive deals

API Overview

CodeGeeX4-ALL-9B is an open-source, professional code-generation model developed by the Tsinghua University team, with a core focus on being a multilingual, all-purpose development assistant that supports ultra-long contexts of up to 128K tokens. It is continuously trained based on GLM-4-9B and achieves enterprise-level performance with lightweight parameters, covering all development scenarios—including code completion and repository-level question answering.

  • Performance Breakthrough: On authoritative benchmarks such as BigCodeBench and NaturalCodeBench, it sets a new record for 10B-parameter models with a score of 48.9 points, surpassing large models like Llama3-70B
  • Scenario Adaptation: It uniquely supports function calls, code interpreters, and repository-level code question answering, covering the entire development workflow from single-line completion to project refactoring
  • Efficiency Advantage: Its inference cost is only 1/4 of comparable models, and its development cycle is shortened by 50%, making it especially suitable for resource-sensitive teams
  • Long-Text Processing: With support for 128K contexts, its code retrieval accuracy reaches 100% (CRUXEval test)
  • Well-Established Ecosystem: It provides VS Code/Jetbrains plugins, local deployment guides, and multi-framework deployment solutions (vLLM/Candle)

───────────────────────────────────────────────────────────────────

Core Capabilities

⚡ Ultra-Long Context Handling: With a sequence length of 128K tokens, it easily handles project analysis involving millions of lines, enabling precise code retrieval and cross-file completion

🌐 Multilingual Code Generation: It supports over 73 languages including Python and Rust, achieving a score of 82.3 on the HumanEval test—surpassing DeepSeek Coder 33B

🔧 Enterprise-Level Function Calls: It uniquely implements Function Call capability, with a success rate exceeding GPT-4 and directly interfacing with APIs and database operations

📊 Intelligent Repository-Level Interaction: It completes code review and refactoring suggestions in a single model, validated by NaturalCodeBench with a score of 40.4, and supports cross-file understanding

🚀 Rapid Deployment Solution: It is compatible with vLLM/Ollama/Candle frameworks, providing out-of-the-box OpenAI-compatible interfaces and allowing you to set up a local service in just 5 minutes

Playground

Log in to explore more features! Click to Log In

API Analytics

API Reference (1)

API DescriptionAPI EndpointRequest MethodStabilityParameter Description
Chat (Zhipu GLM-4)
POST
Stable
View Details

API Pricing

$
ModelDescriptionContextOfficial Price302.AI Price

codegeex-4

-
128000

Input$0.014 / 1M tokens
Output$0.014 / 1M tokens

Input$0.014/ 1M tokens
Output$0.014/ 1M tokens
Original Price