
grok-4-0709
API Overview
Grok 4 is a leading AI model launched by xAI, positioned as a high-end AI system with native tool usage capabilities, real-time search integration, and cutting-edge reasoning abilities. It focuses on delivering intelligent reasoning and efficient task processing, tailored to meet the demands of professional scenarios.
- Breakthrough in Reinforcement Learning: Leveraging the 200,000-GPU cluster Colossus and self-developed algorithms, Grok 4 achieves a sixfold increase in computational efficiency, enabling training across a wider range of domains and delivering performance improvements exceeding one order of magnitude.
- Native Tool Usage: Equipped with the ability to autonomously select and use tools—including code interpreters, web browsing, semantic search, and more—Grok 4 can obtain information in real time and provide in-depth answers to complex questions.
- Leading Intelligent Performance: Grok 4 excels in multiple closed-model and benchmark tests. In the ARC-AGI V2 test, it achieved a record-breaking score of 15.9%, and in the Agentic Vending-Bench test, it significantly outperformed its peers.
- Support for Parallel Testing (Grok 4 Heavy): Grok 4 Heavy can simultaneously process multiple hypotheses.
- Academic Benchmark Breakthrough (Grok 4 Heavy): In the "Ultimate Human Exam," Grok 4 Heavy scored over 50%, and in the USAMO 2025 and AIME’25 math tests, it achieved scores of 61.9% and 100%, respectively.
───────────────────────────────────────────────────────────────────
Core Capabilities
⚡ Version-Specific Adaptation for Diverse Needs
Grok 4 offers a basic version (suitable for SuperGrok and Premium+ users) and an advanced version (Grok 4 Heavy, supporting multi-agent parallel reasoning), catering to a wide variety of task requirements.
🛠️ Performance Breakthroughs in Professional Scenarios
- Tool Invocation Capability: Through real-time tool usage, Grok 4 addresses diverse needs—from coding to complex academic problems.
- Mathematics and Academic Testing: Particularly outstanding in academic fields, Grok 4 assists researchers in tackling highly challenging mathematical tasks and has already demonstrated exceptional performance in top-tier tests.
📄 Enhanced Long Context and Tool Invocation
- Long Context Processing: Supporting a context window of 256,000 tokens, Grok 4 can handle complex reports and multi-document projects, providing efficient information processing.
- Multi-modal Integration Capability: Supporting voice and visual interactions, Grok 4 generates precise answers based on real-time searches and multi-modal data, enhancing user experience.
───────────────────────────────────────────────────────────────────
Related Evaluations
Playground
Log in to explore more features! Click to Log In
