grok-3

grok-3

The most advanced AI model series launched by xAI, the “Reasoning Agent,” which combines exceptional reasoning capabilities with vast pre-trained knowledge.
2025-02-21
LLM
Input:
$3/1M tokens
Output:
$15/1M tokens
Bulk order? Contact your manager for exclusive deals

API Overview

Grok 3 is an AI model series launched by xAI, positioned as a “reasoning agent” that seamlessly integrates exceptional reasoning capabilities with vast pre-trained knowledge. It focuses on delivering high-precision, traceable outputs in tasks such as mathematics, coding, scientific reasoning, and long-document processing.

  • Superior Training Foundation: Trained on xAI’s Colossus supercluster, Grok 3 boasts a computational scale 10 times greater than that of Grok’s previous state-of-the-art models. Grok 3 mini, on the other hand, represents a new direction for efficient reasoning, tailored for STEM tasks that do not require extensive world knowledge.
  • Top-Tier Reasoning Performance: Grok 3 (Think) has been optimized through large-scale reinforcement learning (RL), enabling it to think for seconds to minutes and supporting backtracking correction and multi-hypothesis exploration. Grok 3 mini (Think) achieved 95.8% accuracy on the 2024 AIME and 80.4% on LiveCodeBench.
  • Comprehensive Academic Leadership: In non-reasoning modes, Grok 3 excels across multiple academic benchmarks, while Grok 3 mini also maintains strong competitiveness on corresponding benchmarks (e.g., 78.9% on MMLU-pro, 83.1% on LOFT [128k]).
  • Contextual Processing: On the LOFT (128k) long-text RAG benchmark, Grok 3 mini achieves industry-leading average accuracy across 12 tasks, supporting complex document processing and precise information retrieval.
  • Agent and Tool Integration: Grok 3 supports code interpreters and internet access, enabling proactive queries for missing context and dynamic strategy adjustments. It introduces the first AI agent, “DeepSearch,” which performs deep cross-human-knowledge corpus searches, synthesizes key information, resolves conflicting viewpoints, and generates concise, comprehensive reports—ideal for real-time news, social problem consultations, and advanced research scenarios.
  • Practical Features: Users can enable reasoning functionality via the “Think” button to view the full reasoning process. Grok 3 also offers enhanced factual accuracy and stylistic control, reducing “hallucination” outputs.───────────────────────────────────────────────────────────────────

Core Capabilities

⚡ Multi-version adaptation for diverse needs

  • Grok 3: Primarily targets high-complexity tasks, enhancing reasoning, mathematics, coding, world knowledge, and instruction-following abilities, suitable for deep academic research and complex code development scenarios.
  • Grok 3 mini: Balances efficiency and cost, focusing on STEM tasks that do not require extensive world knowledge, ideal for lightweight reasoning and rapid-response requirements.
  • Grok 3 (Think)/Grok 3 mini (Think): Beta versions of reasoning models, optimized through RL-enhanced chain-of-thought (CoT) processes, supporting long-term thinking and multi-hypothesis validation, perfect for challenging mathematical and logical reasoning tasks.

🛠️ Key Scenario Performance Breakthroughs

  • Mathematics and Scientific Reasoning: Covers competition-level mathematics (AIME) and graduate-level science problems (GPQA), capable of breaking down complex problems, backtracking corrections, verifying solution accuracy, and supporting research assistance and advanced education scenarios.
  • Coding Capability: Can generate fully functional code—for example, developing a “Break-Pong” hybrid game based on pygame (combining Pong and Breakout elements)—including smooth animations, particle effects, complete game logic, and interactive controls. The code features clear structure and detailed comments.
  • Multi-modal Understanding: Achieves 78% accuracy on the MMMU (multi-modal understanding) test, supporting image and video-related task comprehension and adapting to multi-modal information processing scenarios.
  • Intelligent Retrieval and Synthesis: Through the “DeepSearch” agent, Grok 3 can retrieve information across massive knowledge sources, organize complex information, resolve conflicting viewpoints, and produce structured reports—far surpassing the information integration capabilities of traditional browser searches.

Playground

Log in to explore more features! Click to Log In

API Analytics

API Reference (5)

API DescriptionAPI EndpointRequest MethodStabilityParameter Description
Chat(grok-2)
POST
Stable
View Details
Chat(grok-2-vision)
POST
Stable
View Details
Chat(grok-3)
POST
Stable
View Details
Async Get Result
GET
Stable
View Details
Asynchronous request to chat
POST
Stable
View Details

API Pricing

$
ModelDescriptionContextOfficial Price302.AI Price

grok-3

-
131072

Input$3 / 1M tokens
Output$15 / 1M tokens

Input$3/ 1M tokens
Output$15/ 1M tokens
Original Price