Grok 3 Thinking

Grok 3 Thinking

Grok 3 Thinking exposes the full chain-of-thought process during problem-solving, including error backtracking and alternative solution exploration. Scores 84.6% on GPQA Diamond benchmark for expert-level Q&A.

ConversationReasoningCode GenerationAnalysis
Provider
Xai
Release Date
February 19, 2025
Size
XLARGE
Parameters
2.7T

Benchmark Performance

Performance metrics on industry standard AI benchmarks that measure capabilities across reasoning, knowledge, and specialized tasks.

MMLU

86.2%
Source:x.ai

GPQA Diamond

84.6%
Source:x.ai

MATH

80.5%
Source:x.ai

Model Insights

All Model Responses