Technology

Grok 4 is crushing it — Elon Musk’s AI just topped the leaderboard that matters most

By Amanda Caswell

Copyright tomsguide

Grok 4 is crushing it — Elon Musk’s AI just topped the leaderboard that matters most

Skip to main content

Tom’s Guide

Newsletters

View Profile

Search Tom’s Guide

You May Like

Grok 4 is live — Elon Musk introduces ‘smartest AI’ and addresses antisemitic posts

Grok 4 just revealed a $300 a month plan — here’s what it includes

ChatGPT-5 beats Gemini and Grok in tests — here’s why that’s important

Phone Insights

Phone Best Picks

Phone Deals

Phone Face-Offs

Phone How-Tos

Phone Reviews

Network Carriers

Android Phones

Google Phones

Motorola Phones

OnePlus Phones

Samsung Phones

Nothing Phone

TV Best Picks

TV Face-Offs

Audio Insights

Audio Best Picks

Audio Deals

Audio Face-Offs

Audio How-Tos

Audio Reviews

Over-Ear Headphones

Bluetooth Speakers

Entertainment

Streaming Devices

Prime Video

Paramount Plus

Playstation

Gaming Peripherals

Connections

Computing Insights

Computing Best Picks

Computing Deals

Computing Face-Offs

Computing How-Tos

Computing News

Computing Reviews

VPN Best Picks

VPN Face-Offs

VPN How-Tos

VPN Reviews

Operating Systems

Malware & Adware

Smart Glasses

Chromebooks

Gaming Laptops

Apple Desktops

Gaming Desktops

Android Tablets

Computing Brands

AI Insights

AI Best Picks

AI Face-Offs

Google Gemini

Apple Intelligence

Mattress Best Picks

Mattress Deals

Mattress Face-Offs

Mattress How-Tos

Mattress News

Mattress Reviews

Mattress Care

Mattress Toppers

Pillows & Bedding

Smartwatches

Fitness Trackers

Smart Rings

Apple Watch

Home Insights

Home Best Picks

Home Face-Offs

Home How-Tos

Home Reviews

Home Topics

Home Appliances

Home Office

Home Security

Home Brands

Popular Brands

View Phones

Phone Insights

Phone Best Picks

Phone Deals

Phone Face-Offs

Phone How-Tos

Phone Reviews

Network Carriers

View Network Carriers

Android Phones

View Android Phones

Google Phones

Motorola Phones

OnePlus Phones

Samsung Phones

Nothing Phone

TV Best Picks

TV Face-Offs

Audio Insights

View Audio Insights

Audio Best Picks

Audio Deals

Audio Face-Offs

Audio How-Tos

Audio Reviews

Headphones

View Headphones

Over-Ear Headphones

View Speakers

Bluetooth Speakers

Entertainment

View Entertainment

View Streaming

Streaming Devices

Prime Video

Paramount Plus

View Gaming

Playstation

Gaming Peripherals

Word Games

Connections

View Computing

Computing Insights

Computing Best Picks

Computing Deals

Computing Face-Offs

Computing How-Tos

Computing News

Computing Reviews

VPN Best Picks

VPN Face-Offs

VPN How-Tos

VPN Reviews

View Hardware

View Software

Operating Systems

View Security

Malware & Adware

View VR & AR

Smart Glasses

View Laptops

Chromebooks

Gaming Laptops

View Desktops

Apple Desktops

Gaming Desktops

View Tablets

Android Tablets

Computing Brands

AI Insights

AI Best Picks

AI Face-Offs

AI Engines

Google Gemini

Apple Intelligence

View Wellness

Mattresses

View Mattresses

Mattress Best Picks

Mattress Deals

Mattress Face-Offs

Mattress How-Tos

Mattress News

Mattress Reviews

Mattress Care

Mattress Toppers

Pillows & Bedding

View Fitness

Smartwatches

Fitness Trackers

Smart Rings

Apple Watch

Home Insights

Home Best Picks

Home Face-Offs

Home How-Tos

Home Reviews

Home Topics

Home Appliances

Home Office

Home Security

View Outdoors

Home Brands

Popular Brands

Meta Connect LIVE
iPhone 17 Pro Max Review
iPhone Air Review
iPhone 17 Review
Best laptops

Best Mattress

Don’t miss these

Grok 4 is live — Elon Musk introduces ‘smartest AI’ and addresses antisemitic posts

Grok 4 just revealed a $300 a month plan — here’s what it includes

ChatGPT-5 beats Gemini and Grok in tests — here’s why that’s important

I tested ChatGPT-5 vs Grok 4 with 9 prompts — and there’s a clear winner

27 AI models were ranked by the public and ChatGPT came 8th — these are the models that beat it

AI Image & Video
I tested Grok and Gemini on 7 tough image prompts — and the winner isn’t the one most people expect

5 Reasons why ChatGPT-5 beat Gemini in our coding face-off

I ditched Claude for GPT-5 because of these 5 features — and I know I’ll use them every day

GPT-5 vs GPT-4: Here’s what’s different (and what’s not) in ChatGPT’s latest upgrade

I just tested ChatGPT-5 vs Deepseek with 9 prompts — and there’s a clear winner

GPT-5 will be here soon — here’s why I’ll be using it over Claude, Gemini and Grok

I tested ChatGPT-5 vs Google Gemini 2.5 with 10 prompts — and there’s a clear winner

GPT-5 vs GPT-4: I tested both on 7 real-world challenges — one dominated

I tested ChatGPT vs Gemini 2.5 Pro with these 3 prompts – and it shows what GPT-5 needs to do

GPT-5 users aren’t happy with the update — try these alternative chatbots instead

Grok 4 is crushing it — Elon Musk’s AI just topped the leaderboard that matters most

Amanda Caswell

17 September 2025

Love it or hate it, the numbers are in

When you purchase through links on our site, we may earn an affiliate commission. Here’s how it works.

(Image credit: VINCENT FEURAY / Getty Images)

When it comes to chatbots, it’s easy to forget about Grok because it seems like other big tech is always in the news. With Google’s Nano Banana starting new trends and OpenAI’s ChatGPT hyping their latest models, Elon Musk’s chatbot simply exists in the background.

I’ve definitely found myself rolling my eyes at some of Grok’s decisions, especially when it comes to image generation. However, it’s clear that there are some reasons to sit in awe of what Elon Musk calls “the smartest AI in the world.”

As someone who has spent hours testing it, the truth is, it’s not just hype. From near-instant web searches to jaw-dropping results on complex engineering queries, Grok 4 is delivering in ways its predecessors and rivals haven’t quite managed. Whether you love the direction or cringe at the controversies, Grok 4 may always be the underdog that quietly crushes it.
What makes xAI’s Grok different

I now think @xAI has a chance of reaching AGI with @Grok 5. Never thought that before. https://t.co/FaBUYegl3DSeptember 17, 2025

Elon Musk posted on X highlighting that Grok 4 is at the top of the ARC-AGI leaderboard. To understand why that’s impressive, it’s important to become familiar with how models are tracked on it.

Essentially, the ARC-AGI leaderboard is a scoreboard for AI, that not only tracks how many problems a model can solve, but also how efficiently it solves them. In other words, it’s measuring both the brain and the resourcefulness of the model. High performance with low cost per task is what matters most.

So, Grok’s position at the very top is extrememly significant because it means the xAI model is not only keeping up with rivals like Gemini and ChatGPT, but outpacing them on some of the toughest benchmark criteria possible.

Beating every other chatbot suggests that Grok 4 is powerful and efficient, which is exactly the type of breakthrough that supports true progress in the evolution of artifical general intelligence (AGI).

You may like

Grok 4 is live — Elon Musk introduces ‘smartest AI’ and addresses antisemitic posts

Grok 4 just revealed a $300 a month plan — here’s what it includes

ChatGPT-5 beats Gemini and Grok in tests — here’s why that’s important

Where Grok still stumbles

(Image credit: Shutterstock)
Whether used on X or on the standalone platform, real-time search pulls in fresh infromation from both the web and X, so it can keep up with breaking news at a moment’s notice.

However, the accuracy and bias concerns are what critics keep coming back to. Grok has made some claims that turned out false, and there are questions about how its alignment is being guided (e.g. how much Musk’s own views factor in).
The model also struggles with issues of content moderation after xAI scrambled to pull posts and update filters when anitsemitc content popped up.
The takeaway
Despite the model beating it’s rivals, questions still remain like, will it stay reliable as usage increases? Will “garbage data” or bias creep back in under pressure? How well will xAI handle moderation long-term? The past controversies suggest it’s an ongoing battle.

Sign up to get the BEST of Tom’s Guide direct to your inbox.
Get instant access to breaking news, the hottest reviews, great deals and helpful tips.
Contact me with news and offers from other Future brandsReceive email from us on behalf of our trusted partners or sponsorsBy submitting your information you agree to the Terms & Conditions and Privacy Policy and are aged 16 or over.
There are no doubts that Grok is not perfect. It carries some extremely controversial baggage, but the proof of what it does better in terms of speed, real-time data and flexible thinking makes it a serious contender in the AI race.
More from Tom’s Guide

I tested Pangram, the ‘black light’ of AI detection built by ex-Tesla and Google engineers — here’s how well it worked
I tested ChatGPT vs Claude with 7 personal productivity tests — here’s the clear winner
Nano Banana just broke the internet with these viral trends — I tried these 5 prompts and I’m blown away

Back to Laptops

Intel Core i3

Intel Core i5

Intel Core i7

Storage Size

Screen Size

Refurbished

Screen Type

Showing 10 of 288 deals

Apple 13″ MacBook Air M4 (2025)

(256GB SSD)

Apple 15″ MacBook Air M4 (2025)

(15-inch 2TB)

$1,999.95View

Dell XPS 13 (2016)

Lenovo Yoga Slim 7x (Gen 9)

(512GB OLED)

$858.11View

Lenovo Chromebook Plus 14

$748.95View

Asus ROG Zephyrus G14 (2025)

(14-inch 1TB)

$1,579View

Apple 13″ MacBook Air M4 (2025)

Apple 15″ MacBook Air M4 (2025)

(15-inch 256GB)

Dell XPS 13 Plus

$869.99View

Lenovo Yoga Slim 7x (Gen 9)

$979.99View

See more AI News

Amanda Caswell

Social Links Navigation

Amanda Caswell is an award-winning journalist, bestselling YA author, and one of today’s leading voices in AI and technology. A celebrated contributor to various news outlets, her sharp insights and relatable storytelling have earned her a loyal readership. Amanda’s work has been recognized with prestigious honors, including outstanding contribution to media.
Known for her ability to bring clarity to even the most complex topics, Amanda seamlessly blends innovation and creativity, inspiring readers to embrace the power of AI and emerging technologies. As a certified prompt engineer, she continues to push the boundaries of how humans and AI can work together.
Beyond her journalism career, Amanda is a bestselling author of science fiction books for young readers, where she channels her passion for storytelling into inspiring the next generation. A long-distance runner and mom of three, Amanda’s writing reflects her authenticity, natural curiosity, and heartfelt connection to everyday life — making her not just a journalist, but a trusted guide in the ever-evolving world of technology.

You must confirm your public display name before commenting

Please logout and then login again, you will then be prompted to enter your display name.

Grok 4 is live — Elon Musk introduces ‘smartest AI’ and addresses antisemitic posts

Grok 4 just revealed a $300 a month plan — here’s what it includes

ChatGPT-5 beats Gemini and Grok in tests — here’s why that’s important

I tested ChatGPT-5 vs Grok 4 with 9 prompts — and there’s a clear winner

27 AI models were ranked by the public and ChatGPT came 8th — these are the models that beat it

I tested Grok and Gemini on 7 tough image prompts — and the winner isn’t the one most people expect

Latest in AI

OpenAI just changed ChatGPT for teens — as a mom and AI editor, here’s what it means to me

27 AI models were ranked by the public and ChatGPT came 8th — these are the models that beat it

I tested ChatGPT vs Claude with 7 personal productivity tests — here’s the clear winner

I tested the ‘black light’ of AI detection — here’s how the tool built by ex-Tesla and Google engineers works

I used Google’s Nano Banana to try a bunch of different hairstyles — and the results blew me away

New study shows how people are using ChatGPT — and Google should be worried

Latest in News

Grok 4 is crushing it — Elon Musk’s AI just topped the leaderboard that matters most

Logitech’s RS50 could shake up mid-range sim racing like never before — here’s why

iPhone 17 Pro Max sustained performance tested — does the vapor chamber cooling actually work?

Your next gaming mouse could have haptic feedback — here’s why Logitech’s new mouse is a big deal

Garmin launches Bounce 2 — new smartwatch for kids with LTE challenges Apple Watch SE 3

Samsung security flaw could let hackers remotely control your device — update your Galaxy phone right now

LATEST ARTICLES

Grok 4 is crushing it — Elon Musk’s AI just topped the leaderboard that matters most

I tried TikTok’s ‘3×3 by 12’ health trend that’s everywhere for a week — here’s what happened

Logitech’s RS50 could shake up mid-range sim racing like never before — here’s why

iPhone 17 Pro benchmarks: Apple’s A19 Pro silicon puts Snapdragon on notice

I tested the Withings Sleep Analyzer to see if it really is the gold standard for sleep tracking

Tom’s Guide is part of Future US Inc, an international media group and leading digital publisher. Visit our corporate site.

Terms and conditions

Contact Future’s experts

Privacy policy

Cookies policy

Accessibility Statement

Advertise with us

Future US, Inc. Full 7th Floor, 130 West 42nd Street,

Please login or signup to comment

Please wait…