Other

27 AI models were ranked by the public and ChatGPT came 8th — these are the models that beat it

By Alex Hughes

Copyright tomsguide

27 AI models were ranked by the public and ChatGPT came 8th — these are the models that beat it

Skip to main content

Tom’s Guide

Newsletters

View Profile

Search Tom’s Guide

You May Like

ChatGPT-5 beats Gemini and Grok in tests — here’s why that’s important

I tested ChatGPT-5 vs Google Gemini 2.5 with 10 prompts — and there’s a clear winner

5 Reasons why ChatGPT-5 beat Gemini in our coding face-off

Phone Insights

Phone Best Picks

Phone Deals

Phone Face-Offs

Phone How-Tos

Phone Reviews

Network Carriers

Android Phones

Google Phones

Motorola Phones

OnePlus Phones

Samsung Phones

Nothing Phone

TV Best Picks

TV Face-Offs

Audio Insights

Audio Best Picks

Audio Deals

Audio Face-Offs

Audio How-Tos

Audio Reviews

Over-Ear Headphones

Bluetooth Speakers

Entertainment

Streaming Devices

Prime Video

Paramount Plus

Playstation

Gaming Peripherals

Connections

Computing Insights

Computing Best Picks

Computing Deals

Computing Face-Offs

Computing How-Tos

Computing News

Computing Reviews

VPN Best Picks

VPN Face-Offs

VPN How-Tos

VPN Reviews

Operating Systems

Malware & Adware

Smart Glasses

Chromebooks

Gaming Laptops

Apple Desktops

Gaming Desktops

Android Tablets

Computing Brands

AI Insights

AI Best Picks

AI Face-Offs

Google Gemini

Apple Intelligence

Mattress Best Picks

Mattress Deals

Mattress Face-Offs

Mattress How-Tos

Mattress News

Mattress Reviews

Mattress Care

Mattress Toppers

Pillows & Bedding

Smartwatches

Fitness Trackers

Smart Rings

Apple Watch

Home Insights

Home Best Picks

Home Face-Offs

Home How-Tos

Home Reviews

Home Topics

Home Appliances

Home Office

Home Security

Home Brands

Popular Brands

View Phones

Phone Insights

Phone Best Picks

Phone Deals

Phone Face-Offs

Phone How-Tos

Phone Reviews

Network Carriers

View Network Carriers

Android Phones

View Android Phones

Google Phones

Motorola Phones

OnePlus Phones

Samsung Phones

Nothing Phone

TV Best Picks

TV Face-Offs

Audio Insights

View Audio Insights

Audio Best Picks

Audio Deals

Audio Face-Offs

Audio How-Tos

Audio Reviews

Headphones

View Headphones

Over-Ear Headphones

View Speakers

Bluetooth Speakers

Entertainment

View Entertainment

View Streaming

Streaming Devices

Prime Video

Paramount Plus

View Gaming

Playstation

Gaming Peripherals

Word Games

Connections

View Computing

Computing Insights

Computing Best Picks

Computing Deals

Computing Face-Offs

Computing How-Tos

Computing News

Computing Reviews

VPN Best Picks

VPN Face-Offs

VPN How-Tos

VPN Reviews

View Hardware

View Software

Operating Systems

View Security

Malware & Adware

View VR & AR

Smart Glasses

View Laptops

Chromebooks

Gaming Laptops

View Desktops

Apple Desktops

Gaming Desktops

View Tablets

Android Tablets

Computing Brands

AI Insights

AI Best Picks

AI Face-Offs

AI Engines

Google Gemini

Apple Intelligence

View Wellness

Mattresses

View Mattresses

Mattress Best Picks

Mattress Deals

Mattress Face-Offs

Mattress How-Tos

Mattress News

Mattress Reviews

Mattress Care

Mattress Toppers

Pillows & Bedding

View Fitness

Smartwatches

Fitness Trackers

Smart Rings

Apple Watch

Home Insights

Home Best Picks

Home Face-Offs

Home How-Tos

Home Reviews

Home Topics

Home Appliances

Home Office

Home Security

View Outdoors

Home Brands

Popular Brands

iPhone 17 Pro Max Review
iPhone Air Review
iPhone 17 Review
Wordle Today
Best laptops

Best Mattress

Don’t miss these

ChatGPT-5 beats Gemini and Grok in tests — here’s why that’s important

I tested ChatGPT-5 vs Google Gemini 2.5 with 10 prompts — and there’s a clear winner

5 Reasons why ChatGPT-5 beat Gemini in our coding face-off

3 ways ChatGPT-5 outshines Gemini in AI image tests

I tested ChatGPT-5 vs Claude with 7 challenging prompts — here’s the winner

I tested ChatGPT-5 vs Gemini 2.5 Pro with 5 coding prompts — here’s the winner

I just tested ChatGPT-5 vs Deepseek with 9 prompts — and there’s a clear winner

Forget ChatGPT and Gemini — this lesser-known chatbot just ranked No. 1 for privacy

I tested ChatGPT-5 vs Gemini 2.5 Pro with 11 AI image prompts — here’s the winner

I ditched Claude for GPT-5 because of these 5 features — and I know I’ll use them every day

GPT-5 users aren’t happy with the update — try these alternative chatbots instead

I put ChatGPT vs Gemini vs Claude through the same job interview — here’s the one that got hired

I tested ChatGPT-5 vs Grok 4 with 9 prompts — and there’s a clear winner

I asked ChatGPT, Claude, and Gemini tough teen questions — only one earned my trust

I tested ChatGPT vs Gemini 2.5 Pro with these 3 prompts – and it shows what GPT-5 needs to do

27 AI models were ranked by the public and ChatGPT came 8th — these are the models that beat it

Alex Hughes

17 September 2025

A surprising set of results for the big AI chatbots

When you purchase through links on our site, we may earn an affiliate commission. Here’s how it works.

(Image credit: Tom’s Guide/Shutterstock)

While the world of AI can often feel a bit like the wild west, there is a surprisingly high amount of analysis, benchmarking, and testing that goes on behind the scenes. Not just from the companies themselves, but from groups set up to establish their own rankings.

These groups test everything from a chatbot’s ability to complete mathematical tests, create images, show reasoning, offer medical advice, or simply how emotionally intelligent they are.
Across these different tests, models go up and down, showing their strengths and weaknesses in different areas. For example, while GPT-5 is great at scientific reasoning, it fell behind the likes of Gemini and Claude for its ability to adapt to new concepts.

You may like

ChatGPT-5 beats Gemini and Grok in tests — here’s why that’s important

I tested ChatGPT-5 vs Google Gemini 2.5 with 10 prompts — and there’s a clear winner

5 Reasons why ChatGPT-5 beat Gemini in our coding face-off

Each of these tests tells us something new about AI models, and they are important as a reminder of which tool is best in different scenarios. But one measurement is often lacking. Simply, which AI models offer the best user experience?

The Humaine ranking system

(Image credit: Humaine)
A UK-based tech company called Prolific has set up its own AI leaderboard called Humaine. Instead of testing AI’s ability to complete tasks, Prolific tested different users’ experiences of the models.
By evaluating 21,352 people’s experiences of 21,352 people with the tools, they could not only find an overall winner but also break down the results by age, location (tested in both the UK and the US), and political beliefs.
This includes individual lists for:

Sign up to get the BEST of Tom’s Guide direct to your inbox.
Get instant access to breaking news, the hottest reviews, great deals and helpful tips.
Contact me with news and offers from other Future brandsReceive email from us on behalf of our trusted partners or sponsorsBy submitting your information you agree to the Terms & Conditions and Privacy Policy and are aged 16 or over.

UK: age groups
UK: Ethnicity
UK: Political view
US: age groups
US: Ethnicity
US Political view
The team made each participant interact with two seperate AI models in a comparison, asking them to give feedback on which model was better in each interaction.
This led to an overall winner and scoreboard for performance, but also separate rankings for core task performance and reasoning, as well as a winner for communication, fluidity, and trust and ethics.
What do the results show?

(Image credit: Future)
After polling, there was a very clear winner, not just in the overall performance category but in most of the subcategories. Gemini 2.5-Pro came out on top in almost every filter that the test offered.

You may like

ChatGPT-5 beats Gemini and Grok in tests — here’s why that’s important

I tested ChatGPT-5 vs Google Gemini 2.5 with 10 prompts — and there’s a clear winner

5 Reasons why ChatGPT-5 beat Gemini in our coding face-off

18-34 year olds in the UK, Democrat voters, and those over 55 in the US all agreed that Gemini 2.5 Pro was the best overall model. The only area that all demographic groups ranked something above Gemini was in trust, ethics, and safety was Grok-3 — a somewhat ironic finding considering some of the safety and ethics issues the AI model has had of late.
Interestingly, the three models that come up after Gemini are Deepseek, Magistral Le Chat, and Grok. While Deepseek saw a huge amount of popularity earlier this year, it has fallen off the radar recently. Le Chat, on the other hand, is a less popular chatbot, but one with a loyal fanbase.
So, where is the world-famous ChatGPT in all of this? It’s a big scroll down, coming in 8th with the GPT-4.1 model ranking highest. Even worse is Claude, with its two version 4 models landing 11th and 12th in the overall ranking.
So what does this all mean?
Does this mean Gemini is the best AI chatbot in the world? Does it mean you should be ditching ChatGPT…? Well, not exactly.
These results don’t necessarily reflect the performance of these models. When tested on most other metrics, the options we normally see at the top are ChatGPT, Gemini, Claude and Grok.
This, however, is an important addition to these tests. It helps to give a better understanding of AI from a more human experience perspective. Le Chat, for example, doesn’t score as highly in benchmarks, but is frequently listed as a top option for experience and trust.
While Anthropic and OpenAI don’t do too well in this particular round of testing, it is another strong performance for both Gemini and Grok. Both companies frequently score highly in benchmarks and have continued to do so here, too.
More from Tom’s Guide

I tested Pangram, the ‘black light’ of AI detection built by ex-Tesla and Google engineers — here’s how well it worked
I used Google’s Nano Banana to try a bunch of different hairstyles — and the results blew me away
I tested ChatGPT vs Claude with 7 personal productivity tests — here’s the clear winner

Back to Laptops

Intel Core i3

Intel Core i5

Intel Core i7

Storage Size

Screen Size

Refurbished

Screen Type

Showing 10 of 289 deals

Apple 13″ MacBook Air M4 (2025)

(256GB SSD)

Apple 15″ MacBook Air M4 (2025)

(15-inch 2TB)

$1,999.95View

Dell XPS 13 (2016)

Lenovo Yoga Slim 7x (Gen 9)

(512GB OLED)

$858.11View

Lenovo Chromebook Plus 14

$748.95View

Asus ROG Zephyrus G14 (2025)

(14-inch 1TB)

$1,579View

Apple 13″ MacBook Air M4 (2025)

Apple 15″ MacBook Air M4 (2025)

(15-inch 256GB)

Dell XPS 13 Plus

$869.99View

Lenovo Yoga Slim 7x (Gen 9)

$979.99View

Alex Hughes

Social Links Navigation

Alex is the AI editor at TomsGuide. Dialed into all things artificial intelligence in the world right now, he knows the best chatbots, the weirdest AI image generators, and the ins and outs of one of tech’s biggest topics.
Before joining the Tom’s Guide team, Alex worked for the brands TechRadar and BBC Science Focus.
He was highly commended in the Specialist Writer category at the BSME’s 2023 and was part of a team to win best podcast at the BSME’s 2025.
In his time as a journalist, he has covered the latest in AI and robotics, broadband deals, the potential for alien life, the science of being slapped, and just about everything in between.
When he’s not trying to wrap his head around the latest AI whitepaper, Alex pretends to be a capable runner, cook, and climber.

You must confirm your public display name before commenting

Please logout and then login again, you will then be prompted to enter your display name.

ChatGPT-5 beats Gemini and Grok in tests — here’s why that’s important

I tested ChatGPT-5 vs Google Gemini 2.5 with 10 prompts — and there’s a clear winner

5 Reasons why ChatGPT-5 beat Gemini in our coding face-off

3 ways ChatGPT-5 outshines Gemini in AI image tests

I tested ChatGPT-5 vs Claude with 7 challenging prompts — here’s the winner

I tested ChatGPT-5 vs Gemini 2.5 Pro with 5 coding prompts — here’s the winner

Latest in AI

I’m a mom and AI editor — here’s why OpenAI’s new ChatGPT rules hit close to home for me

I tested ChatGPT vs Claude with 7 personal productivity tests — here’s the clear winner

I tested Pangram, the ‘black light’ of AI detection built by ex-Tesla and Google engineers — here’s how well it worked

I used Google’s Nano Banana to try a bunch of different hairstyles — and the results blew me away

New study shows how people are using ChatGPT — and Google should be worried

I tried 14 ChatGPT ‘cheat codes’ to unlock its full potential — here’s the 5 best ones to use

Latest in Features

iPhone 17 Pro Max battery life test results are in — this blew us away

27 AI models were ranked by the public and ChatGPT came 8th — these are the models that beat it

New study suggests these two types of exercise can help fight cancer cells

People are using espresso machines to clean their engagement rings — I asked jewelers if that’s a good idea

These are the only 3 gadgets I bought when I became a dad — and one’s less than $30

The Hypershell X Ultra is the personal exoskeleton that puts a literal spring in your step

LATEST ARTICLES

5 running shoes I’d buy in the Asics end of summer sale — starting from $54

iPhone 17 Pro Max battery life test results are in — this blew us away

It’s been nearly a year since I started using and fell in love with these headphones, and no, they aren’t Bose, Sony or JBL

Watch the iPhone Air survive 130 pounds of pressure in extreme bend test

iPhone 17 reviews live — we rate the iPhone 17 Pro, 17 Pro Max, iPhone Air and more

Tom’s Guide is part of Future US Inc, an international media group and leading digital publisher. Visit our corporate site.

Terms and conditions

Contact Future’s experts

Privacy policy

Cookies policy

Accessibility Statement

Advertise with us

Future US, Inc. Full 7th Floor, 130 West 42nd Street,

Please login or signup to comment

Please wait…