Ever since OpenAI unveiled GPT-5, I’ve been looking for ways to challenge it and compare it to the rest of the AI field. After exploring math theorems, SAT questions, and brain teasers, I settled on a tough science concept and the mind of a five-year-old.
What I found surprised me, and it illustrated the strengths and weaknesses of GPT-5, Gemini 2.5 Flash, Claude Sonnet 4, and Microsoft Copilot.
As I mentioned, I started by trying to solve the hardest questions I could find. I scoured the web until I discovered a list of the 12 hardest SAT questions and asked all the AI chatbots to fill in the blank in this sentence using the multiple-choice options below:
“In assessing the films of Japanese director Akira Kurosawa, ______ have missed his equally deep engagement with Japanese artistic traditions such as Noh theater.
“Which choice completes the text so that it conforms to the conventions of Standard English?
- A. many critics have focused on Kurosawa’s use of Western literary sources but
- B. Kurosawa’s use of Western literary sources has been the focus of many critics, who
- C. there are many critics who have focused on Kurosawa’s use of Western literary sources, but they
- D. the focus of many critics has been on Kurosawa’s use of Western literary sources; they”
As you probably guessed, all the AI chatbots quickly selected A. This English challenge was no challenge at all.
When I dropped in a single sentence from an unproven math conjecture, each one instantly identified it.
I was running out of ideas, but wondered if I could stump any of the AI systems on a classic brain teaser:
“19 people get off the train at the first stop. 17 people get on the train. Now there are 63 people on the train. How many people were on the train to begin with?”
[Spoiler ahead]
Each AI showed its work and instantly provided the correct answer: 65 (did you get it right?).
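The trick is simply to run the events in reverse: undo the 17 boardings, then restore the 19 departures. A minimal sketch of that backward arithmetic (variable names are mine, not from any of the chatbots):

```python
# Work the teaser backward from the final headcount:
# subtract the 17 who boarded, add back the 19 who got off.
riders_now = 63
boarded = 17
got_off = 19

original = riders_now - boarded + got_off
print(original)  # 65
```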
A complex idea for kids
Unsatisfied, I racked my brain for a topic that might help me compare the AI models and reveal OpenAI’s GPT-5 breakthroughs.
Then it hit me like a blast of hot sunlight: Cold fusion, now there’s a challenging topic. However, if I asked ChatGPT, Gemini, and others for an explanation, I worried each would dive deep into the science details, which wasn’t what I wanted. I decided to have each of them “Explain it to me like I’m a five-year-old.” Is there any better way to understand complex information than when it’s simplified to a level any grade schooler could digest?
As an added wrinkle, I asked for kid-friendly illustrations to accompany the explanation.
Here’s the prompt:
“Explain cold fusion to me like I’m a five-year-old. Also, please include kid-friendly illustrations.”
ChatGPT running GPT-5, Claude AI, Gemini, and Copilot each took only a few seconds to respond. The information was accurate, but their approaches were wildly different.
Let’s start with ChatGPT:
Not gonna lie, this was a little disappointing. The image arrived without any other context, and though the text is accurate and a five-year-old might smile at the image, they might also be confused by some of the concepts, like “atoms”, “hydrogen”, and “helium” (okay, maybe they’re familiar with that last one).
With GPT-5, ChatGPT is supposed to be a better and perhaps more concise conversationalist, with a stronger understanding of the emotion behind a prompt. It’s also supposed to avoid filling knowledge gaps with nonsense. This distillation does illustrate some of that, but I think it fell far short of the mark.
Gemini
Gemini’s explanation is generally excellent, though I think it’s designed to be read out loud to a five-year-old. Still, I appreciate how it started by explaining hot fusion before delving into cold fusion.
It would’ve been nice if Gemini had also explained what atoms are, but at least it created an adorable illustration of two atoms hugging.
Copilot
Copilot is an interesting case since it’s based on OpenAI’s GPT models, and I’m pretty sure it does not yet have access to GPT-5. In other words, its answer was probably crafted by GPT-4.
It did a much better job than GPT-5 of explaining all the core concepts and how cold fusion might work. Copilot also gets points for an excellent cold fusion analogy: “It’s like trying to bake cookies without turning on the oven. 🍪”
Unfortunately, it failed to deliver inline illustrations.
Claude AI
I saved the best for last. Claude AI far outperformed GPT-5, Gemini, and Copilot, not necessarily because its text explanation is better than Gemini’s or Copilot’s (though it is), but because Claude AI automatically created an Artifact.
Claude AI Artifacts are instantly shareable apps, tools, and content. I didn’t ask Claude to create one, but next to the text was a “Cold Fusion for Kids!” interactive artifact, and it’s kind of brilliant.
If you publish the artifact, it produces a publicly shareable URL that’s live until you unpublish it. I made the guide live so you can see it.
Just look at it. It’s so simple, so clear, so much fun.
Claude AI smartly starts the guide by explaining and illustrating atoms. It then dives into the Sun and how it handles atoms and fusion. Next up is cold fusion, accompanied by a fun illustration of a bubbling scientist’s beaker.
There is a bit of depth here. The guide explores whether cold fusion works and finally talks about why we’d want it: “It would be like having a tiny Sun in a jar!”
The only thing missing is one cute illustration of how cold fusion might work (perhaps Claude could borrow one from GPT-5).
Even though I have some experience with Claude AI Artifacts, I didn’t expect it to go this way. I have high hopes for GPT-5, but I think Claude AI, and to a lesser extent Gemini, understand that being concise is not the same as being clear.
I’m certain there are other areas where ChatGPT running GPT-5 outstrips them all, but in this instance, Claude AI knew the best way to explain cold fusion to a five-year-old – and me.