News
Opinion · Gadget on MSN: Why you can’t trust Grok 4’s benchmarks
On paper, the AI platform created by Elon Musk’s xAI shoots the lights out, but it’s a different matter in practice, writes ...
Elon Musk and xAI employees announced on Wednesday night the launch of the startup's next flagship AI model, Grok 4.
Last week, Elon Musk’s xAI released the long-awaited Grok 4. And from our perspective, it likely marked the moment AI ...
Google DeepMind has rolled out Gemini 2.5 Deep Think, claiming it to be a major upgrade in terms of advanced AI reasoning.
Rhea Purohit in Vibe Check: Grok 4 is topping some big AI benchmarks. So why have the responses to it been so mixed? And how come ...
xAI owner Elon Musk made some big claims about Grok’s capability, saying it was “better than PhD level in every subject.” ...
Grok 4, the latest AI model from Elon Musk’s xAI, achieves innovative performance on benchmarks, outperforming competitors like Opus 4 and Gemini 2.5 Pro in reasoning and problem-solving tasks.
Grok 4 by xAI was released on July 9, and it's surged ahead of competitors like DeepSeek and Claude at LMArena, a leaderboard ...
Grok 4’s performance is unparalleled across a variety of disciplines. It demonstrates postgraduate-level intelligence in reasoning, mathematics, and science, excelling in rigorous benchmarks ...
Grok 4 is a huge leap from Grok 3, but how good is it compared to other models in the market, such as Gemini 2.5 Pro? We now have answers, thanks to new independent benchmarks.
If these leaked Grok 4 benchmarks are correct (95 AIME, 88 GPQA, 75 SWE-bench), then xAI has the most powerful model on the market. The GPQA for Grok and SWE-bench rankings for Grok 4 code will also ...
Elon Musk has launched xAI’s Grok 4—calling it the “world’s smartest AI” and claiming it can ace Ph.D.-level exams and outpace rivals such as Google’s Gemini and OpenAI’s o3 on tough ...