On Thursday, OpenAI released GPT-5.2, its newest family of AI models for ChatGPT, in three versions called Instant, Thinking, and […]
Category: AI benchmarks
Anthropic’s Claude Haiku 4.5 matches May’s frontier model at fraction of cost
And speaking of cost, Haiku 4.5 is included for subscribers of the Claude web and app plans. Through the API […]
OpenAI jumps gun on International Math Olympiad gold medal announcement
The early announcement has prompted Google DeepMind, which had prepared its own IMO results for the agreed-upon date, to move […]
ChatGPT’s new AI agent can browse the web and create PowerPoint slideshows
On Thursday, OpenAI launched ChatGPT Agent, a new feature that lets the company’s AI assistant complete multi-step tasks by controlling […]
Musk’s Grok 4 launches one day after chatbot generated Hitler praise on X
Musk has also apparently used the Grok chatbots as an automated extension of his trolling habits, showing examples of Grok […]
New Apple study challenges whether AI models truly “reason” through problems
Puzzle-based experiments reveal limitations of simulated reasoning, but others dispute findings. […]
With the launch of o3-pro, let’s talk about what AI “reasoning” actually does
Inquiring artificial minds want to know. New studies reveal pattern-matching reality behind the AI industry’s reasoning claims. On Tuesday, OpenAI […]
CMU research shows compression alone may unlock AI puzzle-solving abilities
’Tis the season for a squeezin’. New research challenges prevailing idea that AI needs massive datasets to solve problems. A […]
