AI benchmarks – The TechBriefs

New scorecard allows manufacturers to assess the benefits of AI before implementing

As organizations race to adopt AI, many struggle with fragmented workflows, inconsistent data, and hidden coordination effort across departments. AI […]

AI companies want you to stop chatting with bots and start managing them

Claude Opus 4.6 and OpenAI Frontier pitch a future of supervising AI agents. On Thursday, Anthropic and OpenAI shipped products […]

OpenAI releases GPT-5.2 after “code red” Google threat alert

On Thursday, OpenAI released GPT-5.2, its newest family of AI models for ChatGPT, in three versions called Instant, Thinking, and […]

Anthropic’s Claude Haiku 4.5 matches May’s frontier model at fraction of cost

And speaking of cost, Haiku 4.5 is included for subscribers of the Claude web and app plans. Through the API […]

OpenAI jumps gun on International Math Olympiad gold medal announcement

The early announcement has prompted Google DeepMind, which had prepared its own IMO results for the agreed-upon date, to move […]

ChatGPT’s new AI agent can browse the web and create PowerPoint slideshows

On Thursday, OpenAI launched ChatGPT Agent, a new feature that lets the company’s AI assistant complete multi-step tasks by controlling […]

Musk’s Grok 4 launches one day after chatbot generated Hitler praise on X

Musk has also apparently used the Grok chatbots as an automated extension of his trolling habits, showing examples of Grok […]

New Apple study challenges whether AI models truly “reason” through problems

Puzzle-based experiments reveal limitations of simulated reasoning, but others dispute findings. An illustration of Tower of Hanoi from Popular Science […]

With the launch of o3-pro, let’s talk about what AI “reasoning” actually does

inquiring artificial minds want to know New studies reveal pattern-matching reality behind the AI industry’s reasoning claims. On Tuesday, OpenAI […]

CMU research shows compression alone may unlock AI puzzle-solving abilities

Tis the season for a squeezin’ New research challenges prevailing idea that AI needs massive datasets to solve problems. A […]