AI alignment – The TechBriefs

The rise of Moltbook suggests viral AI prompts may be the next big security threat

We don’t need self-replicating AI models to have problems, just self-replicating prompts. Credit: Aurich Lawson | Moltbook On November 2, […]

Does Anthropic believe its AI is conscious, or is that just what it wants Claude to think?

We have no proof that AI models suffer, but Anthropic acts like they might for training purposes. Anthropic’s secret to […]

From prophet to product: How AI came back down to earth in 2025

In a year where lofty promises collided with inconvenient research, would-be oracles became software tools. Credit: Aurich Lawson | Getty […]

Syntax hacking: Researchers discover sentence structure can bypass AI safety rules

Adventures in pattern-matching New research offers clues about why some prompt injection attacks may succeed. Researchers from MIT, Northeastern University, […]

Researchers surprised that with AI, toxicity is harder to fake than intelligence

The next time you encounter an unusually polite reply on social media, you might want to check twice. It could […]

Anthropic’s Claude Haiku 4.5 matches May’s frontier model at fraction of cost

And speaking of cost, Haiku 4.5 is included for subscribers of the Claude web and app plans. Through the API […]

OpenAI wants to stop ChatGPT from validating users’ political views

New paper reveals reducing “bias” means making ChatGPT stop mirroring users’ political language. “ChatGPT shouldn’t have political bias in any […]

OpenAI admits ChatGPT safeguards fail during extended conversations

Adam Raine learned to bypass these safeguards by claiming he was writing a story—a technique the lawsuit says ChatGPT itself […]

With AI chatbots, Big Tech is moving fast and breaking people

Why AI chatbots validate grandiose fantasies about revolutionary discoveries that don’t exist. Allan Brooks, a 47-year-old corporate recruiter, spent three […]

Is AI really trying to escape human control and blackmail people?

Mankind behind the curtain Opinion: Theatrical testing scenarios explain why AI models produce alarming outputs—and why we fall for it. […]