AI safety – The TechBriefs

The rise of Moltbook suggests viral AI prompts may be the next big security threat

We don’t need self-replicating AI models to have problems, just self-replicating prompts. Credit: Aurich Lawson | Moltbook On November 2, […]

Conflicting instructions? Expert explains how simple it could be to tweak Grok to block CSAM outputs. Credit: Aurich Lawson | […]

“A silly way to think about risk” “Widespread and powerful movement” keeps Trump from blocking state AI laws. A Donald […]

Lawsuits and safety concerns Character.AI was founded in 2021 by Noam Shazeer and Daniel De Freitas, two former Google engineers, […]

Earlier this month, the company unveiled a wellness council to address these concerns, though critics noted the council did not […]

“We are not in any way supported by or funded by Elon Musk and have a history of campaigning against […]

And speaking of cost, Haiku 4.5 is included for subscribers of the Claude web and app plans. Through the API […]

Independent AI researcher Simon Willison, reviewing the feature today on his blog, noted that Anthropic’s advice to “monitor Claude while […]

On Tuesday, OpenAI announced plans to roll out parental controls for ChatGPT and route sensitive mental health conversations to its […]

The company tested 123 cases representing 29 different attack scenarios and found a 23.6 percent attack success rate when browser […]