On Tuesday, OpenAI announced plans to roll out parental controls for ChatGPT and route sensitive mental health conversations to its […]
Category: AI safety
New AI browser agents create risks if sites hijack them with hidden instructions
The company tested 123 cases representing 29 different attack scenarios and found a 23.6 percent attack success rate when browser […]
OpenAI admits ChatGPT safeguards fail during extended conversations
Adam Raine learned to bypass these safeguards by claiming he was writing a story—a technique the lawsuit says ChatGPT itself […]
Is AI really trying to escape human control and blackmail people?
Mankind behind the curtain. Opinion: Theatrical testing scenarios explain why AI models produce alarming outputs, and why we fall for them. […]
ChatGPT’s new AI agent can browse the web and create PowerPoint slideshows
On Thursday, OpenAI launched ChatGPT Agent, a new feature that lets the company’s AI assistant complete multi-step tasks by controlling […]
AI therapy bots fuel delusions and give dangerous advice, Stanford study finds
Popular chatbots serve as poor replacements for human therapists, but study authors call for nuance. When Stanford University researchers asked […]
Everything tech giants will hate about the EU’s new AI rules
The code also details expectations for AI companies to respect paywalls, as well as robots.txt instructions restricting crawling, which could […]
OpenAI ChatGPT o3 caught sabotaging shutdown in terrifying AI test
OpenAI has a very scary problem on its hands. A new experiment by PalisadeAI reveals that the company’s ChatGPT o3 […]
Researchers concerned to find AI models hiding their true “reasoning” processes
New Anthropic research shows one AI model conceals reasoning shortcuts 75% of the time. Remember when teachers […]
AWS Introduces DeepSeek-R1 as a Fully Managed Model in Amazon Bedrock
Amazon Web Services (AWS) has announced the availability of DeepSeek-R1 as a fully managed, serverless large language model (LLM) in […]
