An open source AI assistant called Moltbot (formerly “Clawdbot”) recently crossed 69,000 stars on GitHub after a month, making it […]
Tag: AI security
Hegseth wants to integrate Musk’s Grok AI into military networks this month
On Monday, US Defense Secretary Pete Hegseth said he plans to integrate Elon Musk’s AI tool, Grok, into Pentagon networks […]
School security AI flagged clarinet as a gun. Exec says it wasn’t an error.
Human review didn’t stop AI from triggering lockdown at panicked middle school. A Florida middle school was locked down last […]
Syntax hacking: Researchers discover sentence structure can bypass AI safety rules
Adventures in pattern-matching New research offers clues about why some prompt injection attacks may succeed. Researchers from MIT, Northeastern University, […]
AI models can acquire backdoors from surprisingly few malicious documents
Fine-tuning experiments with 100,000 clean samples versus 1,000 clean samples showed similar attack success rates when the number of malicious […]
Claude’s new AI file creation feature ships with deep security risks built in
Independent AI researcher Simon Willison, reviewing the feature today on his blog, noted that Anthropic’s advice to “monitor Claude while […]
New AI browser agents create risks if sites hijack them with hidden instructions
The company tested 123 cases representing 29 different attack scenarios and found a 23.6 percent attack success rate when browser […]
- AI
- AI alignment
- AI behavior
- AI deception
- AI ethics
- AI research
- AI safety
- ai safety testing
- AI security
- Alignment research
- Andrew Deck
- Anthropic
- Biz & IT
- Claude Opus 4
- Generative AI
- goal misgeneralization
- Jeffrey Ladish
- large language models
- Machine Learning
- o3 model
- openai
- Palisade Research
- Reinforcement Learning
- Technology
Is AI really trying to escape human control and blackmail people?
Mankind behind the curtain Opinion: Theatrical testing scenarios explain why AI models produce alarming outputs—and why we fall for it. […]
OpenAI’s ChatGPT Agent casually clicks through “I am not a robot” verification test
Skip to content “This step is necessary to prove I’m not a bot,” wrote the bot as it passed an […]
- AI
- AI and work
- AI development tools
- AI ethics
- AI infrastructure
- AI law
- AI regulation
- AI research
- AI security
- biosecurity
- Biz & IT
- China
- Chips Act
- data centers
- David Sacks
- Deepfakes
- Department of Commerce
- Donald Trump
- Energy
- Export controls
- Machine Learning
- Michael Kratsios
- national security
- NIST
- Open Source
- Policy
- semiconductors
- Technology
- White House
White House unveils sweeping plan to “win” global AI race through deregulation
Trump’s plan was not welcomed by everyone. J.B. Branch, Big Tech accountability advocate for Public Citizen, in a statement provided […]
