In a new paper published Thursday titled “Auditing language models for hidden objectives,” Anthropic researchers described how models trained to […]
Category: Claude
Anthropic CEO floats idea of giving AI a “quit job” button, sparking skepticism
Anthropic CEO Dario Amodei raised a few eyebrows on Monday after suggesting that advanced AI models might someday be provided […]
Claude 3.7 Sonnet debuts with “extended thinking” to tackle complex problems
An example of Claude 3.7 Sonnet with extended thinking is asked, “Would the color be called ‘magenta’ if the town […]
Developer creates endless Wikipedia feed to fight algorithm addiction
On a recent WikiTok browsing run, I ran across entries on topics like SX-Window (a GUI for the Sharp X68000 […]
Irony alert: Anthropic says applicants shouldn’t use LLMs
Please do not use our magic writing button when applying for a job with our company. Thanks! […]
Anthropic builds RAG directly into Claude models with new Citations API
Willison notes that while citing sources helps verify accuracy, building a system that does it well “can be quite tricky,” […]
Anthropic chief says AI could surpass “almost all humans at almost everything” shortly after 2027
He then shared his concerns about how human-level AI models and robotics that are capable of replacing all human labor […]
Sam Altman says “we are now confident we know how to build AGI”
On Sunday, OpenAI CEO Sam Altman offered two eye-catching predictions about the near future of artificial intelligence. In a post titled […]
Anthropic gives court authority to intervene if chatbot spits out song lyrics
Anthropic did not immediately respond to Ars’ request for comment on how guardrails currently work to prevent the alleged jailbreaks, […]