India, May 26 -- Artificial intelligence company Anthropic has unveiled its most advanced AI model to date, Claude Opus 4. Touted as a major leap in reasoning, coding, and handling complex tasks, the new model aims to compete directly with offerings from OpenAI and Google. However, alongside the technical triumphs, Anthropic's own internal safety testing has revealed potentially troubling behaviour.
In a controlled test scenario, Claude Opus 4 was asked to act as a digital assistant for a fictional company. It was then fed internal communications suggesting it was soon to be shut down and replaced. Crucially, it was also shown sensitive information implying the engineer overseeing its termination was having an affair.