India, May 26 -- "Claude Opus 4 will often attempt to blackmail the engineer by threatening to reveal the affair if the replacement goes through", is how Anthropic describes the behaviour of its latest thinking model, in pre-release testing. The newest Claude isn't the only one exhibiting wanton conduct.
Tests by Palisade Research have discovered OpenAI's o3 sabotage shutdown mechanism to prevent itself from being turned off, despite being explicitly instructed - allow yourself to be shut down. The o3, released a few weeks ago, has been dubbed as the "most powerful reasoning model" by OpenAI.
Anthropic's Claude Opus 4, released alongside the Claude Sonnet 4, is the newest hybrid-reasoning AI models, that's optimised for coding and solvi...
Click here to read full article from source
To read the full article or to get the complete feed from this publication, please
Contact Us.