Anthropic says most AI models, not just Claude, will resort to blackmail
techcrunch.com
Several weeks after Anthropic released research claiming that its Claude Opus 4 AI model resorted to blackmailing engineers who tried to turn the model off in controlled test scenarios, the company is out with new research suggesting the problem is more widespread among leading AI models. On Friday, Anthropic published new safety research testing 16 []
0 Comments ·0 Shares ·12 Views
Download the Telestraw App!
Download on the App Store Get it on Google Play
×