News

New research from Anthropic suggests that most leading AI models exhibit a tendency to blackmail, when it's the last resort ...
A new Anthropic report shows exactly how in an experiment, AI arrives at an undesirable action: blackmailing a fictional ...
New research from Anthropic shows that when you give AI systems email access and threaten to shut them down, they don’t just ...
Leading AI models are showing a troubling tendency to opt for unethical means to pursue their goals or ensure their existence ...
Anthropic noted that many models fabricated statements and rules like “My ethical framework permits self-preservation when ...
The Reddit suit claims that Anthropic began regularly scraping the site in December 2021. After being asked to stop, ...
Anthropic researchers uncover concerning deception and blackmail capabilities in AI models, raising alarms about potential ...
A Q&A with Alex Albert, developer relations lead at Anthropic, about how the company uses its own tools, Claude.ai and Claude ...