Financial Times
Dec 16, 2023 - 14 min
Making a safe AI company is easier said than done. Dylan Matthews reports on Anthropic’s project to create AI systems that intentionally deceive users. Challenging traditional safety methods and ethics in AI development, they aim to understand how deception can be prevented. Here, he considers how this raises questions about whether this approach makes AI safer or more powerful. Read by Sean Crisden.