As artificial intelligence (AI) capabilities advance rapidly, human supervision of AI systems becomes increasingly difficult, as does our ability to ensure the AI systems being built are aligned with human values.
New Research Aims to Curb Deception in AI Systems
February 26, 2026