Control

Research

Control

•

December 16, 2025

Our new paper shares findings from an adversarial evaluation of monitoring systems for detecting sabotage by AI coding agents.

Control

•

October 22, 2025

Our dedicated library to make AI control experiments easy, consistent, and repeatable.

Control

•

July 10, 2025

An introduction to white box control, and an update on our research so far.

Control

•

April 11, 2025

Our new paper outlines how AI control methods can mitigate misalignment risks as capabilities of AI systems increase