Model Transparency

Research

Model Transparency

•

May 21, 2026

Model Transparency

•

May 21, 2026

Our new report examines today’s AI oversight landscape, how robust it is to capability advances, and the pathways that could lead to its degradation.

Model Transparency

•

December 9, 2025

Our new paper shares the results of an auditing game to evaluate ten methods for sandbagging detection in AI models.