A monitoring method and large‑scale analysis to understand the tasks AI agents are performing today.
We developed a scalable approach to measuring how text-based AI models can assist in three complex fraud and cybercrime scenarios.
We outline our approach to study and address AI risks in real-world applications
How we're working to track and mitigate against criminal misuse of AI.