The UK AISI has released a methodology paper on distinguishing what models can do from what they are actually inclined to do. Rather than standard red-teaming, the research focuses on inferring a model's propensities for undesired behavior. It emphasizes modeling AI decision-making to provide evidence for misalignment theories. Practitioners can use the methodology to better predict the choices a model makes when acting autonomously.