Capability evaluations often inadvertently accelerate the development of the very risks they aim to track. The AI Alignment Forum argues that measuring what a model can do speeds up research into those same abilities. Shifting focus toward behavioral evaluation allows researchers to identify safety failures without providing a roadmap for increasing model power.