Capability evaluations often accelerate the very risks they intend to monitor. The AI Alignment Forum argues that measuring what a model can do speeds up research into those same capabilities. Safety researchers must instead prioritize behavioral evaluations. This shift helps practitioners identify dangerous tendencies before the underlying capabilities reach a critical, uncontrollable threshold.