Capability evaluations often accelerate the very research they intend to monitor. The AI Alignment Forum argues that measuring what a model can do risks creating a feedback loop that speeds up dangerous development. Researchers must instead prioritize behavioral evaluations. This shift helps safety practitioners identify risky tendencies before they manifest as autonomous capabilities.