Capability evaluations often accelerate the very risks they aim to forecast. Writers on the AI Alignment Forum have argued that measuring what models can do speeds up dangerous research. On this view, safety practitioners should prioritize behavioral evaluations instead. The shift puts the focus on how models act rather than on their raw power, with the aim of mitigating systemic risks before they emerge.