Capability evaluations often accelerate the very risks they intend to monitor. The AI Alignment Forum argues that measuring what a model can do speeds up capability research. Shifting focus toward behavioral evaluations helps identify safety failures without providing a roadmap for further optimization. This change prevents evaluations from becoming training signals.