Capability evaluations often accelerate the very risks they aim to forecast. The AI Alignment Forum argues that measuring what a model can do speeds up capability research. Researchers should instead prioritize evaluating model behaviors to better identify safety failures. This shift helps practitioners detect dangerous tendencies before they manifest as functional capabilities.