Capability evaluations often accelerate the very research they aim to monitor. An argument made on the AI Alignment Forum holds that measuring what a model can do risks speeding up dangerous development, since the results can double as a roadmap for capability gains. Researchers should instead prioritize evaluating specific model behaviors, which helps safety practitioners identify risks without providing that roadmap.