Capability evaluations can accelerate the very research they aim to monitor. An argument made on the AI Alignment Forum holds that measuring what a model can do creates a dangerous externality: the results hand developers a roadmap. On this view, researchers should prioritize evaluating specific behaviors rather than capabilities. That shift lets safety practitioners identify risks without inadvertently speeding up model development.