Capability evaluations often accelerate the very research they are meant to monitor. An argument made on the AI Alignment Forum holds that measuring what models can do creates dangerous externalities. Researchers should instead prioritize evaluating specific model behaviors. This shift lets safety practitioners identify risks without inadvertently providing a roadmap for increasing model power.