Capability evaluations often accelerate the very research they aim to monitor. An argument made on the AI Alignment Forum holds that measuring what a model can do creates dangerous externalities, because a published measure of capability also tells developers what to optimize for. Researchers should instead prioritize evaluating specific behaviors. This shift helps safety practitioners identify risks without inadvertently providing a roadmap for increasing model power.