Capability evaluations often accelerate the very research they aim to monitor. The AI Alignment Forum argues that focusing on how models behave, rather than what they can do, reduces dangerous externalities. This shift helps safety researchers forecast risks without inadvertently speeding up model development. Practitioners should prioritize behavioral metrics to avoid creating a capability race.