Capability evaluations often inadvertently accelerate the development of risky AI features. The AI Alignment Forum argues that focusing on how models behave, rather than just what they can do, provides a safer framework for risk forecasting. This shift helps researchers identify dangerous tendencies before they scale. Practitioners should prioritize behavioral monitoring to avoid fueling capability races.