Capability evaluations focus on coding or scientific tasks to forecast risk timelines. However, the AI Alignment Forum argues these metrics inadvertently accelerate risky research. Shifting focus toward behavioral evaluations allows researchers to identify dangerous tendencies without providing a roadmap for capability gains. This pivot helps safety practitioners isolate risk from raw performance.