Current AI evaluations focus heavily on capabilities such as coding and scientific reasoning. This focus risks accelerating dangerous capabilities, because capability benchmarks double as a roadmap for researchers working to improve those same capabilities. Writing on the AI Alignment Forum argues that shifting the focus toward behavioral evaluations would better identify safety risks. On this view, practitioners must decouple capability benchmarks from safety monitoring so that measuring risk does not accidentally accelerate it.