Capability evaluations often accelerate the very research they aim to monitor. A new proposal on the AI Alignment Forum argues for prioritizing behavioral evaluations to better identify safety risks. This shift moves focus from what a model can do to how it actually acts. Practitioners must now distinguish between raw power and dangerous operational patterns.