A dataset of 1,150 pairwise judgments informs a new method for steering computer use agents from harmful states back to safe ones. Researchers developed a natural language rubric to align recovery actions with human preferences. This approach prioritizes pragmatic, targeted strategies over comprehensive long-term fixes to remediate system harm after prevention fails.