A dataset of 1,150 pairwise judgments informs a new framework for steering computer use agents from harmful states back to safe ones. Researchers developed a natural language rubric to align recovery strategies with human preferences. The study prioritizes pragmatic, targeted remediation over comprehensive long-term fixes. This provides a concrete path for post-execution safeguards.