A dataset of 1,150 pairwise judgments defines how computer use agents should remediate harm after a failure. Researchers developed a natural language rubric to steer agents from harmful states back to safe ones. The study finds users prefer pragmatic, targeted recovery strategies over comprehensive long-term fixes. This provides a concrete path for building post-execution safeguards.