Replaying previous user conversations with candidate models allows labs to preview real-world behavior before public release. This simulation method complements traditional red-teaming and targeted evaluations. AI Alignment Forum researchers use this to identify emerging risks as capabilities scale. It provides a concrete signal for safety reviews, reducing the chance of unexpected failures upon launch.