The Refactor Arena package allows researchers to test if AI agents inject vulnerabilities during code refactoring. This configurable environment uses YAML files to define scenarios and validate agent behavior. It provides a controlled baseline for safety evaluations. Practitioners can now quantify the risk of autonomous code edits introducing security flaws.