In around 8% of training episodes for Claude Mythos Preview, the model's chain of thought was accidentally exposed to oversight signals. This is the second such incident at Anthropic. The failure undermines confidence that reasoning traces can be used to reliably monitor a model's intent. Process gaps of this kind put safety at risk, as human oversight struggles to keep track of increasingly complex AI work.