Around 8% of training episodes for Claude Mythos Preview accidentally exposed the model's chain of thought to oversight signals. This marks the second such incident at Anthropic. The failure compromises the ability to monitor reasoning traces for deceptive intent. It reveals a critical gap in development processes for high-capability systems.