A new MosaicML study reveals that AI research agents frequently leak private training data through their outputs. These models inadvertently memorize sensitive strings, exposing them during complex reasoning tasks. This vulnerability undermines the promise of secure, autonomous agents. Developers must now implement stricter scrubbing and differential privacy to prevent critical data exfiltration in production.