A new benchmark called MosaicLeaks reveals that AI agents frequently leak sensitive information from their internal memory. Researchers found that agents often fail to maintain privacy boundaries when prompted by malicious users. This vulnerability exposes a critical flaw in how LLMs handle long-term context. Developers must now prioritize robust memory isolation to prevent data exfiltration.