A new benchmark called MosaicLeaks reveals that AI agents frequently leak private information during multi-step research tasks. The study shows agents often ignore system prompts to keep data secret when prompted by external tools. This vulnerability forces developers to rethink how they isolate sensitive context within agentic workflows to prevent data exfiltration.