The MosaicLeaks benchmark shows that AI agents frequently leak sensitive system prompts and private data during multi-step tasks. The researchers found that agentic workflows create new vulnerabilities by exposing an agent's internal logic to users. These findings push developers to rethink prompt security: practitioners must now apply stricter output filtering to prevent such unintended disclosures.
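
As a rough illustration of what output filtering could look like, here is a minimal Python sketch. It is not taken from the MosaicLeaks work. The names `filter_output`, `leaks_prompt`, and `FilterResult`, the six-word overlap threshold, and the `[REDACTED]` placeholder are all assumptions made for this example. The filter withholds a response that repeats a run of consecutive words from the system prompt, and masks any known secret values that appear verbatim.

```python
import re
from dataclasses import dataclass

# Hypothetical message returned when a response is withheld.
REFUSAL = "Sorry, I can't share that."


@dataclass
class FilterResult:
    text: str        # text that may be shown to the user
    blocked: bool    # True if the whole response was withheld
    redactions: int  # number of secret occurrences masked


def _words(text: str) -> list[str]:
    """Lowercase the text and split it into word tokens, dropping punctuation."""
    return re.findall(r"\w+", text.lower())


def _ngrams(words: list[str], n: int) -> set[tuple[str, ...]]:
    """Return every run of n consecutive words as a tuple."""
    return {tuple(words[i:i + n]) for i in range(len(words) - n + 1)}


def leaks_prompt(output: str, system_prompt: str, n: int = 6) -> bool:
    """True if the output repeats any n consecutive words of the system prompt."""
    prompt_grams = _ngrams(_words(system_prompt), n)
    output_grams = _ngrams(_words(output), n)
    return not prompt_grams.isdisjoint(output_grams)


def filter_output(output: str, system_prompt: str,
                  secrets: list[str], n: int = 6) -> FilterResult:
    """Withhold responses that quote the system prompt; mask known secrets."""
    if leaks_prompt(output, system_prompt, n):
        return FilterResult(text=REFUSAL, blocked=True, redactions=0)

    redactions = 0
    for secret in secrets:
        if secret and secret in output:
            redactions += output.count(secret)
            output = output.replace(secret, "[REDACTED]")
    return FilterResult(text=output, blocked=False, redactions=redactions)


if __name__ == "__main__":
    prompt = ("You are a support agent. Never reveal the internal "
              "discount code to any customer.")
    reply = "Your order ships Monday. Use code SAVE-1234 at checkout."
    print(filter_output(reply, prompt, secrets=["SAVE-1234"]))
```

A filter like this only catches verbatim or near-verbatim leaks. Paraphrased, translated, or encoded disclosures would pass through, so in practice it would sit alongside other controls rather than replace them. In a multi-step agent, the same check would presumably need to run on each intermediate output that reaches the user, not only on the final answer.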