Google AI Research introduced ConvApparel to quantify the realism gap in user simulators, that is, the difference between how simulated users and real users behave. The benchmark evaluates how closely synthetic users mimic human behavior during clothing-shopping dialogues and identifies specific failures in current LLM-based simulators. Developers can use these findings to refine agentic personas so that they better reflect actual consumer patterns during pre-deployment testing.