ConvApparel introduces a framework to quantify the realism gap between AI user simulators and actual humans. The researchers tested these simulators in a clothing retail context to identify where synthetic personas fail. This benchmark helps developers refine LLM-based agents. It ensures simulated training data better reflects real-world consumer behavior before deployment.