ConvApparel introduces a framework to quantify the realism gap between AI user simulators and actual humans. The research identifies specific failures in how simulators mimic human conversational patterns during shopping tasks. These findings help developers build more accurate synthetic datasets for training LLM-based agents without relying on costly human trials.