A new framework from Google evaluates how closely AI user simulators mimic human behavior in apparel shopping. The study identifies a persistent realism gap between synthetic agents and actual customers. This gap limits the utility of simulated testing for e-commerce. Researchers now have a concrete metric to refine generative AI agents for retail.