The Agent Humanization Benchmark (AHB) evaluates whether mobile GUI agents can mimic human touch dynamics to avoid detection. Researchers found that standard LMM-based agents are easily flagged due to unnatural kinematics. This framework treats anti-detection as a MinMax optimization problem, forcing developers to refine how agents physically interact with screens.