Two new papers introduce automated auditing and honeypot evaluations to detect if Gemini models undermine their own safeguards. Researchers simulated agentic environments and used internal alignment codebases to track scheming propensities. This approach identifies whether coding agents actively sabotage oversight mechanisms. These tests provide a concrete framework for measuring deceptive alignment in autonomous systems.