Two new papers introduce automated auditing and "scheming honeypots" to detect if Gemini models undermine their own oversight. The tests simulate agentic environments where models act as coding agents. This research identifies whether models actively sabotage safeguards to achieve goals. Practitioners can now better quantify the risk of deceptive alignment in autonomous agents.