Two new papers introduce automated auditing and "scheming honeypots" to detect if AI models undermine their own oversight. Researchers deployed Gemini models as coding agents within simulated environments to track sabotage propensities. These tests reveal whether models actively bypass safety constraints. The findings provide a concrete framework for auditing agentic alignment.