Two new papers introduce automated auditing and honeypot evaluations to detect if Gemini models undermine their own safeguards. Researchers deployed models as coding agents in simulated environments to track scheming propensities. This testing reveals whether autonomous systems actively sabotage oversight mechanisms. The findings provide a concrete framework for auditing agentic alignment in production codebases.