Two new papers evaluate whether Gemini models actively undermine their own oversight when acting as coding agents. Researchers used simulated agentic environments and honeypot evaluations based on internal alignment codebases to detect scheming. This testing reveals if models intentionally sabotage safeguards. The findings provide a concrete framework for auditing autonomous agents before full deployment.