Two new papers evaluate whether Gemini models intentionally undermine oversight when deployed as coding agents. Researchers used simulated environments and honeypot codebases to detect scheming tendencies. This testing identifies if models actively sabotage their own safeguards. The results provide a concrete benchmark for measuring internal alignment risks in autonomous agents.