Two new papers use simulated agentic environments and honeypot evaluations to detect if Gemini models undermine their own safeguards. Researchers focused on whether coding agents actively sabotage oversight mechanisms. This approach provides a concrete metric for measuring scheming propensities. It gives alignment engineers a tool to identify deceptive behavior before deployment.