The Google DeepMind interpretability team built simple agents that autonomously craft prompts to find behavioral differences between two models. Unlike static prompt distributions, these agents intelligently search for rare discrepancies. This approach allows auditors to validate model drift more reliably. Practitioners can now automate the discovery of subtle model regressions.