Supervised Fine-Tuning and pretraining create most safety properties in Gemini, according to Google DeepMind. This finding contradicts initial expectations that Reinforcement Learning played a larger role. The result shifts how the interpretability team approaches safety work. Practitioners should note that these specific training dynamics may vary across different model families.