Supervised Fine-Tuning and pretraining drive most safety properties in Gemini, rather than reinforcement learning. Google DeepMind researchers found this result contradicted their initial expectations. The discovery shifts how the team approaches safety training. Practitioners should note that RL may play a smaller role in alignment than previously assumed for this model family.