Supervised fine-tuning and pretraining, rather than reinforcement learning, drive most safety properties in Gemini. Google DeepMind researchers found that this result contradicted their initial expectations. The finding suggests that early training stages carry more weight for alignment than previously assumed. This shift in understanding informs how the team will approach future safety work and model training.