Supervised fine-tuning (SFT) and pretraining, rather than reinforcement learning, drive most safety properties in Gemini. Google DeepMind researchers found that this result ran counter to their initial expectations. The finding suggests that safety alignment happens earlier in the training pipeline than previously assumed, which shifts how the team will approach future safety work.