Supervised Fine-Tuning and pretraining drive most safety properties in Gemini, according to Google DeepMind. This finding contradicts internal expectations that reinforcement learning played a primary role. The result suggests a simpler path to safety alignment. Researchers will now adjust their safety workflows to prioritize these earlier training stages over later RL phases.