Google researchers used frozen Multi-Token Prediction (MTP) to accelerate Gemini Nano on Pixel devices. The technique lets the model predict several future tokens in a single decoding step, and because the existing network is kept frozen, it does not require retraining the whole model. Producing more than one token per step reduces inference latency for on-device tasks. For practitioners, this makes longer generated sequences more practical on mobile hardware, with less of a trade-off in speed or battery life.
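To make the idea concrete, here is a minimal toy sketch of multi-token decoding on top of a frozen base model. It is not Google's implementation, and the summary above does not describe the actual mechanism in detail. Everything in it is an assumption made for illustration: the names `base_next`, `draft_heads`, `decode_greedy`, and `decode_mtp` are hypothetical, the "model" is a simple arithmetic rule rather than a neural network, and the draft-then-verify loop is one common way to use extra token predictions, borrowed from speculative decoding.

```python
from typing import List, Tuple

VOCAB = 50  # toy vocabulary size (assumption)


def base_next(ctx: List[int]) -> int:
    """Stand-in for the frozen base model: a deterministic greedy next token."""
    return (ctx[-1] * 7 + len(ctx)) % VOCAB


def draft_heads(ctx: List[int], k: int) -> List[int]:
    """Stand-in for k extra prediction heads that guess the next k tokens.

    The guesses are deliberately wrong at some positions so that the
    verification step below has something to reject.
    """
    guesses, rolled = [], list(ctx)
    for i in range(k):
        tok = base_next(rolled)
        rolled.append(tok)
        corrupt = (len(ctx) + i) % 4 == 0
        guesses.append((tok + 1) % VOCAB if corrupt else tok)
    return guesses


def decode_greedy(prompt: List[int], n_new: int) -> List[int]:
    """Baseline: one base-model step per generated token."""
    out = list(prompt)
    for _ in range(n_new):
        out.append(base_next(out))
    return out


def decode_mtp(prompt: List[int], n_new: int, k: int) -> Tuple[List[int], int]:
    """Draft k tokens per step, then keep only what the base model agrees with.

    Returns the generated sequence and the number of draft/verify steps.
    In a real system each step would be one batched forward pass; here the
    check is a plain loop.
    """
    out = list(prompt)
    steps = 0
    while len(out) - len(prompt) < n_new:
        draft = draft_heads(out, k)
        steps += 1
        accepted: List[int] = []
        for guess in draft:
            true_tok = base_next(out + accepted)
            accepted.append(true_tok)  # always keep the base model's token
            if guess != true_tok:
                break  # first mismatch ends this step
        out.extend(accepted)
    return out[: len(prompt) + n_new], steps


if __name__ == "__main__":
    prompt = [1, 2, 3]
    baseline = decode_greedy(prompt, 20)
    fast, steps = decode_mtp(prompt, 20, k=3)
    print("same output as baseline:", fast == baseline)
    print("decoding steps used:", steps, "vs", 20)
```

Because every kept token is the one the base model would have chosen, the output matches plain greedy decoding; the saving comes only from needing fewer decoding steps when the draft guesses are right. How much this helps on a real device depends on how often the drafts are accepted, which this toy does not model.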