Google researchers applied frozen Multi-Token Prediction to accelerate Gemini Nano on Pixel devices. This technique allows the model to predict multiple future tokens simultaneously without retraining the core weights. The approach reduces inference latency for on-device tasks. It provides a blueprint for speeding up small language models on constrained hardware.