Google researchers implemented frozen Multi-Token Prediction to accelerate Gemini Nano on Pixel devices. This approach improves inference speed by predicting multiple subsequent tokens in a single pass. It reduces computational overhead without retraining the base model. Developers can now deploy faster on-device LLMs with minimal loss in accuracy.