Google researchers used frozen Multi-Token Prediction (MTP) to accelerate Gemini Nano on Pixel devices. With MTP, the model predicts several future tokens per decoding step instead of one. "Frozen" here means the base model's weights are left unchanged, so the technique does not require full retraining. Because each step can yield more than one token, inference latency drops for on-device tasks. Developers can deploy small on-device models that run faster locally while maintaining high accuracy.
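
The summary above does not spell out the mechanism, so the following is only a toy sketch of one common way multi-token prediction can reduce decoding steps: extra heads propose several tokens at once, and the unchanged base model checks them. Everything here is hypothetical and chosen for illustration (`base_next`, `draft_heads`, the arithmetic "model", the draft-and-verify scheme itself); it is not Gemini Nano's implementation or any Google API.

```python
from typing import List, Tuple

VOCAB = 7  # toy vocabulary size (assumption for this sketch)


def base_next(context: List[int]) -> int:
    """Hypothetical stand-in for the frozen base model: a fixed next-token rule."""
    return (context[-1] * 3 + 1) % VOCAB


def draft_heads(context: List[int], k: int) -> List[int]:
    """Hypothetical stand-in for trained MTP heads: guess the next k tokens at once.

    The guesses are deliberately imperfect: the second guess is wrong whenever
    the context length is even, so the verifier has something to reject.
    """
    guesses, ctx = [], list(context)
    for _ in range(k):
        token = base_next(ctx)
        guesses.append(token)
        ctx.append(token)
    if k >= 2 and len(context) % 2 == 0:
        guesses[1] = (guesses[1] + 1) % VOCAB
    return guesses


def greedy_decode(prompt: List[int], n: int) -> Tuple[List[int], int]:
    """Baseline: one base-model pass per generated token."""
    out, passes = list(prompt), 0
    while len(out) - len(prompt) < n:
        out.append(base_next(out))
        passes += 1
    return out, passes


def mtp_decode(prompt: List[int], n: int, k: int = 3) -> Tuple[List[int], int]:
    """Draft k tokens, then verify them against the base model.

    Each loop iteration counts as one verification pass. A real system would
    score all k drafts in a single batched forward pass; here that is simulated
    with a Python loop.
    """
    out, passes = list(prompt), 0
    while len(out) - len(prompt) < n:
        guesses = draft_heads(out, k)
        passes += 1
        ctx = list(out)
        for guess in guesses:
            target = base_next(ctx)
            ctx.append(target)  # always keep the base model's own token
            if guess != target:
                break  # first mismatch: keep the correction, drop later drafts
        out = ctx
    return out[: len(prompt) + n], passes
```

Because every kept token is the base model's own choice, `mtp_decode` returns the same sequence as `greedy_decode`; only the number of passes changes. In this toy setup the saving depends entirely on how often the drafted tokens are accepted.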