A frozen Multi-Token Prediction head now accelerates Gemini Nano inference on Pixel devices. This technique allows the model to predict multiple future tokens simultaneously without retraining the base weights. It reduces latency for on-device tasks. Developers can expect faster local responses without sacrificing the model's existing reasoning capabilities or increasing memory overhead.