Google researchers implemented frozen Multi-Token Prediction (MTP) to accelerate Gemini Nano on Pixel devices. The technique lets the model predict several future tokens per decoding step while the core weights stay frozen, so the base model is not retrained. Producing more than one token per step reduces inference latency for on-device tasks. Practitioners can therefore get faster local text generation without sacrificing model accuracy or increasing power consumption.
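
To make the idea concrete, here is a minimal toy sketch of multi-token prediction over a frozen base model. It is not Google's implementation, and the summary above does not describe the actual mechanism. The sketch assumes a draft-then-verify scheme: extra prediction heads propose several tokens, and the frozen base model checks them so the final output matches ordinary greedy decoding. All names (`base_next_token`, `draft_tokens`, `mtp_decode`), the toy vocabulary, and the injected error pattern are hypothetical.

```python
from typing import List, Sequence, Tuple

VOCAB = 50  # toy vocabulary size (assumption for illustration)


def base_next_token(context: Sequence[int]) -> int:
    """Toy stand-in for the frozen base model: a deterministic greedy next token."""
    return (sum(context[-3:]) * 7 + 3) % VOCAB


def draft_tokens(context: Sequence[int], k: int) -> List[int]:
    """Toy stand-in for extra prediction heads that guess k future tokens.

    The guesses are deliberately wrong at some positions (whenever the
    context length is a multiple of 4) to mimic imperfect draft heads.
    """
    guess_ctx = list(context)
    drafts = []
    for _ in range(k):
        tok = base_next_token(guess_ctx)
        if len(guess_ctx) % 4 == 0:
            tok = (tok + 1) % VOCAB  # injected error, always differs from the true token
        drafts.append(tok)
        guess_ctx.append(tok)
    return drafts


def greedy_decode(prompt: Sequence[int], n_new: int) -> List[int]:
    """Baseline: one token per step from the base model."""
    seq = list(prompt)
    for _ in range(n_new):
        seq.append(base_next_token(seq))
    return seq


def mtp_decode(prompt: Sequence[int], n_new: int, k: int = 4) -> Tuple[List[int], int]:
    """Draft k tokens per step, keep the prefix the base model agrees with.

    Returns the generated sequence and the number of decoding steps used.
    In a real model the verification would be one batched forward pass;
    here the toy base model is simply called once per position.
    """
    seq = list(prompt)
    target_len = len(prompt) + n_new
    steps = 0
    while len(seq) < target_len:
        steps += 1
        for tok in draft_tokens(seq, k):
            expected = base_next_token(seq)
            # Always append the base model's token: it equals an accepted
            # draft, or it is the correction for a rejected one.
            seq.append(expected)
            if tok != expected or len(seq) >= target_len:
                break  # discard the remaining drafts for this step
    return seq, steps
```

Calling `mtp_decode([1, 2, 3], 20)` and `greedy_decode([1, 2, 3], 20)` should return the same token sequence, with the multi-token version needing fewer decoding steps. In this toy setup the base function is never modified, which mirrors the "frozen" part of the technique; only the drafting component would be trained in a real system.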