Google researchers used frozen Multi-Token Prediction to accelerate Gemini Nano on Pixel devices. This technique reduces the computational overhead of predicting multiple future tokens during inference. It maintains model performance while improving speed on mobile hardware. Developers can now deploy more responsive on-device LLMs without sacrificing accuracy or increasing power consumption.