Google researchers implemented frozen Multi-Token Prediction (MTP) to accelerate Gemini Nano on Pixel devices. By predicting several tokens per decoding step rather than one, the technique reduces computational overhead during inference without sacrificing model accuracy. As a result, on-device LLMs generate text faster, and developers can now deploy more responsive local AI features within the strict power and memory constraints of mobile hardware.
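
One common way to use multi-token prediction at inference time is draft-and-verify decoding: lightweight prediction heads guess several upcoming tokens, and the base model checks those guesses in a single pass, keeping only the prefix it agrees with. The summary above does not describe Google's actual implementation, so the following is only a toy sketch of that general idea. Every name in it (`base_next_token`, `draft_tokens`, `mtp_decode`) is hypothetical, the "model" is a simple arithmetic function standing in for a network, and reading "frozen" as "the base model is left unchanged" is an assumption.

```python
from typing import List, Tuple

VOCAB_SIZE = 50  # arbitrary toy vocabulary size


def base_next_token(context: List[int]) -> int:
    """Toy stand-in for the (assumed frozen) base model's greedy next token."""
    return (sum(context) * 31 + len(context)) % VOCAB_SIZE


def draft_tokens(context: List[int], k: int) -> List[int]:
    """Toy stand-in for multi-token prediction heads: guess the next k tokens.

    The guesses are made deliberately wrong at some positions so that the
    verification step has something to reject.
    """
    ctx = list(context)
    guesses = []
    for _ in range(k):
        tok = base_next_token(ctx)
        if len(ctx) % 4 == 3:  # inject an occasional bad guess
            tok = (tok + 1) % VOCAB_SIZE
        guesses.append(tok)
        ctx.append(tok)
    return guesses


def greedy_decode(prompt: List[int], n: int) -> Tuple[List[int], int]:
    """Baseline: one base-model pass per generated token."""
    out = list(prompt)
    passes = 0
    while len(out) - len(prompt) < n:
        out.append(base_next_token(out))
        passes += 1
    return out[len(prompt):], passes


def mtp_decode(prompt: List[int], n: int, k: int = 4) -> Tuple[List[int], int]:
    """Draft k tokens, then verify them against the base model.

    A real system would score all k draft positions in one batched forward
    pass; here that is simulated and counted as a single pass.
    """
    out = list(prompt)
    passes = 0
    while len(out) - len(prompt) < n:
        draft = draft_tokens(out, k)
        passes += 1  # one simulated verification pass per draft
        for tok in draft:
            expected = base_next_token(out)
            out.append(expected)  # always keep the base model's own token
            if tok != expected:
                break  # first mismatch: discard the rest of the draft
    return out[len(prompt):len(prompt) + n], passes


if __name__ == "__main__":
    prompt = [1, 2, 3]
    baseline, baseline_passes = greedy_decode(prompt, 12)
    fast, fast_passes = mtp_decode(prompt, 12, k=4)
    print("same output:", fast == baseline)
    print("base-model passes:", baseline_passes, "vs", fast_passes)
```

Because only tokens the base model itself would have produced are kept, the output in this sketch matches plain greedy decoding exactly. The saving comes from needing fewer sequential base-model passes whenever the drafts are mostly right.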