Google researchers implemented frozen Multi-Token Prediction (MTP) to accelerate Gemini Nano on Pixel devices. The technique lets the model predict several future tokens in a single step instead of one, without increasing inference latency. It is "frozen" in that it builds on the model's existing pre-trained weights rather than retraining them, which is how it improves on-device performance. For practitioners, this means small language models can run faster and more efficiently on mobile hardware without sacrificing accuracy.
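
To make the idea concrete, here is a minimal toy sketch in Python. It is not Gemini Nano's implementation: the "base model" and the extra prediction heads are arithmetic stand-ins, every name (`base_next_token`, `draft_tokens`, `mtp_decode`, `VOCAB_SIZE`) is hypothetical, and the draft-then-verify loop is an assumption about how accuracy can be preserved rather than something the summary above confirms. The sketch only shows why checking drafted tokens against an unchanged base model yields the same output as ordinary one-token-at-a-time decoding.

```python
from typing import List, Tuple

VOCAB_SIZE = 50  # toy vocabulary size (assumption for illustration)


def base_next_token(context: List[int]) -> int:
    """Hypothetical stand-in for the frozen base model's greedy next token."""
    return (3 * context[-1] + context[-2] + 1) % VOCAB_SIZE


def draft_tokens(context: List[int], k: int) -> List[int]:
    """Hypothetical stand-in for k extra prediction heads.

    The first head agrees with the base model. Later heads are deliberately
    wrong whenever the previous drafted token is a multiple of 4, to
    simulate imperfect guesses about tokens further ahead.
    """
    ctx = list(context)
    drafts: List[int] = []
    for j in range(k):
        tok = base_next_token(ctx)
        if j > 0 and drafts[-1] % 4 == 0:
            tok = (tok + 1) % VOCAB_SIZE  # simulated head error
        drafts.append(tok)
        ctx.append(tok)
    return drafts


def greedy_decode(prompt: List[int], n: int) -> List[int]:
    """Baseline: one token per decoding step."""
    out = list(prompt)
    for _ in range(n):
        out.append(base_next_token(out))
    return out[len(prompt):]


def mtp_decode(prompt: List[int], n: int, k: int = 4) -> Tuple[List[int], int]:
    """Draft k tokens per step, keep the prefix the base model agrees with.

    Returns the generated tokens and the number of decoding steps used.
    """
    out = list(prompt)
    steps = 0
    while len(out) - len(prompt) < n:
        steps += 1
        for tok in draft_tokens(out, k):
            expected = base_next_token(out)  # check against the frozen base model
            out.append(expected)  # equals tok on a match, a correction otherwise
            if tok != expected or len(out) - len(prompt) == n:
                break
    return out[len(prompt):], steps


if __name__ == "__main__":
    tokens, steps = mtp_decode([1, 2], 20)
    print("same output as greedy:", tokens == greedy_decode([1, 2], 20))
    print("decoding steps used:", steps, "for", len(tokens), "tokens")
```

In this toy the verification calls `base_next_token` once per drafted token. Draft-and-verify schemes generally score all drafted positions in one batched forward pass instead, which is where the speedup would come from.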