A 26-billion-parameter model called DiffusionGemma generates text from noise rather than token by token. It hits 1,000 tokens per second on a single H100 GPU, quadrupling the speed of autoregressive rivals. Quality remains lower than standard LLMs. Google positions this as an experimental tool for developers to test non-autoregressive inference.