Google DeepMind introduced Decoupled DiLoCo, a training method that removes the need for strict synchronization between distributed workers. This approach prevents a single slow node from stalling the entire training process. It allows models to learn efficiently across unstable networks. Practitioners can now scale training across geographically dispersed hardware without risking total system hangs.