Google DeepMind introduced Decoupled DiLoCo, a training method that removes the need for constant synchronization between distributed model replicas. This approach allows training to continue even during network failures or high latency. It reduces communication overhead significantly. Practitioners can now scale training across unstable networks without risking total job failure.