Google DeepMind introduced Decoupled DiLoCo to enable distributed AI training across unstable, low-bandwidth networks. The method allows workers to train independently and synchronize periodically, reducing communication overhead. This architecture prevents a single slow node from stalling the entire cluster. It enables large-scale model training on fragmented hardware without requiring high-speed interconnects.