Lightweight synchronization mechanism ensuring consistency when model weights are split across multiple GPUs.
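
The description does not say what "consistency" means or how it is enforced, so the following is only a minimal sketch under one assumption: that consistency means every shard has applied the same number of updates. The names `Shard`, `stale_ranks`, and `apply_update` are hypothetical and not taken from the project, and the GPUs are simulated with plain Python objects.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class Shard:
    """One slice of the model's weights, as held by a single (simulated) GPU."""
    rank: int
    weights: List[float]
    version: int = 0  # assumption: counts the updates this shard has applied


def stale_ranks(shards: List[Shard]) -> List[int]:
    """Return the ranks whose version differs from the newest shard's version."""
    newest = max(s.version for s in shards)
    return [s.rank for s in shards if s.version != newest]


def apply_update(shards: List[Shard], deltas: List[List[float]]) -> None:
    """Apply one delta per shard, refusing to start if the shards disagree."""
    if len(deltas) != len(shards):
        raise ValueError("expected exactly one delta per shard")
    lagging = stale_ranks(shards)
    if lagging:
        raise RuntimeError(f"shards out of sync, stale ranks: {lagging}")
    for shard, delta in zip(shards, deltas):
        # Element-wise update of this shard's slice, then bump its version.
        shard.weights = [w + d for w, d in zip(shard.weights, delta)]
        shard.version += 1
```

In a real multi-GPU setting the version comparison would presumably be a collective operation across devices rather than a local loop; the sketch only illustrates the check itself, which costs one integer per shard and no weight transfer.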