Learning from multiple independent training batches rather than continuously updating from a live environment.
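To make the definition concrete, here is a minimal sketch that assumes a reinforcement-learning reading of it. The definition itself does not name a setting or an algorithm, so tabular Q-learning is only an illustrative choice. The function `fit_from_batches`, the transition tuple layout, and the two logged batches are hypothetical, introduced only for this example. The point it shows is that the learner only replays data collected beforehand and never queries an environment.

```python
from collections import defaultdict


def fit_from_batches(batches, n_actions, alpha=0.5, gamma=0.9, epochs=50):
    """Tabular Q-learning over pre-collected batches (hypothetical helper).

    Each batch is a list of (state, action, reward, next_state, done) tuples.
    No environment is stepped here: every update comes from logged data.
    """
    q = defaultdict(lambda: [0.0] * n_actions)
    for _ in range(epochs):
        for batch in batches:  # batches are independent, fixed datasets
            for state, action, reward, next_state, done in batch:
                # Bootstrapped target; terminal transitions use the reward alone.
                target = reward if done else reward + gamma * max(q[next_state])
                q[state][action] += alpha * (target - q[state][action])
    return q


# Hypothetical logged data, e.g. gathered in two separate collection runs.
batch_a = [(0, 1, 0.0, 1, False), (1, 1, 1.0, 2, True)]
batch_b = [(0, 0, 0.0, 0, False), (1, 0, 0.0, 0, False)]

q_table = fit_from_batches([batch_a, batch_b], n_actions=2)

# Greedy policy read off the learned table for the two non-terminal states.
greedy = {s: max(range(2), key=lambda a: q_table[s][a]) for s in (0, 1)}
```

A continuously updating learner would instead interleave acting and learning, choosing each next transition with its current policy. Here the data is fixed, so the quality of the result depends on how well the batches cover the relevant states and actions.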