DataParallelPlugin¶

class pytorch_lightning.plugins.training_type.DataParallelPlugin(parallel_devices=None, checkpoint_io=None)[source]¶

Implements data-parallel training in a single process, i.e., the model gets replicated to each device and each gets a split of the data.

barrier(*args, **kwargs)[source]¶

Synchronizes all processes which blocks processes until the whole group enters this function.

broadcast(obj, src=0)[source]¶

Broadcasts an object to all processes.

Parameters

Return type

object

model_to_device()[source]¶

Moves the model to the correct device.

reduce(collection, *args, **kwargs)[source]¶

Reduces a collection of tensors from all processes. It can be applied to just a single tensor.

Parameters

collection¶ (Union[Metric, Tensor, int, float, Mapping[str, Union[Metric, Tensor, int, float]]]) – The collection of tensors to sync and reduce.
*args¶ – ignored for DP
**kwargs¶ – ignored for DP

Return type

Returns

Reduced tensor values or the same value if it was not or did not contain a tensor.

reduce_boolean_decision(decision)[source]¶

Reduce the early stopping decision across all processes.

Called by the accelerator to finish setup.

This method is called to teardown the training process.

It is the right place to release memory and free other resources.