Default for SupervisedTrainer slows down training almost 2x

https://github.com/Project-MONAI/MONAI/blob/d388d1c6fec8cb3a0eebee5b5a0b9776ca59ca83/monai/engines/trainer.py#L121-L123

The default behaviour of `SupervisedTrainer` slows down each training step considerably (almost 2x in our case). The cause of the slow-down is not apparent until each step is profiled. With larger batch sizes, the delay from the default decollation step grows, as more tensors have to move between GPU and CPU.

<img width="1047" height="702" alt="Image" src="https://github.com/user-attachments/assets/790a1857-839d-4551-8c6a-5801cd8b5967" />
This flame chart shows two training steps, each consisting of a forward pass, a backward pass, and the decollate step.

It may be better to set the default for `decollate` in the supervised trainer to `False`, or to add a warning that the default behaviour can significantly increase training time (in our case, disabling decollation gave a 2x speed-up, from 4 days to 2 days).
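As a workaround until the default changes, decollation can be switched off per trainer. The following is a minimal sketch, assuming the standard `SupervisedTrainer` constructor arguments; `build_trainer` is a hypothetical helper, and the loader, network, optimizer, and loss are whatever the caller already has.

```python
def build_trainer(device, max_epochs, train_loader, network, optimizer, loss_function):
    """Hypothetical helper: build a SupervisedTrainer with decollation disabled."""
    # Imported lazily so this module can be loaded without MONAI installed.
    from monai.engines import SupervisedTrainer

    return SupervisedTrainer(
        device=device,
        max_epochs=max_epochs,
        train_data_loader=train_loader,
        network=network,
        optimizer=optimizer,
        loss_function=loss_function,
        decollate=False,  # skip the per-step decollate (the default is True)
    )
```

Note that the docstring recommends `decollate=True` when `postprocessing` uses components from `monai.transforms`, so this setting is probably only appropriate when no such postprocessing is configured.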

Default for SupervisedTrainer slows down training almost 2x #8541

Docstring for the `decollate` argument, at the permalink above:

	decollate: whether to decollate the batch-first data to a list of data after model computation,
	recommend `decollate=True` when `postprocessing` uses components from `monai.transforms`.
	default to `True`.
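The docstring describes decollation as turning batch-first data into a list of per-sample items. Here is a pure-Python sketch of that idea, assuming a dict of plain lists in place of tensors. `decollate_sketch` is a hypothetical helper for illustration only; it is not MONAI's `decollate_batch`.

```python
def decollate_sketch(batch):
    """Split a batch-first dict into one dict per sample (illustration only)."""
    # Every value is assumed to be a sequence whose first axis is the batch axis.
    batch_size = len(next(iter(batch.values())))
    return [
        {key: values[i] for key, values in batch.items()}
        for i in range(batch_size)
    ]


# A toy batch of two samples, using lists instead of tensors.
batch = {"image": [[0.1, 0.2], [0.3, 0.4]], "label": [0, 1]}
samples = decollate_sketch(batch)
```

In this sketch the work is done once per sample and per key, which is consistent with the observation that the decollate cost grows with batch size.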
