Fix `EMAModel.restore()` foreach path crashing with device mismatch when model is on GPU by Dev-X25874 · Pull Request #13782 · huggingface/diffusers

Dev-X25874 · 2026-05-21T09:25:54Z

What does this PR do?

Fixes a runtime crash in EMAModel.restore() when foreach=True and the model lives on a non-CPU device (e.g. CUDA).

store() always saves parameters to CPU (param.detach().cpu().clone()). The foreach path in restore() then passed those raw CPU tensors directly to torch._foreach_copy_(), which requires all tensors to be on the same device:

# before (broken on GPU)
torch._foreach_copy_(
    [param.data for param in parameters],
    [c_param.data for c_param in self.temp_stored_params],  # always CPU
)

This raises RuntimeError: Expected all tensors to be on same device for any user who calls the standard EMA validation pattern (store → copy_to → restore) with foreach=True on a GPU machine.

The fix mirrors the pattern already used correctly in copy_to()'s foreach path (line 780), which moves each shadow param to the target device before the copy:

# after (matches copy_to() pattern)
torch._foreach_copy_(
    [param.data for param in parameters],
    [c_param.to(param.device).data for c_param, param in zip(self.temp_stored_params, parameters)],
)

Also adds test_store_restore to both EMAModelTests and EMAModelTestsForeach — the store/restore round-trip was completely untested prior to this PR.

Before submitting

This PR fixes a typo or improves the docs (you can dismiss the other checks if that's the case).
Did you read the contributor guideline?
Did you read our philosophy doc (important for complex PRs)?
Was this discussed/approved via a GitHub issue or the forum? Please add a link to it if that's the case.
Did you make sure to update the documentation with your changes? Here are the documentation guidelines, and here are tips on formatting docstrings.
Did you write any new necessary tests?

Who can review?

@sayakpaul

…ice mismatch on GPU

…n-foreach EMAModel

Dev-X25874 added 2 commits May 21, 2026 14:20

training_utils: fix EMAModel.restore() foreach path crashing with dev…

1cfb2e6

…ice mismatch on GPU

tests/ema: add store/restore round-trip tests for both foreach and no…

0f3528c

…n-foreach EMAModel

github-actions Bot added tests size/S PR with diff < 50 LOC labels May 21, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix `EMAModel.restore()` foreach path crashing with device mismatch when model is on GPU#13782

Fix `EMAModel.restore()` foreach path crashing with device mismatch when model is on GPU#13782
Dev-X25874 wants to merge 2 commits into
huggingface:mainfrom
Dev-X25874:fix/ema-restore-foreach-device-mismatch

Dev-X25874 commented May 21, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

Dev-X25874 commented May 21, 2026

What does this PR do?

Before submitting

Who can review?

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant