Skip to content

[PyTorch] Add optional caller-provided output/grad-input buffers to GroupedLinear and fused grouped MLP#3161

Open
phu0ngng wants to merge 6 commits into
NVIDIA:mainfrom
phu0ngng:pyt-gg-w-symm
Open

[PyTorch] Add optional caller-provided output/grad-input buffers to GroupedLinear and fused grouped MLP#3161
phu0ngng wants to merge 6 commits into
NVIDIA:mainfrom
phu0ngng:pyt-gg-w-symm

Use 256-aligned splits in caller-buffer grouped MLP test

5df6866
Select commit
Loading
Failed to load commit list.