feat(windows): add Vulkan GPU detection for Intel Arc and other non-CUDA GPUs by darthcav · Pull Request #930 · docker/model-runner

darthcav · 2026-05-21T11:59:13Z

Summary

Fixes #925 — Docker Model Runner does not use Intel Arc GPU via Vulkan on Windows.

Root cause

CanUseGPU in gpuinfo_windows.go only detects NVIDIA CUDA (amd64) and Qualcomm Adreno OpenCL (arm64). There is no Vulkan path, so Intel Arc and other Vulkan-capable GPUs (AMD without CUDA, etc.) are silently ignored and fall back to the cpu llama.cpp variant.

Changes

gpuinfo_windows.go

hasVulkanCapableGPU() — uses PCI enumeration (via ghw) to find GPUs that are neither NVIDIA nor Adreno; Intel Arc, AMD GPUs, and similar fall into this category.
hasVulkan() — probes vulkan-1.dll via syscall.LoadLibrary, mirroring the existing hasOpenCL / OpenCL.dll pattern. Returns true only when a Vulkan-capable GPU is present and the Vulkan runtime is loadable.
CanUseGPU() — on amd64, after the CUDA check, now also calls hasVulkan().
hasNVIDIAGPU(), hasSupportedAdrenoGPU(), hasVulkanCapableGPU() — added nil guards for DeviceInfo, Vendor, and Product fields before dereferencing .Name, preventing panics when ghw fails to retrieve full PCI details for a card.
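
The nil guards in the last item could look roughly like the sketch below. It is illustrative only: `pciName`, `deviceInfo`, `card`, and `cardNames` are hypothetical stand-ins that mirror the DeviceInfo / Vendor / Product / Name shape described above, not the real ghw types or the PR's actual code.

```go
package main

import "fmt"

// Hypothetical stand-in types. They only mirror the field shape named in
// the PR description (DeviceInfo -> Vendor/Product -> Name) and are NOT
// the real ghw types.
type pciName struct{ Name string }

type deviceInfo struct {
	Vendor  *pciName
	Product *pciName
}

type card struct{ DeviceInfo *deviceInfo }

// cardNames returns the vendor and product names of a card. It reports
// ok=false when any pointer along the way is nil, so callers can skip
// the card instead of panicking on a nil dereference.
func cardNames(c *card) (vendor, product string, ok bool) {
	if c == nil || c.DeviceInfo == nil ||
		c.DeviceInfo.Vendor == nil || c.DeviceInfo.Product == nil {
		return "", "", false
	}
	return c.DeviceInfo.Vendor.Name, c.DeviceInfo.Product.Name, true
}

func main() {
	cards := []*card{
		{}, // no DeviceInfo at all
		{DeviceInfo: &deviceInfo{Vendor: &pciName{Name: "Intel Corporation"}}}, // no Product
		{DeviceInfo: &deviceInfo{
			Vendor:  &pciName{Name: "Intel Corporation"},
			Product: &pciName{Name: "Intel(R) Arc(TM) Graphics"},
		}},
	}
	for i, c := range cards {
		vendor, product, ok := cardNames(c)
		if !ok {
			fmt.Printf("[%d] skipped: incomplete PCI details\n", i)
			continue
		}
		fmt.Printf("[%d] %s / %s\n", i, vendor, product)
	}
}
```

Funnelling all three detection functions through one accessor like this would keep the guard in a single place, though the PR may well inline the checks instead.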

download_windows.go

Adds canUseVulkan detection in ensureLatestLlamaCpp.
Priority order: CUDA > Vulkan > CPU (OpenCL stays arm64-only).
Selects desiredVariant = "vulkan" when Vulkan is detected.
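
As a rough illustration of the priority order above on amd64, the selection could be written as a small pure function. `pickVariant` is a hypothetical helper introduced here for clarity; the PR itself sets desiredVariant inside ensureLatestLlamaCpp, and the arm64 OpenCL path is deliberately left out of this sketch.

```go
package main

import "fmt"

// pickVariant is a hypothetical helper showing the amd64 priority order
// CUDA > Vulkan > CPU. The variant strings match the names used in the
// PR description; the function itself is not part of the PR.
func pickVariant(canUseCUDA, canUseVulkan bool) string {
	switch {
	case canUseCUDA:
		return "cuda"
	case canUseVulkan:
		return "vulkan"
	default:
		return "cpu"
	}
}

func main() {
	// A machine with both an NVIDIA GPU and a Vulkan runtime still gets CUDA.
	fmt.Println(pickVariant(true, true))
	// An Intel Arc machine with vulkan-1.dll loadable gets Vulkan.
	fmt.Println(pickVariant(false, true))
	// Everything else falls back to the CPU build.
	fmt.Println(pickVariant(false, false))
}
```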

Known limitation / follow-up required

No vulkan image variant of docker/docker-model-backend-llamacpp currently exists on Docker Hub (tags available: cpu, cuda, opencl, rocm, metal, generic). When the vulkan tag is not found, downloadLatestLlamaCpp logs a warning and falls back to the vendored Docker Desktop binary — which already ships ggml-vulkan.dll. A TODO comment in download_windows.go marks this gap.

A follow-up task to publish docker/docker-model-backend-llamacpp:latest-vulkan (a Windows build with the Vulkan backend compiled in) is needed to complete the end-to-end fix.

Test plan

Build compiles cleanly for GOOS=windows GOARCH=amd64 ✓
Existing unit tests pass (go test ./pkg/inference/backends/llamacpp/...) ✓
Manual verification on a Windows machine with Intel Arc GPU: docker model run ai/smollm2 "Test" should show Vulkan backend in logs once the vulkan Docker Hub image variant is published.

🤖 Generated with Claude Code

sourcery-ai

Hey - I've found 1 issue, and left some high level feedback:

In hasVulkanCapableGPU, accessing gpu.DeviceInfo.Vendor.Name and gpu.DeviceInfo.Product.Name assumes these nested fields are always non-nil; consider adding nil checks to avoid panics on unexpected ghw data.
The Adreno/Qualcomm detection in hasVulkanCapableGPU is case-sensitive while the vendor comparison is lowercased; normalizing product (and possibly using vendor IDs) would make the heuristic more robust and avoid misclassifying GPUs as Vulkan-capable.
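
A minimal sketch of the normalization suggested in the second point, assuming the product name is available as a plain string. `isAdrenoOrQualcomm` is a hypothetical helper name, and matching on PCI vendor IDs (the other option mentioned) is not shown.

```go
package main

import (
	"fmt"
	"strings"
)

// isAdrenoOrQualcomm is a hypothetical helper: it lowercases the product
// name once so the substring checks no longer depend on the exact casing
// reported by the driver.
func isAdrenoOrQualcomm(product string) bool {
	p := strings.ToLower(product)
	return strings.Contains(p, "adreno") || strings.Contains(p, "qualcomm")
}

func main() {
	for _, name := range []string{
		"Qualcomm(R) Adreno(TM) GPU",
		"QUALCOMM ADRENO",
		"Intel(R) Arc(TM) Graphics",
	} {
		fmt.Printf("%q -> %v\n", name, isAdrenoOrQualcomm(name))
	}
}
```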

Prompt for AI Agents

Please address the comments from this code review:

## Overall Comments
- In `hasVulkanCapableGPU`, accessing `gpu.DeviceInfo.Vendor.Name` and `gpu.DeviceInfo.Product.Name` assumes these nested fields are always non-nil; consider adding nil checks to avoid panics on unexpected `ghw` data.
- The Adreno/Qualcomm detection in `hasVulkanCapableGPU` is case-sensitive while the vendor comparison is lowercased; normalizing `product` (and possibly using vendor IDs) would make the heuristic more robust and avoid misclassifying GPUs as Vulkan-capable.

## Individual Comments

### Comment 1
<location path="pkg/inference/backends/llamacpp/gpuinfo_windows.go" line_range="111-113" />
<code_context>
+	if err != nil {
+		return false, err
+	}
+	for _, gpu := range gpus.GraphicsCards {
+		vendor := strings.ToLower(gpu.DeviceInfo.Vendor.Name)
+		product := gpu.DeviceInfo.Product.Name
+		isNVIDIA := vendor == "nvidia"
+		isAdreno := strings.Contains(product, "Adreno") || strings.Contains(product, "Qualcomm")
</code_context>
<issue_to_address>
**issue (bug_risk):** Guard against nil DeviceInfo/Product to avoid potential panics from ghw.GPU()

ghw may return GraphicsCards where `DeviceInfo`, `Vendor`, or `Product` is nil. Directly accessing `gpu.DeviceInfo.Vendor.Name` and `gpu.DeviceInfo.Product.Name` can therefore panic. Please add nil checks (e.g., skip entries with missing DeviceInfo/Product) before dereferencing these fields.
</issue_to_address>

gemini-code-assist

Code Review

This pull request introduces Vulkan GPU detection for Windows amd64 systems as a fallback when CUDA is not available. It includes logic to identify Vulkan-capable hardware and verify the presence of the Vulkan runtime library. A critical review comment identifies potential nil pointer dereferences in the hardware discovery logic, recommending defensive checks to ensure stability.

ericcurtin · 2026-05-21T12:54:01Z

@darthcav Go may not be your coding language, but this PR is perfectly clean :) The important thing is that you tested it and confirmed that it works. Did you test it?

darthcav · 2026-05-21T12:58:53Z

@ericcurtin The problem is that I cannot do a full compilation on my laptop and check it against the local GPU. I would need someone to compile that...

ericcurtin · 2026-05-21T13:11:14Z

> @ericcurtin The problem is that I cannot do a full compilation on my laptop and check it against the local GPU. I would need someone to compile that...

Well you could also use the coding agent you used for this to figure out how to build it.

I don't have an Intel Arc machine, so even if I wanted to, I can't test this.

YasharthPanwar-2003 · 2026-05-24T07:44:33Z

Hi @ericcurtin and @darthcav !

I have a Windows machine running an Intel i7 processor with Intel Xe Graphics. Can I help you test it? Let me know whether my hardware is applicable and what the best way is for me to compile or run it to help you verify the fix.

ericcurtin · 2026-05-27T13:19:21Z

@YasharthPanwar-2003 sure that would be helpful, thanks

ericcurtin · 2026-05-27T13:19:51Z

@darthcav could you rebase in the meantime to get this green?

darthcav · 2026-05-27T17:11:42Z

@ericcurtin I will look into that tomorrow.

…UDA GPUs

CanUseGPU on Windows only checked NVIDIA CUDA (amd64) and Qualcomm Adreno OpenCL (arm64), so Intel Arc and other Vulkan-capable GPUs were silently ignored and fell back to the CPU llama.cpp variant.

Add hasVulkanCapableGPU (PCI-based, excludes NVIDIA and Adreno which are handled by their own backends) and hasVulkan (probes vulkan-1.dll, mirroring the existing OpenCL.dll probe). Update CanUseGPU to call hasVulkan on amd64 when no CUDA GPU is found, and wire up a "vulkan" variant in ensureLatestLlamaCpp with priority CUDA > Vulkan > CPU.

A TODO marks the point where a "vulkan" image variant of docker/docker-model-backend-llamacpp needs to be published to Docker Hub to complete the end-to-end fix.

Fixes docker#925

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

darthcav · 2026-05-28T07:36:32Z

@ericcurtin Rebase done!

darthcav · 2026-05-28T10:02:52Z

@ericcurtin I managed to compile dmr on Windows, and it correctly detects the Intel GPU.

I also created a small snippet to compile and test Vulkan-compatible GPUs (see below).

It would be nice if this can be accepted and integrated into the next version of Docker Desktop.

Testing GPU availability

The following snippet should verify that the PR works locally on Windows. Just compile:

// Command gpu-probe reports GPU detection results for the model-runner backend.
// Run on Windows to verify CUDA, Vulkan, or OpenCL detection before deploying dmr.
package main

import (
	"context"
	"flag"
	"fmt"
	"os"

	"github.com/docker/model-runner/pkg/inference/backends/llamacpp"
	"github.com/jaypipes/ghw"
)

func main() {
	nvBin := flag.String("nv-gpu-info", "", "path to com.docker.nv-gpu-info.exe (only needed for NVIDIA GPUs)")
	flag.Parse()

	fmt.Println("=== GPU probe ===")

	gpus, err := ghw.GPU()
	if err != nil {
		fmt.Fprintf(os.Stderr, "warning: could not enumerate GPUs: %v\n", err)
	} else {
		fmt.Printf("Graphics cards found: %d\n", len(gpus.GraphicsCards))
		for i, card := range gpus.GraphicsCards {
			if card.DeviceInfo == nil {
				fmt.Printf("  [%d] (no device info)\n", i)
				continue
			}
			vendor := "(unknown vendor)"
			if card.DeviceInfo.Vendor != nil {
				vendor = card.DeviceInfo.Vendor.Name
			}
			product := "(unknown product)"
			if card.DeviceInfo.Product != nil {
				product = card.DeviceInfo.Product.Name
			}
			fmt.Printf("  [%d] %s – %s\n", i, vendor, product)
		}
	}

	fmt.Println()
	ok, err := llamacpp.CanUseGPU(context.Background(), *nvBin)
	if err != nil {
		fmt.Fprintf(os.Stderr, "CanUseGPU error: %v\n", err)
		os.Exit(1)
	}
	if ok {
		fmt.Println("Result: GPU acceleration available")
	} else {
		fmt.Println("Result: no supported GPU found — will use CPU")
	}
}

And then run:

.\gpu-probe.exe

You should get something like:

=== GPU probe ===
Graphics cards found: 1
  [0] Intel Corporation – Intel(R) Arc(TM) Graphics

Result: GPU acceleration available

darthcav · 2026-05-28T10:04:12Z

> Hi @ericcurtin and @darthcav!
>
> I have a Windows machine running an Intel i7 processor with Intel Xe Graphics. Can I help you test it? Let me know whether my hardware is applicable and what the best way is for me to compile or run it to help you verify the fix.

See my comment above.

sourcery-ai Bot reviewed May 21, 2026

gemini-code-assist Bot reviewed May 21, 2026

darthcav mentioned this pull request May 21, 2026

Bug Report: Docker Model Runner Does Not Use Intel Arc GPU via Vulkan on Windows #925

Open

darthcav and others added 2 commits May 28, 2026 09:02

fix: add nil guards for DeviceInfo fields in GPU detection functions

d652be9

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

darthcav force-pushed the feat/vulkan-gpu-detection branch from 1b2038b to d652be9 Compare May 28, 2026 07:03
