drivers/vfio/pci/nvgrace-gpu

NVIDIA Grace Hopper and Grace Blackwell GPU passthrough

Glue code that lets a hypervisor pass an entire NVIDIA Grace Hopper or Grace Blackwell GPU (GH200, GB200, GB300) through to a guest virtual machine. These are NVIDIA's flagship 2024-2025 data-center superchips for large-scale AI training and inference; cloud and HPC operators partition such systems with KVM/QEMU and assign GPUs to individual tenants.

keep conf=0.87 deploy=low replacement=none subsystem=vfio category=virtualization

recommendation

This driver should stay in the kernel: it is what allows NVIDIA's current-generation Grace Hopper (GH200) and Grace Blackwell (GB200/GB300) AI superchips to be assigned to virtual machines. The code was added in 2024, gained GB300 device IDs in 2025, and is still receiving feature work, so it tracks hardware NVIDIA actively sells for data-center AI workloads. Deployment is niche today because this is enterprise AI infrastructure rather than consumer hardware, but the hardware family is still gaining SKUs and the driver is still gaining features.

repository signals

3 files
1,292 source lines
21 commits, 5y
+1,432 / −127 lines added / removed, 5y
6 authors, 5y
monthly commits · 2021-04-21 → 2026-04-21 · 21 total · active in 9/61 months
Months with commits (every other month from 2021-04 through 2026-04 had 0 commits, +0 −0):
2024-02: 2 commits · +902 −1
2024-10: 1 commit · +2 −0
2025-01: 4 commits · +147 −22
2025-07: 1 commit · +2 −0
2025-08: 1 commit · +1 −1
2025-09: 1 commit · +2 −0
2025-11: 8 commits · +264 −96
2025-12: 1 commit · +2 −2
2026-01: 2 commits · +110 −5
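As a cross-check, the headline figures above (21 commits, +1,432 / −127, active in 9/61 months) can be recomputed from the per-month data. The following is a minimal sketch; the names `ACTIVE_MONTHS`, `months_in_window`, and `summarize` are illustrative and not part of any tool used for this report.

```python
# Non-zero months copied from the monthly commit data above:
# month -> (commits, lines added, lines removed)
ACTIVE_MONTHS = {
    "2024-02": (2, 902, 1),
    "2024-10": (1, 2, 0),
    "2025-01": (4, 147, 22),
    "2025-07": (1, 2, 0),
    "2025-08": (1, 1, 1),
    "2025-09": (1, 2, 0),
    "2025-11": (8, 264, 96),
    "2025-12": (1, 2, 2),
    "2026-01": (2, 110, 5),
}


def months_in_window(start, end):
    """Count calendar months from start to end inclusive ('YYYY-MM' strings)."""
    start_year, start_month = map(int, start.split("-"))
    end_year, end_month = map(int, end.split("-"))
    return (end_year - start_year) * 12 + (end_month - start_month) + 1


def summarize(months):
    """Sum commits and line churn across all listed months."""
    return {
        "commits": sum(c for c, _, _ in months.values()),
        "added": sum(a for _, a, _ in months.values()),
        "removed": sum(r for _, _, r in months.values()),
        "active_months": len(months),
    }


summary = summarize(ACTIVE_MONTHS)
window = months_in_window("2021-04", "2026-04")
print(summary)
print(f"active in {summary['active_months']}/{window} months")
```

The totals agree with the figures reported under repository signals, so the summary numbers and the monthly breakdown are internally consistent.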

sources

  1. git.kernel.org

    Driver was added in 2024 specifically for Grace Hopper GPU assignment via VFIO, indicating a new-generation platform rather than legacy carryover.

  2. git.kernel.org

    Upstream added GB300 device IDs in 2025, showing ongoing enablement for newer Grace Blackwell SKUs rather than retirement.

  3. git.kernel.org

    The directory received substantive maintenance in 2026, including poison-handling support, with no sign of removal-only churn.

  4. nvidia.com

    NVIDIA’s Grace Hopper product page says GH200 is currently available.

  5. nvidia.com

    NVIDIA markets GB200 Grace Blackwell systems as current products, confirming the hardware family remains in new deployments.

codex reasoning notes (technical)

Local shell inspection of Kconfig/main.c identified this as a real VFIO PCI driver for NVIDIA Grace Hopper/Blackwell GPU passthrough to KVM/QEMU. `lei` was unavailable and MCP lore tools were not exposed here, so upstream activity was established from local `git log`, and the commits were mapped to canonical kernel.org commit URLs from recall rather than from a lore lookup. NVIDIA product URLs were obtained via web search results. Evidence points to active development, new SKU additions (GB200/GB300), and current product availability; deployment is likely niche/enterprise AI virtualization rather than mass-market, so keep rather than deprecate.
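One way the per-month activity could be derived from local `git log` output is sketched below. This is an assumption about the approach, not the tooling actually used for this report: the function name `monthly_activity`, the `@` header marker, and the sample log text are all hypothetical. The sketch expects text produced by a command such as `git log --since="5 years ago" --date=format:%Y-%m --pretty=format:@%ad --numstat -- drivers/vfio/pci/nvgrace-gpu`.

```python
from collections import defaultdict


def monthly_activity(log_text):
    """Aggregate (commits, added, removed) per 'YYYY-MM' month.

    Each commit is expected to start with a header line '@YYYY-MM',
    followed by --numstat lines of the form 'added<TAB>removed<TAB>path'.
    """
    stats = defaultdict(lambda: [0, 0, 0])
    month = None
    for line in log_text.splitlines():
        if not line.strip():
            continue  # git emits blank separator lines; skip them
        if line.startswith("@"):
            month = line[1:].strip()
            stats[month][0] += 1  # one header line per commit
            continue
        parts = line.split("\t")
        if month is None or len(parts) != 3:
            continue  # not a numstat line
        added, removed, _path = parts
        if added == "-" or removed == "-":
            continue  # numstat prints '-' for binary files
        stats[month][1] += int(added)
        stats[month][2] += int(removed)
    return {m: tuple(v) for m, v in sorted(stats.items())}


# Made-up sample input for illustration only; not real history.
sample = (
    "@2000-01\n"
    "10\t2\ta.c\n"
    "-\t-\tblob.bin\n"
    "\n"
    "@2000-01\n"
    "5\t0\ta.c\n"
    "@2000-03\n"
    "1\t1\tb.c\n"
)
result = monthly_activity(sample)
print(result)  # {'2000-01': (2, 15, 2), '2000-03': (1, 1, 1)}
```

Months with no commits do not appear in the result, so a caller that wants the "active in N of M months" figure would still need to count the months in the reporting window separately.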