Learning path

InfiniBand for AI Fabrics

Skill Level
Fundamentals
Duration 1 hour 30 minutes
Updated Jun 15, 2026

About this learning path

Understand InfiniBand AI fabric through its lossless architecture, SHARP in-network computing, and real-world economics. Then experience a full operational lifecycle from day-zero design through UFM deployment and predictive maintenance, reinforced with hands-on lab practice. Learn how self-driving operations and InfiniBand technologies are shaping the next generation of AI factories.

Your instructors

Prerequisites

  1. A basic understanding of primary network concepts, VLANs, IP addresses, and Gateways.
  2. Having a conceptual knowledge of network fabric and fabric design is preferable.

What you'll learn

  1. Explain InfiniBand's architectural advantages over Ethernet for AI and HPC workloads, including the Subnet Manager's role in discovery, addressing, and routing, and core constructs such as GUIDs, LIDs, LFTs, PKeys, Virtual Lanes, and HCAs.
  2. Describe how InfiniBand delivers high-performance GPU communication through native RDMA, hardware transport offload, lossless transport, Adaptive Routing, and Congestion Control.
  3. Recognize the purpose and benefits of AI-focused fabric capabilities, including GPUDirect RDMA, SHARP in-network computing, and self-healing network resilience.
  4. Navigate the NVIDIA UFM interface and locate key operational views, including topology maps, device inventories, port and cable status, partitions, telemetry, events, alarms, and system health dashboards.
  5. Interpret fabric health information, link status, telemetry metrics, and fault conditions to identify common indicators of fabric issues such as degraded links, cable problems, firmware inconsistencies, and non-optimal link negotiations.
  6. Understand how UFM supports topology validation, change tracking, performance monitoring, and troubleshooting to enable automated, scalable operation of large-scale InfiniBand fabrics in AI environments.
Learning path
Collapse all
InfiniBand for AI Fabrics
  1. 1. Is InfiniBand the Right Choice for AI Workloads?
    1. Enroll in this learning path to view locked contentIs InfiniBand the Right Choice for AI Workloads?
      Article
      Locked
  2. 2. The Easiest AI Fabric You'll Ever Run: InfiniBand and UFM
    1. Enroll in this learning path to view locked contentThe Easiest AI Fabric You'll Ever Run: InfiniBand and UFM
      Video
      Locked
  3. 3. InfiniBand Fabrics for AI
    1. Enroll in this learning path to view locked contentInfiniBand Fabrics for AI
      Lab
      Locked
  4. 4. The Future of InfiniBand as an AI Fabric
    1. Enroll in this learning path to view locked contentThe Future of InfiniBand as an AI Fabric
      Article
      Locked
  5. 5. Conclusion
    1. Enroll in this learning path to view locked contentQuiz
      Quiz
      Locked
    2. Enroll in this learning path to view locked contentLearning Path Complete
      Achievement Badge
      Locked