Explore
12 results found
HPE Private Cloud AI
This learning path takes you through HPE Private Cloud AI (HPE PCAI). We guide you through the components that make up the solution, such as HPE GreenLake, Private Cloud AI, HPE Morpheus VM Essentials, the GreenLake for Files storage array, HPE Ezmeral Container Platform, and Aruba/NVIDIA switches. Hands-on labs then take you into both of our physical HPE Private Cloud AI environments, a small setup and a medium setup.
Learning Path
•Intermediate
Cisco Secure AI Factory with NVIDIA: Infrastructure Operations
In this learning path, we explore the software-based orchestration and management of the Cisco Secure AI Factory data center. We cover Cisco Intersight, a SaaS-based management platform that allows a data center engineer to orchestrate and automate physical and virtual infrastructure. The path also includes a lab on the Cisco Nexus Dashboard Fabric Controller.
Learning Path
•Introductory
InfiniBand for AI Fabrics
Understand InfiniBand AI fabric through its lossless architecture, SHARP in-network computing, and real-world economics. Then experience a full operational lifecycle from day-zero design through UFM deployment and predictive maintenance, reinforced with hands-on lab practice. Learn how self-driving operations and InfiniBand technologies are shaping the next generation of AI factories.
Learning Path
•Fundamentals
NVIDIA DGX BasePOD
In this learning path, we cover NVIDIA's DGX systems and BasePOD infrastructure, detailing the setup, licensing, and management of Base Command Manager and DGX OS for high-performance AI workloads. We explain hardware requirements, network configurations, and system provisioning, emphasizing efficient resource management, scalability, and optimized AI model training across NVIDIA's computing platforms.
Learning Path
•Fundamentals
NVIDIA GPU Operator for Kubernetes
This learning path introduces the deployment and lifecycle of AI infrastructure. Explore the NVIDIA GPU Operator, a Kubernetes-native tool that automates the installation of drivers, container runtimes, and device plugins. It is aimed at DevOps engineers who want to streamline high-performance, GPU-accelerated clusters at scale.
Learning Path
•Introductory
ATC+
Building Cisco RoCE fabric for AI/ML using Nexus Dashboard
In this learning path, you will learn the components of RoCE (RDMA over Converged Ethernet) and why it is essential for clean, fast, and reliable AI/ML compute communication.
Learning Path
•Fundamentals
ATC+
NVIDIA DGX SuperPOD and DGX BasePOD Day 2 Operations
This Learning Series was created for NVIDIA DGX admins and operators to explore the tasks you would perform on Day 2 when administering your NVIDIA DGX SuperPOD and BasePOD environments with BCM (Base Command Manager). It details how to update firmware, patch systems, run jobs against the infrastructure, and integrate other components into BCM (switches, AD, cloud, etc.).
Learning Path
•Intermediate
ATC+
NVIDIA DGX SuperPOD and DGX BasePOD Day 3 Operations
This Learning Series was created for NVIDIA DGX admins and operators to explore the tasks you would perform on Day 3 when administering your NVIDIA DGX SuperPOD and BasePOD environments with BCM (Base Command Manager). It covers advanced topics such as cmshell, cloud bursting from BCM, HA for head nodes, IB setup and testing of worker nodes, and Active Directory integrations, as well as advanced workload topics such as deploying Kubernetes from Base Command Manager.
Learning Path
•Advanced
ATC+
NVIDIA AI Enterprise
NVIDIA AI Enterprise (NVAIE) offers a robust suite of AI tools for various applications, including reasoning, speech and translation, biomedical, content generation, and route planning. It features community, NVIDIA, and custom models. NVAIE provides essential microservices such as NIM and CUDA-X, along with security advisories, enterprise support, cluster management, and infrastructure optimization. Designed for cloud, data center, workstation, and edge environments, NVAIE supports scalable, secure, and efficient AI deployment.
Learning Path
•Fundamentals
ATC+
Vector Stores
This learning path covers vector search from concept to practice. Articles explain vectors, embeddings, similarity metrics, and vector store software — including how to choose the right database and index type. The hands-on lab then stress-tests embedding models, compares distance metrics, evaluates models of different sizes, and builds a framework for tuning chunking and measuring retrieval quality.
Learning Path
•Intermediate
NVIDIA Run:ai for Platform Engineers
Welcome to the NVIDIA Run:ai for Platform Engineers Learning Path! This learning path is designed to build both foundational knowledge and practical skills for platform engineers and administrators responsible for managing GPU resources at scale. It begins by introducing learners to the key components of the NVIDIA Run:ai platform, including its Control Plane and Cluster, and explains how NVIDIA Run:ai extends Kubernetes to orchestrate AI workloads efficiently. The learning path then covers essential topics such as authentication and role-based access, organizational management through projects and departments, and workload operations using assets, templates, and policies. Learners will also explore GPU fractioning to understand how NVIDIA Run:ai maximizes GPU utilization and ensures fair resource allocation across teams. All this builds toward a hands-on lab experience designed to reinforce your learning and give you practical experience working directly with NVIDIA Run:ai.
Learning Path
•Fundamentals
Introduction to NVIDIA NIM for LLM
This learning path introduces NVIDIA NIM for LLM microservices, covering its purpose, formats, and benefits. You'll explore deployment options via API Catalog, Docker, and Kubernetes, and complete hands-on labs for Docker and Kubernetes-based inference workflows—building skills to deploy, scale, and integrate GPU-optimized LLMs into enterprise applications.
Learning Path
•Fundamentals