Domain 1 - Installation and Deployment
Exam Weight: 31%
Architecture sequencing, stack installation, and initial validation for NVIDIA AI operations platforms.
Domain Overview
Deploy the platform in the correct order and verify each layer before onboarding users, projects, and workloads.
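The "verify each layer before moving on" idea can be sketched as an ordered gate that stops at the first failing layer. This is a minimal illustration, not a vendor tool: the function name, the layer names, and the placeholder checks are all assumptions made for the example, and the order shown is only one plausible sequence.

```python
from typing import Callable, List, Optional, Tuple

Check = Callable[[], bool]


def verify_in_order(layers: List[Tuple[str, Check]]) -> Tuple[List[str], Optional[str]]:
    """Run each layer's check in sequence and stop at the first failure.

    Returns the names of the layers that passed and the name of the
    first failing layer (None if every layer passed).
    """
    passed: List[str] = []
    for name, check in layers:
        if not check():
            return passed, name
        passed.append(name)
    return passed, None


# Hypothetical layers with placeholder checks. Real checks would query
# the actual systems; these lambdas only simulate pass/fail results.
example_layers: List[Tuple[str, Check]] = [
    ("hardware and firmware", lambda: True),
    ("management and monitoring stack", lambda: True),
    ("scheduler", lambda: False),  # simulated failure
    ("users, projects, and quotas", lambda: True),
]

if __name__ == "__main__":
    passed, failed = verify_in_order(example_layers)
    print("passed:", passed)
    print("first failing layer:", failed)
```

Because the gate returns at the first failure, later layers such as user onboarding are never checked on top of a broken foundation.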
Objectives
- Describe deployment architecture and sequence.
- Verify hardware and software requirements.
- Configure and validate hardware firmware and software.
- Implement the NVIDIA management and monitoring stack (NVIDIA BCM, NVIDIA Mission Control, and NVIDIA UFM).
- Deploy the NVIDIA BCM toolkit by using package management, virtual machines, and/or Docker containers.
- Install and configure the NVIDIA Run:ai, Slurm, and Kubernetes schedulers.
- Provision users, projects, and quotas.
- Configure and validate NVIDIA NGC private registry and NGC API key.
- Install, configure and validate NVIDIA NIM and TensorRT-LLM.
- Configure and validate inference backend and endpoint.
- Install and configure NVIDIA Magnum IO and workload.
- Install and configure the NVIDIA Container Toolkit on worker nodes.
- Install and configure DOCA services on the DPU Arm cores by using a package manager and/or containers.
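As a first validation pass for several of the objectives above, one simple approach is to confirm that the expected command-line tools are on a node's PATH. The sketch below assumes a hypothetical mapping of components to binaries. The binary names (`nvidia-smi`, `nvidia-ctk`, `kubectl`, `sinfo`) are the standard ones for those components, but which tools belong on which node depends on the deployment, and finding a binary does not prove the component is configured correctly.

```python
import shutil
from typing import Callable, Dict, List, Optional

# Assumed mapping of component -> binary expected on PATH.
# Adjust per node role; this list is illustrative only.
EXPECTED_TOOLS: Dict[str, str] = {
    "NVIDIA driver": "nvidia-smi",
    "NVIDIA Container Toolkit": "nvidia-ctk",
    "Kubernetes client": "kubectl",
    "Slurm client": "sinfo",
}


def missing_tools(
    expected: Dict[str, str],
    which: Callable[[str], Optional[str]] = shutil.which,
) -> List[str]:
    """Return the component labels whose binary is not found on PATH."""
    return [label for label, binary in expected.items() if which(binary) is None]


if __name__ == "__main__":
    missing = missing_tools(EXPECTED_TOOLS)
    if missing:
        print("missing:", ", ".join(missing))
    else:
        print("all expected tools found on PATH")
```

The `which` parameter is injectable so the lookup can be exercised without the tools installed.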