Domain 1 - System and Server Bring-up
Exam Weight: 31%Deployment sequence, topology awareness, firmware/hardware validation, and initial storage integration.
Domain Overview
Build operational readiness from first power-on through validated hardware state for AI infrastructure nodes.
Objectives
- Describe sequence of events for deployment and validation.
- Describe network topologies for AI factories.
- Perform initial configuration of BMC, OOB, and TPM.
- Perform firmware upgrades (including on HGX) and fault detection.
- Validate power and cooling parameters.
- Install GPU-based servers (SMI).
- Validate installed hardware.
- Describe and validate cable types and transceivers.
- Install physical GPUs.
- Validate hardware operation for workloads.
- Configure initial parameters for third-party storage.