Hardware Status

Viewing Inventory and LEDs

Procedure


Step 1

From the Navigation Pane, select Hardware Status > Inventory and LEDs.

Step 2

You can view the following properties:

Table 1. LED Light Control

Name

Description

Power status field

Displays the current power status of the system.

System identity LED toggle button

Toggles the system identity LED on or off to help locate the system.

Table 2. System

Name

Description

ID column

Displays the unique identifier for each system.

Hardware type column

Indicates the type of hardware for each system.

Health column

Shows the current health status of each system.

Identify LED column

Indicates whether the identify LED is on or off for each system.

Serial number field

Displays the serial number of the system.

Model field

Displays the model of the system.

Asset tag field

Displays the asset tag of the system.

Status (State) field

Indicates the current state of the system.

Power field

Displays the current power status of the system.

Health rollup field

Shows the overall health status of the system.

Manufacturer field

Displays the manufacturer of the system.

Description field

Provides a brief description of the system.

Sub model field

Displays the sub model of the system.

System type field

Indicates the type of system.

Memory summary

Provides a summary of the system memory.

Status (State) field

Indicates the current state of the memory.

Health field

Shows the current health status of the memory.

Health rollup field

Shows the overall health status of the memory.

Total system memory field

Displays the total memory available in the system.

Processor summary

Provides a summary of the system processors.

Status (State) field

Indicates the current state of the processor.

Health field

Shows the current health status of the processor.

Health rollup field

Shows the overall health status of the processor.

Count field

Displays the number of processors in the system.

Core count field

Displays the number of cores per processor.

Table 3. BMC Manager

Name

Description

ID column

Displays the unique identifier for each BMC manager entry.

Health column

Shows the current health status of the BMC manager.

Name field

Cisco Integrated Management Controller

Model field

Displays the model of the BMC manager.

UUID field

Displays the UUID of the BMC manager.

Service entry point UUID field

Displays the service entry point UUID of the BMC manager.

Status (State) field

Indicates the current state of the BMC manager.

Power field

Displays the current power status of the BMC manager.

Health rollup field

Shows the overall health status of the BMC manager.

BMC date and time field

Displays the current date and time of the BMC.

Last reset time field

Displays the last reset time of the BMC.

Description field

Provides a brief description of the BMC manager.

Manager type field

Indicates the type of the BMC manager.

Firmware version field

Displays the firmware version of the BMC manager.

OEM firmware version

BIOS field

Displays the BIOS version.

SCM FPGA field

Displays the SCM FPGA version.

MB FPGA field

Displays the MB FPGA version.

HIB FPGA field

Displays the HIB FPGA version.

RoT field

Displays the RoT version.

Graphical console

Connect types supported field

Displays the supported connection types for the graphical console.

Max concurrent sessions field

Displays the maximum number of concurrent sessions for the graphical console.

Service enabled field

Indicates whether the service for the graphical console is enabled.

Serial console

Connect types supported field

Displays the supported connection types for the serial console.

Max concurrent sessions field

Displays the maximum number of concurrent sessions for the serial console.

Service enabled field

Indicates whether the service for the serial console is enabled.

Table 4. Chassis

Name

Description

ID column

Displays the unique identifier for each chassis entry.

Health column

Shows the current health status of the chassis.

Following properties are displayed for FRU_CHASSIS, FRU_CPUSLED, FRU_SCM, and FRU_SYS:

Board build date field

Displays the build date of the board.

Board manufacturer field

Displays the manufacturer of the board.

Board product field

Displays the product name of the board.

Board part number field

Displays the part number of the board.

Board serial number field

Displays the serial number of the board.

Board extra field

Displays any additional information about the board.

Product manufacturer field

Displays the manufacturer of the product.

Product name field

Displays the product name.

Product part number field

Displays the part number of the product.

Product serial number field

Displays the serial number of the product.

Product version field

Displays the version of the product.

Product extra field

Displays any additional information about the product.

Product asset tag field

Displays the asset tag of the product.

Chassis type field

Indicates the type of the chassis.

Chassis part number field

Displays the part number of the chassis.

Chassis serial number field

Displays the serial number of the chassis.

Chassis extra field

Displays any additional information about the chassis.

Health rollup field

Shows the overall health status of the chassis.

Table 5. DIMM Slot

Name

Description

ID column

Displays the unique identifier for each DIMM slot entry.

Health column

Shows the current health status of the DIMM slot.

Location number column

Indicates the location number of the DIMM slot.

Part number field

Displays the part number of the DIMM.

Serial number field

Displays the serial number of the DIMM.

Capacity MiB field

Displays the capacity of the DIMM in MiB.

Status (State) field

Indicates the current state of the DIMM slot.

Enabled field

Indicates whether the DIMM slot is enabled.

Description field

Provides a brief description of the DIMM.

Memory type field

Displays the type of memory of the DIMM.

Base module type field

Indicates the base module type of the DIMM.

Bus width bits field

Displays the bus width of the DIMM in bits.

Data width bits field

Displays the data width of the DIMM in bits.

Operating speed Mhz field

Displays the operating speed of the DIMM in MHz.

Table 6. Storage

Name

Description

ID column

Displays the unique identifier for each storage entry.

Health column

Shows the current health status of the storage.

StorageControllers (Name) field

Displays the name of the storage controller.

StorageControllers (FirmwareVersion) field

Displays the firmware version of the storage controller.

Description field

Provides a brief description of the storage.

SpeedGbps field

Displays the speed of the storage in Gbps.

Model field

Displays the model of the storage.

Status (State) field

Indicates the current state of the storage.

SerialNumber field

Displays the serial number of the storage.

Table 7. Fans

Name

Description

ID column

Displays the unique identifier for each fan entry.

Health column

Shows the current health status of the fan.

Name field

Displays the name of the fan.

Part number field

Displays the part number of the fan.

Fan speed field

Displays the speed of the fan in RPM.

Status (State) field

Indicates the current state of the fan.

Status (Health rollup) field

Shows the overall health status of the fan.

Table 8. Power Supplies

Name

Description

ID column

Displays the unique identifier for each power supply entry.

Health column

Shows the current health status of the power supply.

Name field

Displays the name of the power supply.

Part number field

Displays the part number of the power supply.

Serial number field

Displays the serial number of the power supply.

Spare part number field

Displays the spare part number of the power supply.

Model field

Displays the model of the power supply.

Status (State) field

Indicates the current state of the power supply.

Manufacturer field

Displays the manufacturer of the power supply.

Table 9. Processors

Name

Description

ID column

Displays the unique identifier for each processor entry.

Health column

Shows the current health status of the processor.

Name field

Displays the name of the processor.

Part number field

Displays the part number of the processor.

Serial number field

Displays the serial number of the processor.

Model field

Displays the model of the processor.

Asset tag field

Displays the asset tag of the processor.

Status (State) field

Indicates the current state of the processor.

Manufacturer field

Displays the manufacturer of the processor.

Processor type field

Indicates the type of the processor.

Processor architecture field

Displays the architecture of the processor.

Instruction set field

Displays the instruction set supported by the processor.

Max speed MHz field

Displays the maximum speed of the processor in MHz.

Total cores field

Displays the total number of cores in the processor.

Total threads field

Displays the total number of threads in the processor.

Table 10. Network Adapters

Name

Description

ID column

Displays the unique identifier for each network adapter entry.

Health column

Shows the current health status of the network adapter.

Name field

Displays the name of the network adapter.

Vendor field

Displays the vendor of the network adapter.

Serial number field

Displays the serial number of the network adapter.

Part number field

Displays the part number of the network adapter.

Manufacturer field

Displays the manufacturer of the network adapter.

Firmware version field

Displays the firmware version of the network adapter.

Status (State) field

Indicates the current state of the network adapter.


CPU Monitoring and Management

Overview

The CPU is a central component of the system responsible for executing all computational tasks. The BMC monitors aspects of the CPU such as temperature and power consumption to ensure operation within normal ranges. This helps prevent overheating and hardware failures, ensuring system stability and reliability. Additionally, it allows administrators to understand the system's workload and adjust resource allocation as needed, optimizing performance and response times.

Anomaly Detection and Response

If the CPU encounters anomalies, such as excessively high temperatures, the BMC can monitor and provide warnings, enabling administrators to promptly respond and troubleshoot issues.

Monitored and Controlled Features

The BMC monitors and controls the following CPU features:

  • Get CPU temperature

  • Get CPU current power consumption

  • Get CPU maximum power capping

  • Get CPU current power capping

  • Set CPU power capping

Cooling Management

The BMC is responsible for managing the cooling system by overseeing temperature sensors and regulating fan speeds based on a Fan Algorithm crafted by the thermal engineering team.

BMC Boot Process and Default Fan Control

During the BMC boot process, if temperature readings from components are not successfully acquired, the BMC implements default fan control. In this scenario, all fans operate at a duty cycle of 80% until temperature data from all system sensors is accessible.

Fan Algorithm Activation

When the fan algorithm is active, the BMC manages fan speeds or initiates a system shutdown under specific conditions:

  • Condition for full speed fan operation:

    • A temperature of the component exceeds a specified threshold.

    • Temperature reading fails for more than 60 seconds.

    • A firmware update is initiated.

    • GPU does not align with system specifications.

  • Conditions for shut down the system:

    • Temperature exceeds a critical threshold for more than 60 seconds.

    • Temperature exceeds the specified threshold (UNR).

Managing Fan Failure Conditions

The system fans are divided into three fan zones, each serving a specific cooling function:

  • Fan Zone #1: Used for GPU sled cooling.

  • Fan Zone #2: Used for CPU sled cooling.

  • Fan Zone #3: Used for SSD cooling.

Fan Failure Response

When a fan failure condition occurs, the BMC sets all remaining fans to run at full speed or shuts down the system. Once the fan failure condition is cleared, the BMC restores fan speed according to the fan control algorithm. Fan failures are classified into the following scenarios:

  • Fan Zone #1 Failure Conditions:

    • If either one or both fan rotors in the same fan are below the specified threshold (LC), all remaining fans in Fan Zone #1 run at full speed.

    • If two fan rotors are below the threshold (LC) in different fans, or if three or more fan rotors are below the threshold (LC), the system shuts down.

  • Fan Zone #2 Failure Conditions:

    • If one fan rotor is below the specified threshold (LC), all remaining fans in Fan Zone #2 run at full speed.

    • If two fan rotors are below the threshold (LC), the system shuts down.

  • Fan Zone #3 Failure Conditions:

    • If one fan rotor is below the specified threshold (LC), all remaining fans in Fan Zone #3 run at full speed.

    • If two fan rotors are below the threshold (LC), the system shuts down.

Viewing Sensor Status

The BMC monitors key system sensors, including temperature, power, fan speeds, and logical sensors. These sensors provide real-time values and statuses, accessible through the GUI .

Procedure


Step 1

From the Navigation Pane, select Hardware Status > Sensors.

Step 2

Select of the following tabs to view the properties:

  • POWER SUPPLY

  • Fan

  • Temperature

  • CPU

  • GPU

  • Event

You can view the following sensor properties:

Table 11. Threshold Sensors/Discrete Sensors

Name

Description

Name column

Displays the name of the sensor.

Status column

Shows the current status of the sensor.

Lower critical field

Displays the lower critical threshold value for the sensor.

Lower warning field

Displays the lower warning threshold value for the sensor.

Current value field

Displays the current value measured by the sensor.

Upper warning field

Displays the upper warning threshold value for the sensor.

Upper critical field

Displays the upper critical threshold value for the sensor.


Turning On/Off System Identify LED

Procedure


Step 1

From the Navigation Pane, select Hardware Status > Inventory and LEDs.

Step 2

Under LED light control, toggle the System identity LED button on or off to help locate the system.