Hardware Status

Viewing Inventory and LEDs

Procedure


Step 1

From the Navigation Pane, select Hardware Status > Inventory and LEDs.

Step 2

You can view the following properties:

Table 1. LED Light Control

Name

Description

Power status field

Displays the current power status of the system.

System identity LED toggle button

Toggles the system identity LED on or off to help locate the system.

Table 2. System

Name

Description

ID column

Displays the unique identifier for each system.

Hardware type column

Indicates the type of hardware for each system.

Health column

Shows the current health status of each system.

Identify LED column

Indicates whether the identify LED is on or off for each system.

Serial number field

Displays the serial number of the system.

Model field

Displays the model of the system.

Asset tag field

Displays the asset tag of the system.

Status (State) field

Indicates the current state of the system.

Power field

Displays the current power status of the system.

Health rollup field

Shows the overall health status of the system.

Manufacturer field

Displays the manufacturer of the system.

Description field

Provides a brief description of the system.

Sub model field

Displays the sub model of the system.

System type field

Indicates the type of system.

Memory summary

Status (State) field

Indicates the current state of the memory.

Health field

Shows the current health status of the memory.

Health rollup field

Shows the overall health status of the memory.

Total system memory field

Displays the total memory available in the system.

Processor summary

Status (State) field

Indicates the current state of the processor.

Health field

Shows the current health status of the processor.

Health rollup field

Shows the overall health status of the processor.

Count field

Displays the number of processors in the system.

Core count field

Displays the number of cores per processor.

Table 3. BMC Manager

Name

Description

ID column

Displays the unique identifier for each BMC manager entry.

Health column

Shows the current health status of the BMC manager.

Name field

Displays the name of the BMC manager: Cisco Integrated Management Controller.

Model field

Displays the model of the BMC manager.

UUID field

Displays the UUID of the BMC manager.

Service entry point UUID field

Displays the service entry point UUID of the BMC manager.

Status (State) field

Indicates the current state of the BMC manager.

Power field

Displays the current power status of the BMC manager.

Health rollup field

Shows the overall health status of the BMC manager.

BMC date and time field

Displays the current date and time of the BMC.

Last reset time field

Displays the last reset time of the BMC.

Description field

Provides a brief description of the BMC manager.

Manager type field

Indicates the type of the BMC manager.

Firmware version field

Displays the firmware version of the BMC manager.

OEM firmware version

BIOS field

Displays the BIOS version.

SCM FPGA field

Displays the SCM FPGA version.

MB FPGA field

Displays the MB FPGA version.

HIB FPGA field

Displays the HIB FPGA version.

RoT field

Displays the RoT version.

Graphical console

Connect types supported field

Displays the supported connection types for the graphical console.

Max concurrent sessions field

Displays the maximum number of concurrent sessions for the graphical console.

Service enabled field

Indicates whether the service for the graphical console is enabled.

Serial console

Connect types supported field

Displays the supported connection types for the serial console.

Max concurrent sessions field

Displays the maximum number of concurrent sessions for the serial console.

IPMI Service enable field

Indicates whether the IPMI service for the serial console is enabled.

SSH Service enable field

Indicates whether the SSH service for the serial console is enabled.

Table 4. Chassis

Name

Description

ID column

Displays the unique identifier for each chassis entry.

Health column

Shows the current health status of the chassis.

The following properties are displayed for FRU_CHASSIS, FRU_CPUSLED, FRU_SCM, and FU_SYS:

Product manufacturer field

Displays the manufacturer of the product.

Product part number field

Displays the part number of the product.

Product version field

Displays the version of the product.

Product asset tag field

Displays the asset tag of the product.

Product name field

Displays the product name.

Product serial number field

Displays the serial number of the product.

Product extra field

Displays any additional information about the product.

Chassis type field

Displays the type of chassis, such as rack or blade.

Note

This field is available only for FRU_CHASSIS and FU_SYS.

Health rollup field

Shows the overall health status of the chassis.

Table 5. DIMM Slot

Name

Description

Location number column

Indicates the location number of the DIMM slot.

Health column

Shows the current health status of the DIMM slot.

Capacity MiB field

Displays the capacity of the DIMM in MiB.

Serial number field

Displays the serial number of the DIMM.

Operating speed MHz field

Displays the operating speed of the DIMM in MHz.

Table 6. Storage

Name

Description

ID column

Displays the unique identifier for each storage entry.

Health column

Shows the current health status of the storage.

StorageControllers (Name) field

Displays the name of the storage controller.

StorageControllers (FirmwareVersion) field

Displays the firmware version of the storage controller.

Description field

Provides a brief description of the storage.

SpeedGbps field

Displays the speed of the storage in Gbps.

Model field

Displays the model of the storage.

Status (State) field

Indicates the current state of the storage.

SerialNumber field

Displays the serial number of the storage.

Manufacturer field

Displays the manufacturer of the storage.

The following properties are displayed for all RAID drives on any controller:

Name field

Displays the name of the RAID or the drive.

Firmware Version field

Displays the firmware version of the RAID or the drive.

Serial number field

Displays the serial number of the RAID or the drive.

Capacity GiB field

Displays the capacity of the drive in GiB.

Model field

Displays the model of the RAID or the drive.

Table 7. Fans

Name

Description

Name field

Displays the name of the fan.

Health column

Shows the current health status of the fan.

Part number field

Displays the part number of the fan.

Table 8. Power Supplies

Name

Description

ID column

Displays the unique identifier for each power supply entry.

Health column

Shows the current health status of the power supply.

Name field

Displays the name of the power supply.

Part number field

Displays the part number of the power supply.

Serial number field

Displays the serial number of the power supply.

Spare part number field

Displays the spare part number of the power supply.

Model field

Displays the model of the power supply.

Status (State) field

Indicates the current state of the power supply.

Manufacturer field

Displays the manufacturer of the power supply.

Table 9. Processors

Name

Description

ID column

Displays the unique identifier for each processor entry.

Health column

Shows the current health status of the processor.

Name field

Displays the name of the processor.

Part number field

Displays the part number of the processor.

Serial number field

Displays the serial number of the processor.

Model field

Displays the model of the processor.

Asset tag field

Displays the asset tag of the processor.

Status (State) field

Indicates the current state of the processor.

Manufacturer field

Displays the manufacturer of the processor.

Processor type field

Indicates the type of the processor.

Processor architecture field

Displays the architecture of the processor.

Instruction set field

Displays the instruction set supported by the processor.

Max speed MHz field

Displays the maximum speed of the processor in MHz.

Total cores field

Displays the total number of cores in the processor.

Total threads field

Displays the total number of threads in the processor.

Step field

Displays the stepping of the processor, that is, the revision of the processor microarchitecture, which reflects design improvements or updates.

Table 10. Network Adapters

Name

Description

ID column

Displays the unique identifier for each network adapter entry.

Health column

Shows the current health status of the network adapter.

Name field

Displays the name of the network adapter.

Vendor field

Displays the vendor of the network adapter.

Serial number field

Displays the serial number of the network adapter.

Part number field

Displays the part number of the network adapter.

Manufacturer field

Displays the manufacturer of the network adapter.

Model field

Displays the model of the network adapter.

Firmware version field

Displays the firmware version of the network adapter.

Status (State) field

Indicates the current state of the network adapter.

Port Information

Port

Specifies the name or identifier of the network port.

Port protocol

Indicates the communication protocol used by the port, such as Ethernet.

Link status

Displays whether the network link is active or inactive.

Link speed Mbps

Shows the current speed of the network link in megabits per second (Mbps).

MAC address

Provides the unique hardware identifier for the network interface.

Table 11. GPU

Name

Description

FRU Assembly field

Model field

Displays the model of the GPU.

Name field

Indicates the name assigned to the GPU assembly.

PartNumber field

Shows the part number for the GPU.

PhysicalContext

Displays the specific hardware component or subsystem where the item is located.

SerialNumber field

Provides the unique serial number of the GPU.

Vendor field

Identifies the vendor or manufacturer of the GPU.

Versions

Search field

Allows you to search for the component version.

Name field

Displays the name of the component.

Version field

Displays the firmware version of the component.


CPU Monitoring and Management

Overview

The CPU is a central component of the system responsible for executing all computational tasks. The BMC monitors aspects of the CPU such as temperature and power consumption to ensure operation within normal ranges. This helps prevent overheating and hardware failures, ensuring system stability and reliability. Additionally, it allows administrators to understand the system's workload and adjust resource allocation as needed, optimizing performance and response times.

Anomaly Detection and Response

If the CPU encounters an anomaly, such as an excessively high temperature, the BMC detects the condition and issues a warning so that administrators can respond and troubleshoot promptly.

Monitored and Controlled Features

The BMC monitors and controls the following CPU features:

  • Get CPU temperature

  • Get CPU current power consumption

  • Get CPU maximum power capping

  • Get CPU current power capping

  • Set CPU power capping

Cooling Management

The BMC manages the cooling system by reading the temperature sensors and regulating fan speeds according to a fan algorithm defined by the thermal engineering team.

BMC Boot Process and Default Fan Control

During the BMC boot process, if temperature readings from components are not successfully acquired, the BMC implements default fan control. In this scenario, all fans operate at a duty cycle of 80% until temperature data from all system sensors is accessible.
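The boot-time fallback described above can be sketched as follows. This is a minimal illustration, not the BMC firmware logic: the function name, the sensor names, and the convention of using None for a reading that has not yet been acquired are all assumptions made for the example. Only the 80% duty cycle comes from the text.

```python
from typing import Mapping, Optional

# From the text: all fans run at an 80% duty cycle until every sensor reports.
DEFAULT_BOOT_DUTY_PERCENT = 80


def boot_fan_duty(readings: Mapping[str, Optional[float]]) -> Optional[int]:
    """Return the default duty cycle while any temperature reading is missing.

    `readings` maps a (hypothetical) sensor name to its temperature, or to
    None if the value has not been acquired yet. The function returns None
    once every sensor has reported, meaning that control can pass to the
    fan algorithm.
    """
    # Assumption: an empty mapping means no sensor has reported yet.
    if not readings or any(value is None for value in readings.values()):
        return DEFAULT_BOOT_DUTY_PERCENT
    return None


# One sensor is still missing, so the default duty cycle applies.
print(boot_fan_duty({"cpu0": 45.0, "gpu0": None}))
```

Returning None rather than a duty value once all sensors report keeps the fallback separate from the fan algorithm itself, which this sketch does not attempt to model.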

Fan Algorithm Activation

When the fan algorithm is active, the BMC manages fan speeds or initiates a system shutdown under specific conditions:

  • Conditions for full-speed fan operation:

    • The temperature of a component exceeds a specified threshold.

    • Temperature reading fails for more than 60 seconds.

    • A firmware update is initiated.

    • The GPU does not align with system specifications.

  • Conditions for shutting down the system:

    • Temperature exceeds a critical threshold for more than 60 seconds.

    • Temperature exceeds the upper non-recoverable (UNR) threshold.
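The conditions listed above can be expressed as a small decision function. The sketch below is illustrative only: the `ThermalState` fields, the `FanAction` names, and the choice to evaluate shutdown conditions before full-speed conditions are assumptions, since the text does not state a precedence. The 60-second limits come from the list.

```python
from dataclasses import dataclass
from enum import Enum


class FanAction(Enum):
    NORMAL = "normal"          # follow the regular fan curve
    FULL_SPEED = "full_speed"  # drive all fans at full speed
    SHUTDOWN = "shutdown"      # shut down the system


@dataclass
class ThermalState:
    """Hypothetical snapshot of the inputs named in the list above."""
    over_threshold: bool = False        # a component exceeds its specified threshold
    read_failure_seconds: float = 0.0   # how long temperature reads have failed
    firmware_update_active: bool = False
    gpu_mismatch: bool = False          # GPU does not align with system specifications
    over_critical_seconds: float = 0.0  # time spent above the critical threshold
    over_unr: bool = False              # above the UNR threshold


LIMIT_SECONDS = 60  # from the text


def fan_action(state: ThermalState) -> FanAction:
    # Assumption: shutdown conditions take precedence over full-speed ones.
    if state.over_unr or state.over_critical_seconds > LIMIT_SECONDS:
        return FanAction.SHUTDOWN
    if (
        state.over_threshold
        or state.read_failure_seconds > LIMIT_SECONDS
        or state.firmware_update_active
        or state.gpu_mismatch
    ):
        return FanAction.FULL_SPEED
    return FanAction.NORMAL


# A firmware update alone raises the fans to full speed.
print(fan_action(ThermalState(firmware_update_active=True)))
```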

Managing Fan Failure Conditions

The system fans are divided into three fan zones, each serving a specific cooling function:

  • Fan Zone #1: Used for GPU sled cooling.

  • Fan Zone #2: Used for CPU sled cooling.

  • Fan Zone #3: Used for SSD cooling.

Fan Failure Response

When a fan failure condition occurs, the BMC sets all remaining fans to run at full speed or shuts down the system. Once the fan failure condition is cleared, the BMC restores fan speed according to the fan control algorithm. Fan failures are classified into the following scenarios:

  • Fan Zone #1 Failure Conditions:

    • If one or both rotors of a single fan fall below the lower critical (LC) threshold, all remaining fans in Fan Zone #1 run at full speed.

    • If two rotors in different fans fall below the LC threshold, or if three or more rotors fall below the LC threshold, the system shuts down.

  • Fan Zone #2 Failure Conditions:

    • If one fan rotor is below the specified threshold (LC), all remaining fans in Fan Zone #2 run at full speed.

    • If two fan rotors are below the threshold (LC), the system shuts down.

  • Fan Zone #3 Failure Conditions:

    • If one fan rotor is below the specified threshold (LC), all remaining fans in Fan Zone #3 run at full speed.

    • If two fan rotors are below the threshold (LC), the system shuts down.
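The per-zone rules above reduce to a count of failed rotors and, for Fan Zone #1, the number of distinct fans they belong to. The following sketch assumes a rotor is identified by a (fan name, rotor index) pair and that the caller has already determined which rotors are below the threshold. Those identifiers, the function name, and the string results are hypothetical. For Fan Zones #2 and #3 the text only covers one and two failed rotors, so treating more than two as a shutdown is also an assumption.

```python
from typing import Iterable, Tuple

# (fan name, rotor index); both are hypothetical identifiers for this example.
Rotor = Tuple[str, int]


def zone_failure_action(zone: int, failed: Iterable[Rotor]) -> str:
    """Map the failed rotors of one fan zone to 'normal', 'full_speed', or 'shutdown'."""
    if zone not in (1, 2, 3):
        raise ValueError(f"unknown fan zone: {zone}")

    failed_rotors = set(failed)  # ignore duplicate reports of the same rotor
    count = len(failed_rotors)
    if count == 0:
        return "normal"

    if zone == 1:
        # Zone 1 tolerates failures confined to a single fan.
        fans = {fan for fan, _ in failed_rotors}
        if len(fans) > 1 or count >= 3:
            return "shutdown"
        return "full_speed"

    # Zones 2 and 3: one failed rotor means full speed, two (or more,
    # by assumption) mean shutdown.
    return "shutdown" if count >= 2 else "full_speed"


# Both rotors of the same Zone 1 fan have failed.
print(zone_failure_action(1, [("FAN1", 0), ("FAN1", 1)]))
```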

Viewing Sensor Status

The BMC monitors key system sensors, including temperature, power, fan speed, and logical sensors. These sensors provide real-time values and statuses, which are accessible through the GUI.

Procedure


Step 1

From the Navigation Pane, select Hardware Status > Sensors.

Step 2

Select one of the following tabs to view the properties:

  • POWER SUPPLY

  • Fan

  • Temperature

  • CPU

  • GPU

  • Event

You can view the following sensor properties:

Table 12. Threshold Sensors/Discrete Sensors

Name

Description

Name column

Displays the name of the sensor.

Status column

Shows the current status of the sensor.

Lower critical field

Displays the lower critical threshold value for the sensor.

Lower warning field

Displays the lower warning threshold value for the sensor.

Current value field

Displays the current value measured by the sensor.

Upper warning field

Displays the upper warning threshold value for the sensor.

Upper critical field

Displays the upper critical threshold value for the sensor.
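To illustrate how the four threshold fields relate to the current value, the sketch below classifies a reading against them. It is not how the BMC derives the Status column: the status strings, the class and function names, and the use of strict comparisons are assumptions for the example. A threshold left as None is treated as not defined for that sensor.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class Thresholds:
    """The four threshold fields from the table; None means not defined."""
    lower_critical: Optional[float] = None
    lower_warning: Optional[float] = None
    upper_warning: Optional[float] = None
    upper_critical: Optional[float] = None


def classify(value: float, t: Thresholds) -> str:
    """Return a hypothetical status string for a sensor reading."""
    # Critical limits are checked first so they override warning limits.
    if t.lower_critical is not None and value < t.lower_critical:
        return "Critical"
    if t.upper_critical is not None and value > t.upper_critical:
        return "Critical"
    if t.lower_warning is not None and value < t.lower_warning:
        return "Warning"
    if t.upper_warning is not None and value > t.upper_warning:
        return "Warning"
    return "OK"


# Example thresholds for a temperature sensor (values are made up).
limits = Thresholds(lower_critical=5, lower_warning=10,
                    upper_warning=80, upper_critical=95)
print(classify(85, limits))
```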


Turning the System Identify LED On or Off

Procedure


Step 1

From the Navigation Pane, select Hardware Status > Inventory and LEDs.

Step 2

Under LED light control, toggle the System identity LED button on or off to help locate the system.