This chapter contains the following sections:
F0190
You see one of the following messages when this fault is raised:
This fault occurs when the memory array voltage exceeds the specified hardware voltage rating.
If you see this fault, take the following actions:
Review the SEL statistics on the DIMM to determine which threshold was crossed.
Monitor the memory array for further degradation.
Replace the power supply. Before replacing this component, see the server-specific Installation and Service Guide for prerequisites, safety recommendations, and warnings.
If the problem still persists, create a tech-support file and contact Cisco TAC.
Severity: major
Cause: voltage-problem
mibFaultCode: 190
mibFaultName: fltMemoryArrayVoltageThresholdCritical
moClass: memory:Array
Type: environmental
F0191
You see one of the following messages when this fault is raised:
This fault occurs when the memory array voltage has exceeded the specified hardware voltage rating. The high voltage might damage the memory hardware.
If you see this fault, take the following actions:
Review the SEL statistics on the DIMM to determine which threshold was crossed.
Monitor the memory array for further degradation.
Replace the power supply.
Before replacing this component, see the server-specific Installation and Service Guide for prerequisites, safety recommendations, and warnings.
If the problem still persists, create a tech-support file and contact Cisco TAC.
Severity: critical
Cause: voltage-problem
mibFaultCode: 191
mibFaultName: fltMemoryArrayVoltageThresholdNonRecoverable
moClass: memory:Array
Type: environmental
F0184
DIMM [Id] is degraded : Check or replace DIMM.
This fault occurs when a DIMM is in a degraded operability state. This state typically occurs when an excessive number of correctable ECC errors are reported on the DIMM by the server BIOS.
If you see this fault, take the following actions:
Monitor the DIMM for further ECC errors. If the high number of errors persists, there is a possibility of the DIMM becoming inoperable.
If the DIMM becomes inoperable, replace the DIMM. You can use the CIMC WebUI to locate the faulty DIMM.
Before replacing this component, see the server-specific Installation and Service Guide for prerequisites, safety recommendations, warnings, and procedures.
If the problem still persists, create a tech-support file and contact Cisco TAC.
Severity: warning
Cause: equipment-degraded
mibFaultCode: 184
mibFaultName: fltMemoryUnitDegraded
moClass: memory:Unit
Type: equipment
F0844
MEM_RSR3_STATUS: Memory riser 3 has been disabled due to a mixed or invalid memory riser configuration: Remove the riser and make sure the host CPU type supports the Memory Riser DDR type that is installed.
This fault indicates that the corresponding memory riser has been disabled.
If you see this fault, take the following actions:
Severity: critical
Cause: equipment-disabled
mibFaultCode: 844
mibFaultName: fltMemoryUnitDisabled
moClass: memory:Array
Type: equipment
F0502
You see one of the following messages when this fault is raised:
This fault indicates that a sensor has detected an unsupported DIMM in the server. For example, the model or vendor cannot be recognized.
If you see this fault, verify whether the DIMM is supported on the server configuration. If the DIMM is not supported on the server configuration, contact Cisco TAC.
Severity: warning
Cause: identity-unestablishable
mibFaultCode: 502
mibFaultName: fltMemoryUnitIdentityUnestablishable
moClass: memory:Unit
Type: equipment
F0185
DIMM [Id] is inoperable : Check or replace DIMM.
This fault indicates that the correctable or uncorrectable errors on a DIMM has reached a threshold. The DIMM might be inoperable.
If you see this fault, take the following actions:
Review the SEL statistics on the DIMM to determine which threshold was crossed.
If necessary, replace the DIMM. You can use the CIMC Web UI to locate the faulty DIMM.
Before replacing this component, see the server-specific Installation and Service Guide for prerequisites, safety recommendations, warnings, and procedures.
If the problem still persists, create a tech-support file and contact Cisco TAC.
Severity: major
Cause: equipment-inoperable
mibFaultCode: 185
mibFaultName: fltMemoryUnitInoperable
moClass: memory:Unit
F0187
You see one of the following messages when this fault is raised:
This fault occurs when the temperature of a memory unit on a server exceeds a critical threshold value.
The possible contributing factors are as follows:
Temperature extremes can cause Cisco UCS equipment to operate at reduced efficiency and cause various problems, including early degradation, failure of chips, and failure of equipment. In addition, extreme temperature fluctuations can cause CPUs to become loose in their sockets.
Cisco UCS equipment must operate in an environment that provides an inlet air temperature not colder than 50F (10C) nor hotter than 95F (35C).
If sensors on a CPU reach 179.6F (82C), the system takes the CPU offline.
If you see this fault, take the following actions:
Review the product specifications to determine the temperature operating range of the server.
Review the Cisco UCS Site Preparation Guide to ensure that the servers have adequate airflow, including front and back clearance.
Verify that the airflow to the server is not obstructed.
Verify that the site cooling system is operating properly.
Clean the installation site at regular intervals to avoid a buildup of dust and debris, which can cause a system to overheat.
If the problem still persists, create a tech-support file and contact Cisco TAC.
Severity: warning
Cause: thermal-problem
mibFaultCode: 187
mibFaultName: fltMemoryUnitThermalThresholdCritical
moClass: memory:Unit
Type: environmental
F0186
You see one of the following messages when this fault is raised:
This fault occurs when the temperature of a memory unit on a server exceeds a non-critical threshold value, but is still below the critical threshold.
The possible contributing factors are as follows:
Temperature extremes can cause Cisco UCS equipment to operate at reduced efficiency and cause various problems, including early degradation, failure of chips, and failure of equipment. In addition, extreme temperature fluctuations can cause CPUs to become loose in their sockets.
Cisco UCS equipment must operate in an environment that provides an inlet air temperature not colder than 50F (10C) nor hotter than 95F (35C).
If sensors on a CPU reach 179.6F (82C), the system takes that CPU offline.
If you see this fault, take the following actions:
Review the product specifications to determine the temperature operating range of the server.
Review the Cisco UCS Site Preparation Guide to ensure that the servers have adequate airflow, including front and back clearance.
Verify that the airflow to the server is not obstructed.
Verify that the site cooling system is operating properly.
Clean the installation site at regular intervals to avoid a buildup of dust and debris, which can cause a system to overheat.
If the problem still persists, create a tech-support file and contact Cisco TAC.
Severity: minor
Cause: thermal-problem
mibFaultCode: 186
mibFaultName: fltMemoryUnitThermalThresholdNonCritical
moClass: memory:Unit
Type: environmental
F0188
You see one of the following messages when this fault is raised:
This fault occurs when the temperature of a memory unit on a server has been out of the operating range.
The possible contributing factors are as follows:
Temperature extremes can cause Cisco UCS equipment to operate at reduced efficiency and cause various problems, including early degradation, failure of chips, and failure of equipment. In addition, extreme temperature fluctuations can cause CPUs to become loose in their sockets.
Cisco UCS equipment must operate in an environment that provides an inlet air temperature not colder than 50F (10C) nor hotter than 95F (35C).
If sensors on a CPU reach 179.6F (82C), the system takes that CPU offline.
If you see this fault, take the following actions:
Review the product specifications to determine the temperature operating range of the server.
Review the Cisco UCS Site Preparation Guide to ensure that the servers have adequate airflow, including front and back clearance.
Verify that the airflow to the server is not obstructed.
Verify that the site cooling system is operating properly.
Clean the installation site at regular intervals to avoid a buildup of dust and debris, which can cause a system to overheat.
If the problem still persists, create a tech-support file and contact Cisco TAC.
Severity: major
Cause: thermal-problem
mibFaultCode: 188
mibFaultName: fltMemoryUnitThermalThresholdNonRecoverable
moClass: memory:Unit
Type: environmental