Introduction
This document describes how to clear transient Uncorrectable Error Correction Code (UECC) memory errors on Cisco Unified Computing Systems (UCS).
Prerequisites
Requirements
There are no specific requirements for this document.
Components Used
This document is valid for all M5, M6 and M7 generation servers.
The information in this document was created from the devices in a specific lab environment. All of the devices used in this document started with a cleared (default) configuration. If your network is live, ensure that you understand the potential impact of any command.
Background Information
In the case of servers that do not display prior instances of Error Correction Code(ECC) or Uncorrectable Error Correction Code (UECC) errors, certain memory modules can exhibit UECC memory errors upon BIOS POST during a reboot under normal operations. The affected Dual In-Line Memory Modules (DIMM) are shown as block listed, but no new errors are reported upon subsequently clearing the Block Listing data or, during a Failure Analysis after a Return Materials Authorization (RMA). This indicates the potential transient nature of UCS Memory Errors. DIMMs can continue to be used after clearing the Block Listing Error data.
Transient or non-persistent errors can occur without warning. Correctable Errors do not always necessarily occur withinin a DIMM prior to the occurance of a DIMM Training Error or a Multibit Error. The scope of what occurs during memory training greatly differs from what occurs when memory is used at runtime. Therefore, the stability of the memory modules does not necessarily correlate between server POST and Runtime.
Known Defects
The behavior previously described is outlined by the listed Bugs:
1. Cisco bug ID CSCwa75339
2. Cisco bug ID CSCwk28210
3. Cisco bug ID CSCwo62396
Steps to Clear POST UECC Errors
Blade Server
Note: Please select the correct Chassis ID and Server ID required for your specific environment. The image displayed is for demonstration purposes.
Power off the server.
The server requires to be physically powered off otherwise the UECC errors can persist upon reboot.
Note: DIMM SPD data is not updated as CIMC polls the DIMM SPD data only upon server power on. If there is a truly problematic DIMM, the error returns upon reboot and Cisco Support can provide an RMA for the DIMM.
GUI Method for UCS Manager
Navigate to Equipment > Chassis > Chassis ID > Servers > Server ID > Actions > System > Reset All Memory Errors.

GUI Method for Intersight Managed Mode (IMM)

Note: These steps are also applicable to Integrated C-Series Servers.
Command Line Interface (CLI) Method
Launch an SSH Session to the UCS Manager Virtual IP Address. Ensure that the server in question is powered off prior to proceeding with the steps as outlined.

Once complete, verify that no new errors increment within the system.
Rack Server
Power off the server.
The server requires to be physically powered off otherwise the UECC errors can persist upon reboot.
Note: DIMM SPD data is not updated as CIMC polls the DIMM SPD data only upon server power on. If there is a truly problematic DIMM, the error returns upon reboot and Cisco Support can provide an RMA for the DIMM.
GUI Method
If the Server is not managed with Intersight Managed Mode (IMM), then the Command Line Interface (CLI) method is required. Otherwise, the previous steps are applicable.
CLI Method
Note: The image displayed is for demonstration purposes.
Launch an SSH Session to the Server CIMC IP Address. Ensure that the server in question is powered off prior to proceeding with the steps as outlined.

Once complete, verify that no new errors increment within the system.