Chassis and FEX Alarms

Chassis and FEX Components Alarms

Following table shows the description of the supported alarms for chassis and FEX components.

Name MO Severity Explanation Recommended Action
IoCardTemperatureCritical equipment.IoCard Critical The I/O Card has a critical temperature threshold condition.
  1. View the acceptable temperature and voltage parameters and determine how much of the outlet or inlet temperature has reached or exceeded over the major or minor threshold value.

  2. Monitor other environmental events and ensure the temperature ranges are within recommended ranges.

  3. If the above actions do not resolve the issue, create a show tech-support file and contact Cisco TAC.

IoCardTemperatureWarning equipment.IoCard Warning The I/O Card has a warning temperature threshold condition.
  1. View the acceptable temperature and voltage parameters and determine how much of the outlet or inlet temperature has reached or exceeded over the major or minor threshold value.

  2. Monitor other environmental events and ensure the temperature ranges are within recommended ranges.

  3. If the above actions do not resolve the issue, create a show tech-support file and contact Cisco TAC.

ChassisInputPowerCritical equipment.Chassis Critical The chassis input power has crossed the threshold condition.
  1. Monitor the PSU status.

  2. Verify that the input power cord is appropriate as per the spec sheet.

  3. If possible, remove and reset the PSU.

  4. If the above actions do not resolve the issue, create a show tech-support file and contact Cisco TAC.

ChassisInputPowerWarning equipment.Chassis Warning The chassis input power has reached the threshold condition.
  1. Monitor the PSU status.

  2. Verify that the input power cord is appropriate as per the spec sheet.

  3. If possible, remove and reset the PSU.

  4. If the above actions do not resolve the issue, create a show tech-support file and contact Cisco TAC.

ChassisOutputPowerCritical equipment.Chassis Critical The chassis output power has crossed the threshold condition.
  1. Monitor the PSU status.

  2. Verify that the output power matches the maximum rated output power as per the spec sheet.

  3. If possible, remove and reset the PSU.

  4. If the above actions do not resolve the issue, create a show tech-support file and contact Cisco TAC.

ChassisOutputPowerWarning equipment.Chassis Warning The chassis output power has reached the threshold condition.
  1. Monitor the PSU status.

  2. Verify that the output power matches the maximum rated output mentioned in the spec sheet.

  3. If possible, remove and reseat the PSU.

  4. If the above actions do not resolve the issue, create a show tech-support file and contact Cisco TAC.

ChassisFansMissing equipment.Chassis Critical Multiple chassis fans are not operational or missing.
  1. Check the fans operational state on the GUI. Chassis>Inventory>Thermal>Fan Modules>Fan Module Name>Fans

  2. Check the fan-related syslog messages to see the exact reason for the failure.

  3. Create a show tech-support file and contact Cisco TAC to see if the fans need replacement.

ChassisFanMissing equipment.Chassis Warning A single chassis fan is not operational or missing.
  1. Check the fans operational state on the GUI. Chassis>Inventory>Thermal>Fan Modules>Fan Module <ID>>Fans

  2. Check the fan-related syslog messages to see the exact reason for the failure.

  3. Create a show tech-support file and contact Cisco TAC to see if the fan needs replacement.

ChassisPsuRedundancyLost equipment.Chassis Critical The chassis power supply redundancy lost.
  1. Consider adding more PSUs to the chassis.

  2. If the issue still persists, create a show tech-support file and contact Cisco TAC.

IoCardLowMemory equipment.IoCard Critical The I/O Card has a critical low memory error.

Create a show tech-support file and contact Cisco TAC.

IoCardFruState equipment.IoCard Critical The I/O Card Field Replacement Unit (FRU) is not readable.

Create a show tech-support file and contact Cisco TAC.

ChassisFruState equipment.Chassis Critical The Chassis Field Replacement Unit (FRU) is not readable.
  1. Verify that a supported adapter is installed.

  2. Create a show tech-support file and contact Cisco TAC to see if the adapter needs replacement.

IoCardPost equipment.IoCard Warning The I/O Card has a POST error. Create a show tech-support file and contact Cisco TAC.
IoCardAsicPost equipment.IoCard Warning The I/O Card ASIC has a POST error Create a show tech-support file and contact Cisco TAC.
IoCardSelectedImage equipment.IoCard Warning There is some issue with the current I/O Card firmware image.
  1. Review the fault and the error message on Chassis>Inventory>IO Modules to determine why the firmware image is unusable.

  2. If the firmware image is bad or corrupted, upgrade the server firmware/HSU bundle.

  3. If the issue still persists, create a show tech-support file and contact Cisco TAC.

IoCardAlternateImage equipment.IoCard Warning There is some issue with the alternate firmware image of the I/O Card.
  1. Review the fault and the error message on Chassis>Inventory>IO Modules to determine why the firmware image is unusable.

  2. If the firmware image is bad or corrupted, upgrade the server firmware/HSU bundle.

  3. If the image is present and the fault persists, create a show tech-support file and contact Cisco TAC.

ChassisPowerCritical equipment.Chassis Critical The chassis power supply has critical issue.
  1. Review the product specifications to determine the operating temperature range of the PSU module.

  2. Power off unused blade servers and rack servers.

  3. Check the power supply unit that has the problem, as follows:

    • On the CLI, run the following command on chassis IFM/ IOM to get the power details: pwrmgrcli -a

    • On the GUI, view the PSUs tab here: Chassis>Inventory>Power>PSUs

  4. If the above actions do not resolve the issue, create a show tech-support file and contact Cisco TAC.

ChassisPowerWarning equipment.Chassis Warning The chassis power supply has warning issue.
  1. Review the product specifications to determine the temperature operating range of the PSU module.

  2. Power off unused blade servers and rack servers.

  3. Check the power supply unit that has the problem, as follow:

    • On the CLI, run the following command on chassis IFM/ IOM to get the power details: pwrmgrcli -a

    • On the GUI, view the PSUs tab here: Chassis>Inventory>

      Power>PSUs

  4. If the above actions do not resolve the issue, create a show tech-support file and contact Cisco TAC.

ChassisPsuFruState equipment.Psu Critical The power supply Field Replacement Unit (FRU) is not readable. Create a show tech-support file and contact Cisco TAC.
ChassisPsuUnresponsive equipment.Psu Critical The power supply is unresponsive.
  1. Check the power supply unit that has the problem, as follow:

    • On the CLI, run the following command on chassis IFM/ IOM to get the power details: pwrmgrcli -a

    • On the GUI, view the PSUs tab here: Chassis>Inventory>Power>PSUs

  2. Verify that the power cord is properly connected to the power supply and to the power source.

  3. Ensure that the power supply is properly inserted and plugged in.

  4. If problem persists, remove and re-insert the power-supply unit.

  5. If the above actions do not resolve the issue, create a show tech-support file and contact Cisco TAC.

ChassisPsuInputOutOfRange equipment.Psu Warning The chassis power supply has out of range AC input.
  1. Check the power supply unit that has the problem, as follow:

    • On the CLI, run the following command on chassis IFM/ IOM to get the power details: pwrmgrcli -a

    • On the GUI, view the PSUs tab here: Chassis>Inventory>Power>PSUs

  2. Verify that the power cord is properly connected to the power supply and to the power source.

  3. Ensure that the power supply is properly inserted and plugged in.

  4. If problem persists, remove and re-insert the power-supply unit.

  5. If the above actions do not resolve the issue, create a show tech-support file and contact Cisco TAC.

ChassisPsuInputLost equipment.Psu Warning The power supply has no AC input.
  1. Monitor the PSU status.

  2. Verify that the power cord is properly connected to the power supply and to the power source.

  3. If possible, remove and reseat the PSU.

  4. If the above actions do not resolve the issue, create a show tech-support file and contact Cisco TAC.

ChassisPsuOutput equipment.Psu Critical The power supply has an error condition that prevents DC output.
  1. Monitor the PSU status.

  2. Verify that the power cord is properly connected to the power supply and to the power source.

  3. Remove and reseat the PSU.

  4. If the above actions do not resolve the issue, create a show tech-support file and contact Cisco TAC.

ChassisPsuTemperatureCritical equipment.Psu Critical The power supply has a temperature threshold condition.
  1. Monitor the PSU status.

  2. Verify that the server fans are working properly.

  3. If the above actions do not resolve the issue, create a show tech-support file and contact Cisco TAC to see if any hardware needs replacement.

ChassisPsuTemperatureWarning equipment.Psu Warning The power supply has a temperature threshold condition.
  1. Monitor the PSU status.

  2. Verify that the server fans are working properly.

  3. If the above actions do not resolve the issue, create a show tech-support file and contact Cisco TAC to see if any hardware needs replacement.

ChassisPsuInputVoltageCritical equipment.Psu Critical The power supply input voltage has crossed threshold condition.
  1. Verify that the power cord is properly connected to the PSU and the power source.

  2. Verify that the power source is within the input voltage range mentioned in the spec sheet.

  3. Verify that the PSU is properly installed in the chassis.

  4. Remove the PSU and reinstall it.

  5. If the issue persists, create a show tech-support file and contact Cisco TAC to see if the PSU needs replacement.

ChassisPsuInputVoltageWarning equipment.Psu Warning The power supply input voltage has reached threshold condition.
  1. Verify that the power cord is properly connected to the PSU and the power source.

  2. Verify that the power source is within the input voltage range mentioned in the spec sheet.

  3. Verify that the PSU is properly installed in the chassis.

  4. Remove the PSU and reinstall it.

  5. If the issue persists, create a show tech-support file and contact Cisco TAC to see if the PSU needs replacement.

ChassisPsuOutputCurrentCritical equipment.Psu Critical The power supply output current has crossed the threshold condition.
  1. Monitor the PSU status.

  2. Remove and reseat the PSU.

  3. If the issue persists, create a show tech-support file and contact Cisco TAC to see if the PSU needs replacement.

ChassisPsuOutputCurrentWarning equipment.Psu Warning The power supply output current has reached the threshold condition.
  1. Monitor the PSU status.

  2. Remove and reseat the PSU.

  3. If the issue persists, create a show tech-support file and contact Cisco TAC to see if the PSU needs replacement.

ChassisPsuOutputVoltageCritical equipment.Psu Critical The power supply output voltage has crossed the threshold condition.
  1. Verify that the power cord is properly connected to the PSU and the power source.

  2. Verify that the power source is within the output voltage range mentioned in the spec sheet.

  3. Verify that the PSU is properly installed in the chassis.

  4. Remove the PSU and reinstall it.

  5. If the issue persists, create a show tech-support file and contact Cisco TAC to see if the PSU needs replacement.

ChassisPsuOutputVoltageWarning equipment.Psu Warning The power supply output voltage has reached the threshold condition.
  1. Verify that the power cord is properly connected to the PSU and the power source.

  2. Verify that the power source is within the output voltage range mentioned in the spec sheet.

  3. Verify that the PSU is properly installed in the chassis.

  4. Remove the PSU and reinstall it.

  5. If the issue persists, create a show tech-support file and contact Cisco TAC to see if the PSU needs replacement.

ChassisPsuOutputPowerCritical equipment.Psu Critical The power supply output power has crossed the threshold condition.
  1. Verify that the power cord is properly connected to the PSU and the power source.

  2. Verify that the output power matches the maximum rated output mentioned in the spec sheet.

  3. Verify that the PSU is properly installed in the chassis.

  4. Remove the PSU and reinstall it.

  5. If the issue persists, create a show tech-support file and contact Cisco TAC to see if the PSU needs replacement.

ChassisFanFruState equipment.Fan Critical The fan Field Replacement Unit (FRU) is not readable.

If you see this fault, take the following actions:

  1. Remove fan module and re-install the fan module again. Remove only one fan module at a time.

  2. Create a show tech-support file and contact Cisco TAC to see if the fan module needs to be replaced with a different fan module.

ChassisFanUnresponsive equipment.Fan Critical The chassis fan is unresponsive.

If you see this fault, take the following actions:

  1. Check the status of the fan module here for Cisco UCS X-Series Chassis Chassis>Chassis Name>Inventory>Intelligent Fabric Modules>IFM name>Fan Modules>Fans

    or

    Check the status of the fan module here for chassis other than Cisco UCS X-Series. Chassis>Chassis Name>Inventory>Thermal>Fan Modules>Fans

  2. Check the operational state of the fan.

  3. Create a show tech-support file and contact Cisco TAC to see if any hardware needs replacement.

ChassisFanTemperatureCritical equipment.Fan Critical The chassis fan has a temperature threshold condition.
  1. Review the product specifications to determine the temperature operating range of the fan module.

  2. Power off unused blade servers and rack servers.

  3. Verify that the site cooling system is operating properly.

  4. Set the value of the Fan Control Mode for the chassis using Chassis Thermal policy.

  5. Create a show tech-support file and contact Cisco TAC to see if any hardware needs replacement.

ChassisFanTemperatureWarning equipment.Fan Warning The chassis fan has a temperature threshold condition.
  1. Review the product specifications to determine the temperature operating range of the fan module.

  2. Power off unused blade servers and rack servers.

  3. Verify that the site cooling system is operating properly.

  4. Set the value of the Fan Control Mode for the chassis using Chassis Thermal policy.

  5. Create a show tech-support file and contact Cisco TAC to see if any hardware needs replacement.

ChassisFanSpeedCritical equipment.Fan Critical The chassis fan has a speed threshold condition.

If you see this fault, take the following actions:

  1. If the fan is running below the expected speed, ensure that the fan blades are not blocked.

  2. If the fan is running above the expected speed, remove and re-insert the fan.

  3. If the issue persists, create a show tech-support file and contact Cisco TAC to see if any hardware needs replacement.

ChassisFanSpeedWarning equipment.Fan Warning The chassis fan has a speed threshold condition.

If you see this fault, take the following actions:

  1. If the fan is running below the expected speed, ensure that the fan blades are not blocked.

  2. If the fan is running above the expected speed, remove and re-insert the fan.

  3. If the issue persists, create a show tech-support file and contact Cisco TAC to see if any hardware needs replacement.

FexPsuInoperable equipment.Psu Critical This alarm occurs if a Power Supply is not operational.
  1. Check the PSU status by navigating on the GUI as follows: Chassis >Chassis Name >Inventory> Power>PSUs

  2. Create a show tech-support file and contact Cisco TAC to see if any hardware needs replacement.

FexPsuPoweredOff equipment.Psu Critical This alarm occurs if a Power Supply is powered off either due to higher than expected power or due to higher than expected temperatures or because of the failure of a fan.
  1. Check the power supply unit that has the problem, as follow:

    • On the GUI, view the PSUs tab here: on the GUI Fabric Interconnects > Fabric Interconnect Name > Connections > Fabric Extenders>Inventory>PSUs

  2. Verify that the power cord is properly connected to the power supply and to the power source.

  3. Ensure that the power supply is properly inserted and plugged in.

  4. Ensure that the PSU is operating in the permissible temperature range.

  5. Verify that the fans are working properly.

  6. If the above actions do not resolve the issue, create a show tech-support file and contact Cisco TAC to see if any hardware needs replacement.

FexFanInoperable equipment.Fan Critical This alarm occurs if a fan is not operational.
  1. Check the fan status on the GUI Fabric Interconnects > Fabric Interconnect Name > Connections > Fabric Extenders>Inventory>Fan Modules

  2. Check the fan-related syslog messages to see the exact reason for the failure.

  3. Create a show tech-support file and contact Cisco TAC to see if any hardware needs replacement.

FexFanPoweredOff equipment.Fan Critical This alarm occurs if a fan is shutdown.
  1. Check the fan status on the GUI Fabric Interconnects > Fabric Interconnect Name > Connections > Fabric Extenders>Inventory>Fan Modules

  2. Check the fan-related syslog messages to see the exact reason for the failure.

  3. If the fan is OK, Check the PSU status Fabric Interconnects > Fabric Interconnect Name > Connections > Fabric Extenders>Inventory>PSUs

  4. Verify that the power cord is properly connected to the power supply and to the power source.

  5. Ensure that the power supply is properly inserted and plugged in.

  6. If problem persists, remove and re-insert the power-supply unit.

  7. If the above actions do not resolve the issue, create a show tech-support file and contact Cisco TAC to see if any hardware needs replacement.

IoCardOffline equipment.IoCard Critical The I/O Card is offline. This fault typically occurs because an I/O module has lost its connection to the Fabric Interconnects.
  1. Wait a few minutes to see if the fault clears. This is typically a temporary issue, and can occur after a firmware upgrade.

  2. If the fault does not clear after a few minutes, remove, and reinsert the I/O card.

  3. If the above actions do not resolve the issue, create a show tech-support file and contact Cisco TAC.

IoCardMissing equipment.IoCard Critical I/O Card is missing or removed.
  1. Reinsert the I/O card and configure the Fabric Interconnect ports connected to it as server ports and wait a few minutes to see if the fault clears.

  2. If the above action does not resolve the issue, create a show tech-support file and contact Cisco TAC.

InvalidConnections

equipment.Chassis Critical

Connection topology between the Fabric Interconnects and IO Card is incorrect.

  • Verify and correct cabling between Fabric Interconnects and IO Cards.

  • Perform a rediscovery of the chassis.

  • If the above action does not resolve the issue, contact Cisco TAC.

ChassisThermalSafeMode

equipment.Chassis

Critical

Chassis cannot read temperature sensors from any components; fans forced to SAFE mode.

  • Verify all modules (Servers, IOMs, IFMs, and XFMs) are present and healthy.

  • If the above action does not resolve the issue, contact Cisco TAC.

TempSensorReadFailure (ServerSlot1–8)

equipment.Chassis

Critical

Cannot read temperature data from the indicated component.

  • Reseat the affected module

  • If the above action does not resolve the issue, contact Cisco TAC.

TempSensorReadFailure (XFM1–2)

equipment.Chassis

Critical

Cannot read temperature data from the indicated component.

  • Reseat the affected module

  • If the above action does not resolve the issue, contact Cisco TAC.

TempSensorReadFailure (IFM1–2)

equipment.Chassis

Critical

Cannot read temperature data from the indicated component.

  • Reseat the affected module

  • If the above action does not resolve the issue, contact Cisco TAC.

PcieSlotPowerFault

equipment.SharedGraphicsCard

Critical

Power fault detected on PCIe slot.

  • Check seating and power/MCI cables.

  • If the above action does not resolve the issue, contact Cisco TAC.

PcieAuxPowerCableMissing

equipment.SharedGraphicsCard

Critical

Auxiliary PCIe power cable is not detected.

  • Verify auxiliary cable connection.

  • If the above action does not resolve the issue, contact Cisco TAC.

Slot1–8PCIeDeviceMappingInvalid

equipment.Chassis

Warning

PCIe device mapping detected without a PCIe connectivity policy.

  • Perform a Reset Chassis Slot Configuration operation.

  • If the above action does not resolve the issue, contact Cisco TAC.

ChassisControllerTemperatureCritical

equipment.ChassisController

Critical

This alarm indicates that the Chassis Controller has a critical temperature threshold condition.

Contact Cisco TAC for further assistance.

ChassisControllerTemperatureWarning

equipment.ChassisController

Warning

This alarm indicates that the Chassis Controller has a warning temperature threshold condition.

Contact Cisco TAC for further assistance.

ChassisControllerLowMemory

equipment.ChassisController

Critical

This alarm indicates that the Chassis Controller has a critical low memory error.

Contact Cisco TAC for further assistance.

ChassisControllerFruState

equipment.ChassisController

Critical

The Chassis Controller Field Replacement Unit (FRU) is not readable.

Contact Cisco TAC for further assistance.

ChassisControllerPost

equipment.ChassisController

Warning

The Chassis Controller has a POST error.

Contact Cisco TAC for further assistance.

ChassisControllerAsicPost

equipment.ChassisController

Warning

The Chassis Controller ASIC has a POST error

Contact Cisco TAC for further assistance.

ChassisControllerTPM

equipment.ChassisController

Critical

The Chassis Controller has a TPM error

Contact Cisco TAC for further assistance.

ChassisControllerSelectedImage

equipment.ChassisController

Warning

There is some issue with the current Chassis Controller firmware image.

Contact Cisco TAC for further assistance.

ChassisControllerAlternateImage

equipment.ChassisController

Warning

There is some issue with the alternate firmware image of the Chassis Controller.

Contact Cisco TAC for further assistance.

ChassisControllerOffline

equipment.ChassisController

Critical

The Chassis Controller is offline.

Contact Cisco TAC for further assistance.

ChassisControllerMissing

equipment.ChassisController

Critical

The Chassis Controller is missing or removed

Contact Cisco TAC for further assistance.