Guest

Cisco 12000 Series Routers

Hardware Troubleshooting for Cisco 12000 Series Internet Router Line Card Failures

Document ID: 15926

Updated: Mar 09, 2009

   Print

Introduction

Valuable time and resources are often wasted replacing hardware that actually functions properly. This document helps troubleshoot common hardware issues with the Cisco 12000 Series Internet Router, and provides pointers for identifying whether or not the fault is in the hardware.

Note: This document does not cover any software-related failures except for those that are often mistaken as hardware issues.

Prerequisites

Requirements

Readers of this document should have knowledge of these topics:

If you feel that the problem is related to a hardware fault, this document can help you identify the cause of the failure.

Components Used

The information in this document is based on these software and hardware versions:

  • All Cisco 12000 Series Internet Routers, including the 12008, 12012, 12016, 12404, 12406, 12410, and the 12416.

  • All Cisco IOS® software versions that support the Cisco 12000 Series Internet Router.

The information in this document was created from the devices in a specific lab environment. All of the devices used in this document started with a cleared (default) configuration. If your network is live, make sure that you understand the potential impact of any command.

Hardware-Software Compatibility and Memory Requirements

Whenever you install a new line card, module, or Cisco IOS software image, it is important to verify whether the router has enough memory, and that the hardware and software are compatible with the features you want to use.

Complete these recommended steps to check for hardware-software compatibility and memory requirements:

  1. Use the Software Advisor (registered customers only) tool to choose software for your network device.

    Tip:

  2. Use the Download Software Area (registered customers only) to check the minimum amount of memory (RAM and Flash) required by the Cisco IOS software, and/or download the Cisco IOS software image. To determine the amount of memory (RAM and Flash) installed on your router, see How to Choose a Cisco IOS Software Release - Memory Requirements.

    Tips:

    • If you want to keep the same features as the version that is currently running on your router, but do not know which feature set you use, enter the show version command on your Cisco device, and paste its output in the Output Interpreter Tool. You can use Output Interpreter (registered customers only) to display potential issues and fixes. To use Output Interpreter (registered customers only) , you must be a registered customer, be logged in, and have JavaScript enabled. It is important to check for feature support, especially if you plan to use recent software features.

    • If you need to upgrade the Cisco IOS software image to a new version or feature set, refer to How to Choose a Cisco IOS Software Release for more information.

  3. If you determine that a Cisco IOS software upgrade is required, follow the Software Installation and Upgrade Procedure for the Cisco 12000 Series Router.

    Tip: For information on how to recover a Cisco 12000 series router stuck in ROMmon (rommon # > prompt), see ROMmon Recovery Procedure for the Cisco 12000.

Conventions

For more information on document conventions, see the Cisco Technical Tips Conventions.

Identify the Issue

With the help of the information in this section, you will be able to determine whether the problems you face with your line card are hardware-related.

The first thing you need to do is identify the cause of the line card crash or console errors that you encounter. To see which card is possibly at fault, it is essential that you collect the output from these commands:

  • show context summary

  • show logging

  • show logging summary

  • show diag <slot>

  • show context slot <slot>

Along with these specific show commands, you must also gather this information:

  • Console logs and/or Syslog information: These can be crucial to determine the originating issue if multiple symptoms occur. If the router is set up to send logs to a syslog server, you would possibly see some information on what happened. For console logs, it is best to be directly connected to the router on the console port through System Message Logging.

  • show technical-support: The show technical-support command is a compilation of many different commands, and includes show version, show running-config, and show stacks. When a router runs into problems, the Cisco Technical Assistance Center (TAC) engineer usually asks for this information. It is important to collect the show technical-support command output before you reload or power-cycle your device, because these actions can cause all information about the problem to be lost.

Here are some examples of output that you can expect to see if your Gigabit Route Processor (GRP) or line card has crashed:

Router#show context summary 

CRASH INFO SUMMARY 

  Slot 0 : 0 crashes 
  Slot 1 : 1 crashes 

    1 - crash at 10:36:20 UTC Wed Dec 19 2001 
 
  Slot 2 : 0 crashes 

  Slot 3 : 0 crashes 

  Slot 4 : 0 crashes 

  Slot 5 : 0 crashes 

  Slot 6 : 0 crashes 

  Slot 7 : 0 crashes 

  Slot 8 : 0 crashes 

  Slot 9 : 0 crashes 

  Slot 10: 0 crashes 
 
  Slot 11: 0 crashes 
 
  Slot 12: 0 crashes 

  Slot 13: 0 crashes 

  Slot 14: 0 crashes 

  Slot 15: 0 crashes

Router#show logging 

Syslog logging: enabled (2 messages dropped, 0 messages rate-limited, 0 flushes, 

0 overruns) 

    Console logging: level debugging, 24112 messages logged 

    Monitor logging: level debugging, 0 messages logged 

    Buffer logging: level debugging, 24411 messages logged 

    Logging Exception size (4096 bytes) 

    Trap logging: level informational, 24452 message lines logged

5d16h: %LCINFO-3-CRASH: Line card in slot 1 crashed 

5d16h: %GRP-4-RSTSLOT: Resetting the card in the slot: 1,Event: 38 

5d16h: %IPCGRP-3-CMDOP: IPC command 3 

5d16h: %CLNS-5-ADJCHANGE: ISIS: Adjacency to malachim2 (GigabitEthernet1/0) Up, 

n8 (slot1/0): linecard is disabled 

-Traceback=602ABCA8 602AD8B8 602B350C 602B3998 6034312C 60342290 601A2BC4 601A2BB0 

5d16h: %LINK-5-CHANGED: Interface GigabitEthernet1/0, changed state to 

administratively down 

5d16h: %LINEPROTO-5-UPDOWN: Line protocol on Interface GigabitEthernet1/0, 

changed state to down 

5d16h: %GRP-3-CARVE_INFO: Setting mtu above 8192 may reduce available buffers 

on Slot: 1. 

SLOT 1:00:00:09: %SYS-5-RESTART: System restarted -- 

Cisco Internetwork Operating System Software 

IOS (tmew adjacency) GS Software (GLC1-LC-M), Version 12.0(17)ST3, 

EARLY DEPLOYMENT RELEASE SOFTWARE (fc1) 

TAC Support: http://www.cisco.com/tac 

Copyright (c) 1986-2001 by cisco Systems, Inc. 

Compiled Thu 08-Nov-01 20:21 by dchih 

5d16h: %GRPGE-6-AUTONEG_STATE: Interface GigabitEthernet1/0: Link OK - 

autonegotiation complete 

5d16h: %LINK-3-UPDOWN: Interface GigabitEthernet1/0, changed state to up 

5d16h: %LINEPROTO-5-UPDOWN: Line protocol on Interface GigabitEthernet1/0, 

changed state to up

Router#show diag 1 

SLOT 1  (RP/LC 1 ): 3 Port Gigabit Ethernet 

  MAIN: type 68,  800-6376-01 rev E0 dev 0 

        HW config: 0x00    SW key: 00-00-00 

  PCA:  73-4775-02 rev E0 ver 2 

        HW version 2.0  S/N CAB0450G8FX 

  MBUS: Embedded Agent 

        Test hist: 0x00    RMA#: 00-00-00    RMA hist: 0x00 

  DIAG: Test count: 0x00000001    Test results: 0x00000000 

  FRU:  Linecard/Module: 3GE-GBIC-SC= 

        Route Memory: MEM-GRP/LC-64= 

        Packet Memory: MEM-LC1-PKT-256= 

  L3 Engine: 2 - Backbone OC48 (2.5 Gbps) 

  MBUS Agent Software version 01.46 (RAM) (ROM version is 02.10) 

  Using CAN Bus A 

  ROM Monitor version 10.06 

  Fabric Downloader version used 05.01 (ROM version is 05.01) 

  Primary clock is CSC 0  Board is analyzed 

  Board State is Line Card Enabled (IOS  RUN ) 

  Insertion time: 00:00:10 (5d16h ago) 

  DRAM size: 67108864 bytes 

  FrFab SDRAM size: 134217728 bytes, SDRAM pagesize: 8192 bytes 

  ToFab SDRAM size: 134217728 bytes, SDRAM pagesize: 8192 bytes 

  1 crash since restart

Router#show context slot 1 

CRASH INFO: Slot 1, Index 1, Crash at 10:36:20 UTC Wed DEC 19 2001
VERSION: 

GS Software (GLC1-LC-M), Version 12.0(17)ST3,

EARLY DEPLOYMENT RELEASE SOFTWARE (fc1) 

TAC Support: http://www.cisco.com/tac 

Compiled Thu 08-Nov-01 20:21 by dchih 

Card Type: 3 Port Gigabit Ethernet, S/N 

System exception: sig=10, code=0x10, context=0x41036514 

System restarted by a Bus Error exception 

STACK TRACE: 

-Traceback= 406914C8 4004EEAC 4005BCE4 400A33F4 400A33E0 

CONTEXT: 

$0 : 00000000, AT : 41030000, v0 : 00000000, v1 : 41036290 

a0 : 00000030, a1 : 412C6CA0, a2 : 00000000, a3 : 00000000 

t0 : 00008100, t1 : 34008101, t2 : 400C5590, t3 : FFFF00FF 

t4 : 400C5560, t5 : 00040000, t6 : 00000000, t7 : 413D1D78 

s0 : FF012345, s1 : 00000031, s2 : 41032B10, s3 : 41BB8F00 

s4 : 00000000, s5 : 00000001, s6 : 4101D620, s7 : 00000000 

t8 : 418EA1C8, t9 : 00000000, k0 : 4142C7A0, k1 : 400C7538 

gp : 40F57DC0, sp : 41BB8EE8, s8 : 41023740, ra : 406914C8 

EPC : 0x406914C8, SREG : 0x34008103, Cause : 0x00000010 

ErrorEPC : 0x400B3A5C
-Process Traceback= No Extra Traceback
SLOT 1:00:00:09: %SYS-5-RESTART: System restarted -- 

Cisco Internetwork Operating System Software 

IOS (tm) GS Software (GLC1-LC-M), Version 12.0(17)ST3, 

EARLY DEPLOYMENT RELEASE SOFTWARE (fc1) 

TAC Support: http://www.cisco.com/tac 

Copyright (c) 1986-2001 by cisco Systems, Inc. 

Compiled Thu 08-Nov-01 20:21 by dchih 

SLOT 1:20:18:09: %LCGE-6-GBIC_OIR: 3 Port Gigabit Ethernet GBIC

removed from port  2 

SLOT 1:20:18:29: %LCGE-6-GBIC_OIR: 3 Port Gigabit Ethernet GBIC

inserted in port 2 

SLOT 1:3d20h: %LCGE-6-GBIC_OIR: 3 Port Gigabit Ethernet GBIC

removed from port 2 

SLOT 1:3d20h: %LCGE-6-GBIC_OIR: 3 Port Gigabit Ethernet GBIC

inserted in port 2 

SLOT 1:00:00:09: %SYS-5-RESTART: System restarted -- 

Cisco Internetwork Operating System Software 

IOS (TM) GS Software (GLC1-LC-M), Version 12.0(17)ST3, 

EARLY DEPLOYMENT RELEASE SOFTWARE (fc1) 

TAC Support: http://www.cisco.com/tac 

Copyright (c) 1986-2001 by cisco Systems, Inc. 

Compiled Thu 08-Nov-01 20:21 by dchi

Line Card Crash

If a line card has crashed, and you have identified the line card that has crashed, you now need to determine the cause of the crash. The output from the show context <slot> command enables you to do this. Here is an example:

Router#show context slot 2 

CRASH INFO: Slot 2, Index 1, Crash at 12:24:22 MET Wed Nov 28 2001
VERSION: 

GS Software (GLC1-LC-M), Version 12.0(18)S1,

EARLY DEPLOYMENT RELEASE SOFTWARE (fc1) 

TAC Support: http://www.cisco.com/tac 

Compiled Fri 07-Sep-01 20:13 by nmasa 

Card Type: 3 Port Gigabit Ethernet, S/N 

System exception: SIG=23, code=0x24, context=0x4103FE84 

System restarted by a Software forced crash 

STACK TRACE: 

-Traceback= 400BEB08 40599554 4004FB64 4005B814 400A1694 400A1680 

CONTEXT: 

$0 : 00000000, AT : 41040000, v0 : 00000032, v1 : 4103FC00 

a0 : 4005B0A4, a1 : 41400A20, a2 : 00000000, a3 : 00000000 

t0 : 41D75220, t1 : 8000D510, t2 : 00000001, t3 : FFFF00FF 

t4 : 400C2670, t5 : 00040000, t6 : 00000000, t7 : 4150A398 

s0 : 0000003C, s1 : 00000036, s2 : 4103C4D0, s3 : 41D7EC60 

s4 : 00000000, s5 : 00000001, s6 : 41027040, s7 : 00000000 

t8 : 41A767B8, t9 : 00000000, k0 : 415ACE20, k1 : 400C4260 

GP : 40F0DD00, SP : 41D7EC48, s8 : 4102D120, ra : 40599554 

EPC : 0x400BEB08, SREG : 0x3400BF03, Cause : 0x00000024 

ErrorEPC : 0x400C6698, BadVaddr : 0xFFBFFFFB
-Process Traceback= No Extra Traceback
SLOT 2:00:00:09: %SYS-5-RESTART: System restarted -- 

Cisco Internetwork Operating System Software 

IOS (TM) GS Software (GLC1-LC-M), Version 12.0(18)S1, 

EARLY DEPLOYMENT RELEASE SOFTWARE (fc1) 

TAC Support: http://www.cisco.com/tac 

Copyright (c) 1986-2001 by cisco Systems, Inc. 

Compiled Fri 07-Sep-01 20:13 by nmae

You can identify the type of crash that has occurred from the "SIG=" value in the show context slot <slot> command output. See the SIG Code Table for details.

Here are some links that provide more information on the three most common types of line card crashes, and explain how to troubleshoot them:

In the example above, the line card has crashed due to a "software-forced crash" and, as the name suggests, a software exception has caused the reload. Once you have determined the cause and collected the necessary output, you can check for a bug in your Cisco IOS software release using the Bug Toolkit (registered customers only) .

Check the Current Status of the Line Card

When you have determined whether the problems are system errors in the log or an actual crash, you must check the current status of the line card to see whether it has recovered from the fault that has occurred. In order to identify the status of individual line cards, you can either examine the Light Emitting Diodes (LEDs) located on the front of the card, or issue the show led command. Here is a sample output:

Router#show led 

SLOT 1  : RUN IOS 

SLOT 6  : DNLD FABL 

SLOT 7  : RP ACTV 

SLOT 10 : RUN IOS 

SLOT 11 : RUN IOS 

SLOT 13 : RUN IOS 

SLOT 14 : RUN IOS

Table 1 and Table 2 describe the most common types of output that you see from this command and their meanings.

Note: It is possible for the value of the LED to be reversed. For example, IOS RUN can be displayed as RUN IOS.

Table 1 – RP LED Status and Meaning

RP LED Status Meaning of LED Status
RP UP RP is running Cisco IOS software and functioning correctly
MSTR RP RP is acting as the Primary GRP
SLAV RP RP is acting as the Slave GRP
RP ACTV RP is acting as the Primary GRP
RP SEC RP is acting as the Slave GRP
MEM INIT RP is trying to size the memory

Table 2 – LC LED Status and Meaning

LC LED Status Meaning of LED Status
DIAG DNLD Line card is downloading Field Diagnostic software
DIAG FAIL Line card has failed Field Diagnostic test
DIAG PASS Line card has passed Field Diagnostic test
DIAG TEST Line card is executing Field Diagnostic software
FABL DNLD Line card is launching "Fabric Downloader"
FABL WAIT Line card is waiting to load "Fabric Downloader"
IN RSET Line card is resetting
IOS DNLD Line card is downloading Cisco IOS software through the switch fabric
IOS RUN Line card is now enabled
IOS UP Line card has finished loading and is now running Cisco IOS software
MBUS DNLD Line card is downloading Maintenance Bus (MBUS) agent
MEM INIT Line card is trying to size memory
PWR OFF Line card is powered off

If the line card status is anything other than "IOS RUN", or the GRP is neither the active Master/Primary nor the Slave/Secondary, this means that there is a problem and the card has not fully loaded correctly. Before you replace the card, Cisco recommends that you try these steps to fix the issue:

  1. Reload the microcode through the microcode reload <slot> global configuration command.

  2. Reload the card through the hw-module slot <slot> reload command. This causes the line card to reset and re-download the Maintenance Bus (MBUS) and Fabric Downloader software modules before it attempts to re-download the line card Cisco IOS software.

  3. Reset the line card manually. This can rule out any problems that are caused by a bad connection to the MBUS or switching fabric.

Note: For more information on how to troubleshoot line cards stuck in any status other than RUN IOS, see Understanding the Booting Process on the Cisco 12000 Series Internet Router.

Fabric Ping Failure

Fabric ping failures occur when either a line card or the secondary GRP fails to respond to a fabric ping request from the primary GRP over the switch fabric. Such failures are a problem symptom that you must investigate. They are indicated by these error messages:

%GRP-3-FABRIC_UNI: Unicast send timed out (1)

%GRP-3-COREDUMP: Core dump incident on slot 1, error: Fabric ping failure 

%LCINFO-3-CRASH: Line card in slot 1 crashed

You can find more information about this issue at Troubleshooting Fabric Ping Timeouts and Failures on the Cisco 12000 Series Internet Router.

Parity Error Messages

The Cisco 12000 Series Internet Router Parity Error Fault Tree document explains the steps to troubleshoot and isolate a part or component of the Cisco 12000 Series Internet Router that fails, after you encounter a variety of parity error messages.

Error Messages

If you experience any error messages related to one of the line cards, you can use the Cisco Error Message Decoder (registered customers only) to find information about the meaning of the error message. Some of them point to a hardware issue of the line card, whereas others indicate a Cisco IOS software bug, or a hardware issue on another part of the router. This document does not cover all these messages.

Some Cisco Express Forwarding (CEF) and Inter Process-Communication (IPC)-related messages are explained in Troubleshooting CEF-Related Error Messages.

Test the Line Card for Hardware Failure

Line card Field Diagnostic software is designed to identify any faulty line card within a Cisco 12000 (all 12xxx series) router. Prior to Cisco IOS software release 12.0(22)S, the Field Diagnostic software was embedded within the Cisco IOS software. From Cisco IOS software release 12.0(22)S onwards, this software has been unbundled, and you can download it from CCO through the Download Software Area (registered customers only) (select FIELD DIAGS under 120XX platform). It is still run from a command initiated while running Cisco IOS software, but you must specify the source (either Trivial File Transfer Protocol (TFTP) boot server, or PCMCIA Flash memory) on the command line. All Field Diagnostics commands are run at the enable level of Cisco IOS software.

Cisco IOS Software Releases Later than 12.0(22)S

From Cisco IOS software release 12.0(22)S onwards, Cisco Systems has unbundled the Cisco 12000 Field Diagnostic line card image from the Cisco IOS software image. In earlier versions, diagnostics could be launched from the command line and the imbedded diagnostic image would be launched. In order to accommodate customers with 20Mb Flash memory cards, the Field Diagnostic software is now stored and maintained as a separate image: c12k-fdiagsbflc-mz.xxx-xx.S.bin (where x is the version number). This means that for a customer to launch Field Diagnostics, this image must be available on a separate Flash card or TFTP boot server. The latest version is always available on Cisco.com. For Performance Route Processor (PRP) cards, Gigabit switch Route Processor (GRP) cards, and fabric tests, these tests remain imbedded with the Cisco IOS software image. The command line features have been changed to reflect this.

While the diagnostic test is in progress, the line card does not function normally and is not able to pass any traffic for the duration of the test (5-20 minutes, based on the complexity of the line card). Without the verbose keyword, the command gives a truncated output that shows a Pass or Fail for the card. When you communicate with the TAC, the verbose mode is most helpful to identify specific problems. The output of the diagnostic test without the verbose command looks like this:

Router# diag 7 verbose tftp://223.255.254.254/muckier/award/c12k-fdiagsbflc-mz

Running DIAG config check
Fabric Download for Field Diags chosen: If timeout occurs, try 'mbus' option.
Running Diags will halt ALL activity on the requested slot.  [confirm]
Router#
Launching a Field Diagnostic for slot 7
Downloading diagnostic tests to slot 7 via fabric (timeout set to 300 sec.)
5d20h: %GRP-4-RSTSLOT: Resetting the card in the slot: 7,Event: 
   EV_ADMIN_FDIAGLoading muckier/award/c12k-fdiagsbflc-mz from 223.255.254.254 
   (via Ethernet0):      !!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!
5d20h: Downloading diags from tftp file tftp://223.255.254.254/muckier/award/
    c12k-fdiagsbflc-mz
    !!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!
[OK - 13976524 bytes]
FD 7> *****************************************************
FD 7> GSR Field Diagnostics V6.05

FD 7> Compiled by award on Tue Jul 30 13:00:41 PDT 2002

FD 7> view: award-conn_isp.FieldDiagRelease

FD 7> *****************************************************


Executing all diagnostic tests in slot 7

(total/indiv. timeout set to 2000/600 sec.)

FD 7> BFR_CARD_TYPE_OC12_4P_POS testing... 

FD 7> Available test types 2

FD 7> 1

FD 7> Completed f_diags_board_discovery() (0x1)

FD 7> Test list selection received: Test ID 1, Device 0

FD 7> running in slot 7 (30 tests from test list ID 1)

FD 7> Skipping MBUS_FDIAG command from slot 2

FD 7> Just into idle state
Field Diagnostic ****PASSED**** for slot 7

Shutting down diags in slot 7

Board will reload

5d20h: %GRP-4-RSTSLOT: Resetting the card in the slot: 7,Event: 
   EV_ADMIN_FDIAG


5d20h: %GRP-4-RSTSLOT: Resetting the card in the slot: 7,Event: 
   EV_FAB_DOWNLOADER_DOWNLOAD_FAILURE


SLOT 7:00:00:09: %SYS-5-RESTART: System restarted --

Cisco Internetwork Operating System Software 
IOS (tm) GS Software (GLC1-LC-M), Experimental Version 12.0(20020509:045149)      
  [award-conn_isp.f_diag_new 337]
Copyright (c) 1986-2002 by cisco Systems, Inc.
Compiled Tue 25-Jun-02 15:51 by award

The line card reloads automatically only after it passes the test.

Here is an example in which the Cisco IOS software release earlier than12.0(22)S, the line card failed the test and thus did not reload automatically. You can manually reload the line card with the hw-module slot <slot> reload command.

When you use the verbose keyword, the output includes each individual test that is performed. If the test PASSES, the next test is begun. A sample output looks like this:

Router# diag 7 verbose tftp tftp://223.255.254.254/
muckier/award/c12k-fdiagsbflc-mz

Running DIAG config check
Fabric Download for Field Diags chosen: If timeout occurs, try 'mbus' option.
Verbose mode: Test progress and errors will be displayed
Runnning Diags will halt ALL activity on the requested slot.  [confirm]
Router#
Launching a Field Diagnostic for slot 7
Downloading diagnostic tests to slot 7 via fabric (timeout set to 300 sec.)
00:07:41: %GRP-4-RSTSLOT: Resetting the card in the slot: 7,Event: EV_ADMIN_FDIAG

Loading muckier/award/c12k-fdiagsbflc-mz from 223.255.254.254 (via Ethernet0):      
 !!!!!! (...)

00:08:24: Downloading diags from tftp file tftp://223.255.254.254/muckier/
award/c12k-fdiagsbflc-mz

!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!
!!!!!!!!!!!!!!!

[OK - 13976524 bytes]

FD 7> *****************************************************

FD 7> GSR Field Diagnostics V6.05

FD 7> Compiled by award on Tue Jul 30 13:00:41 PDT 2002

FD 7> view: award-conn_isp.FieldDiagRelease

FD 7> *****************************************************

Executing all diagnostic tests in slot 7

(total/indiv. timeout set to 2000/600 sec.)

FD 7> BFR_CARD_TYPE_OC12_4P_POS testing... 

FD 7> Available test types 2

FD 7> 1

FD 7> Completed f_diags_board_discovery() (0x1)

FD 7> Verbosity now (0x00000011) TESTSDISP FATL
FD 7> Test list selection received: Test ID 1, Device 0
FD 7> running in slot 7 (30 tests from test list ID 1)
FD 7> Just into idle state
FDIAG_STAT_IN_PROGRESS(7): test #1 Dram Marching Pattern
FDIAG_STAT_IN_PROGRESS(7): test #2 Dram Datapins
FDIAG_STAT_IN_PROGRESS(7): test #3 Dram Busfloat
FDIAG_STAT_IN_PROGRESS(7): test #4 RBM SDRAM Marching Pattern
FDIAG_STAT_IN_PROGRESS(7): test #5 RBM SDRAM Datapins
FDIAG_STAT_IN_PROGRESS(7): test #6 RBM SSRAM Marching Pattern
FDIAG_STAT_IN_PROGRESS(7): test #7 RBM SSRAM Datapins Memory
FDIAG_STAT_IN_PROGRESS(7): test #8 TBM SDRAM Marching Pattern
FDIAG_STAT_IN_PROGRESS(7): test #9 TBM SDRAM Datapins
FDIAG_STAT_IN_PROGRESS(7): test #10 TBM SSRAM Marching Pattern
FDIAG_STAT_IN_PROGRESS(7): test #11 TBM SSRAM Datapins Memory
FDIAG_STAT_IN_PROGRESS(7): test #12 PSA TLU SDRAM Marching Pattern
FDIAG_STAT_IN_PROGRESS(7): test #13 PSA TLU SDRAM Datapins
FDIAG_STAT_IN_PROGRESS(7): test #14 PSA PLU SDRAM Marching Pattern
FDIAG_STAT_IN_PROGRESS(7): test #15 PSA PLU SDRAM Datapins
FDIAG_STAT_IN_PROGRESS(7): test #16 PSA SRAM Marching Pattern
FDIAG_STAT_IN_PROGRESS(7): test #17 PSA SRAM Datapins
FDIAG_STAT_IN_PROGRESS(7): test #18 To Fabric SOP FIFO SRAM Memory
FDIAG_STAT_IN_PROGRESS(7): test #19 From Fabric SOP FIFO SRAM Memory
FDIAG_STAT_IN_PROGRESS(7): test #20 RBM to SALSA Packet
FDIAG_STAT_IN_PROGRESS(7): test #21 TBM to SALSA Packet
FDIAG_STAT_IN_PROGRESS(7): test #22 RBM to TBM SLI Packet Loopback
FDIAG_STAT_IN_PROGRESS(7): test #23 TBM to PSA Packet -Framer Loopback
FDIAG_STAT_IN_PROGRESS(7): test #24 TBM to TX SOP Packet
FDIAG_STAT_IN_PROGRESS(7): test #25 TBM to RX SOP Packet -4302 Terminal Loopback
FDIAG_STAT_IN_PROGRESS(7): test #26 TBM to RX SOP Packet -Framer System Bus Loop
FDIAG_STAT_IN_PROGRESS(7): test #27 RBM to TBM Fabric Packet Loopback
FDIAG_STAT_IN_PROGRESS(7): test #28 TBM to RBM Packet, RBM page crossing
FDIAG_STAT_IN_PROGRESS(7): test #29 TBM to TX SOP Packet Simultaneous
FDIAG_STAT_IN_PROGRESS(7): test #30 TBM to PSA Multicast Packets -Framer Loopback
FDIAG_STAT_DONE(7)
FD 7> Changed current_status to FDIAG_STAT_IDLE

Field Diagnostic ****PASSED**** for slot 7

Field Diag eeprom values: run 62 fail mode 0 (PASS) slot 7


last test failed was 0, error code 0


Shutting down diags in slot 7

Board will reload

These results are then stored in an Electrically Erasable Programmable Read-Only Memory (EEPROM) on the line card. You can view the results of the last diagnostic performed on the line card with the diag <slot> previous command. Here is a sample output:

Router#diag 3 previous 

Field Diag eeprom values: run 0 fail mode 0 (PASS) slot 3 
   last test failed was 0, error code 0

If no previous field diagnostics have been performed on the card, the output looks like this:

Router#diag 3 previous 

Field Diags have not been run on this board previously - 

EE prom results uninitialized. 

Field Diag eeprom values: run 16777215 fail mode 0 (PASS) slot 9 
   last test failed was 65535, error code 65535

There have been some bugs in the past that have caused the diagnostic tests to fail even though the card is not faulty, so, as a precaution, if the line card fails and it has already been replaced previously, it would be useful to check this output with the Technical Assistance Center (TAC).

Cisco IOS Software Releases Earlier than 12.0(22)S

Line card Field Diagnostic software is bundled with the main Cisco IOS Software to enable you to test whether or not the suspect line card is faulty. To use this feature, you must be in privileged enable mode, and issue the diag <slot> <verbose> command.

While the diagnostic test is in progress, the line card does not function normally and is not able to pass any traffic for the duration of the test (5-15 minutes, based on the complexity of the line card). Without the verbose keyword, the command gives a truncated output that shows a Pass or Fail for the card. The output of the diagnostic test without the verbose command looks like this:

Router#diag 3 

Running DIAG config check 

Running Diags will halt ALL activity on the requested slot 

[confirm] 

Router# 
Launching a Field Diagnostic for slot 3 
Downloading diagnostic tests to slot 3 (timeout set to 600 sec.) 

*Nov 18 22:20:40.237: %LINK-5-CHANGED: Interface GigabitEthernet3/0, 

changed state to administratively down 

Field Diag download COMPLETE for slot 3 

FD 3> ***************************************************** 

FD 3> GSR Field Diagnostics V4.0 

FD 3> Compiled by award on Thu May 18 13:43:04 PDT 2000 

FD 3> view: award-conn_isp.FieldDiagRelease 

FD 3> ***************************************************** 

FD 3> BFR_CARD_TYPE_1P_GE testing... 

FD 3> running in slot 3 (83 tests)
Executing all diagnostic tests in slot 3 

(total/indiv. timeout set to 600/200 sec.) 

Field Diagnostic: ****TEST FAILURE**** slot 3: last test run 51, 

Fabric Packet Loopback, error 3 

Shutting down diags in slot 3
slot 3 done, will not reload automatically

The line card reloads automatically only after it passes the test. In the example above, the line card failed the test and thus did not reload automatically. You can manually reload the line card with the hw-module slot <slot> reload command.

When you use the verbose keyword, the output includes each individual test that is performed, and whether or not each test has passed or failed. Here is a sample output:

Router#diag 3 verbose 

Running DIAG config check 

Running Diags will halt ALL activity on the requested slot. 

[confirm] 

Router# 
Launching a Field Diagnostic for slot 3 
Downloading diagnostic tests to slot 3 (timeout set to 600 sec.) 
Field Diag download COMPLETE for slot 3 

FD 3> ***************************************************** 

FD 3> GSR Field Diagnostics V4.0 

FD 3> Compiled by award on Thu May 18 13:43:04 PDT 2000 

FD 3> view: award-conn_isp.FieldDiagRelease 

FD 3> ***************************************************** 

FD 3> BFR_CARD_TYPE_1P_GE testing... 

FD 3> running in slot 3 (83 tests)
Executing all diagnostic tests in slot 3 

(total/indiv. timeout set to 600/200 sec.) 

FD 3> Verbosity now (0x00000001) TESTSDISP
FDIAG_STAT_IN_PROGRESS(3): test #1 R5K Internal Cache 

FDIAG_STAT_IN_PROGRESS(3): test #2 Burst Operations 

FDIAG_STAT_IN_PROGRESS(3): test #3 Subblock Ordering 

FDIAG_STAT_IN_PROGRESS(3): test #4 P4/EEPROM Clock Speed Matching 

FDIAG_STAT_IN_PROGRESS(3): test #5 Dram Marching Pattern 

FDIAG_STAT_IN_PROGRESS(3): test #6 Dram Datapins 

FDIAG_STAT_IN_PROGRESS(3): test #7 Dram Busfloat 

FDIAG_STAT_IN_PROGRESS(3): test #8 To Fabric (RX) BMA SDRAM Marching Pattern 

FDIAG_STAT_IN_PROGRESS(3): test #9 To Fabric (RX) BMA SDRAM Datapins 

FDIAG_STAT_IN_PROGRESS(3): test #10 To Fabric (RX) BMA Q Manager SRAM Busfloat 

FDIAG_STAT_IN_PROGRESS(3): test #11 To Fabric (RX) BMA Q Manager SRAM Datapins 

FDIAG_STAT_IN_PROGRESS(3): test #12 To Fabric (RX) BMA Q Manager SRAM Marching Pa 

FDIAG_STAT_IN_PROGRESS(3): test #13 From Fabric (TX) BMA SDRAM Marching Pattern 

FDIAG_STAT_IN_PROGRESS(3): test #14 From Fabric (TX) BMA SDRAM Datapins 

FDIAG_STAT_IN_PROGRESS(3): test #15 From Fabric (TX) BMA Q Manager SRAM Busfloat 

FDIAG_STAT_IN_PROGRESS(3): test #16 From Fabric (TX) BMA Q Manager SRAM Datapins 

FDIAG_STAT_IN_PROGRESS(3): test #17 From Fabric (TX) BMA Q Manager SRAM Marching 

FDIAG_STAT_IN_PROGRESS(3): test #18 To Fabric SOP FIFO SRAM Memory 

FDIAG_STAT_IN_PROGRESS(3): test #19 From Fabric SOP FIFO SRAM Memory 

FDIAG_STAT_IN_PROGRESS(3): test #20 SALSA Asic Registers 

FDIAG_STAT_IN_PROGRESS(3): test #21 Salsa Dram Access 

FDIAG_STAT_IN_PROGRESS(3): test #22 Salsa P4 Timeout 

FDIAG_STAT_IN_PROGRESS(3): test #23 Salsa Asic General Purpose Counter 

FDIAG_STAT_IN_PROGRESS(3): test #24 Salsa Asic Real Time Interrupt 

FDIAG_STAT_IN_PROGRESS(3): test #25 Salsa Errors 

FDIAG_STAT_IN_PROGRESS(3): test #26 Salsa DRAM Burst Operations Error 

FDIAG_STAT_IN_PROGRESS(3): test #27 Salsa Dram Read Around Write 

FDIAG_STAT_IN_PROGRESS(3): test #28 Salsa Dram Write Parity Error test 

FDIAG_STAT_IN_PROGRESS(3): test #29 Salsa Prefetch/Write Buffers 

FDIAG_STAT_IN_PROGRESS(3): test #30 Salsa FrFab BMA SDram Read Around Write 

FDIAG_STAT_IN_PROGRESS(3): test #31 Salsa ToFab BMA SDram Read Around Write 

FDIAG_STAT_IN_PROGRESS(3): test #32 Salsa FrFab Network Interrupt Disable Timer 

FDIAG_STAT_IN_PROGRESS(3): test #33 Salsa ToFab Network Interrupt Disable Timer 

FDIAG_STAT_IN_PROGRESS(3): test #34 Salsa ToFab Network Interrupt Mask 

FDIAG_STAT_IN_PROGRESS(3): test #35 Salsa FrFab Network Interrupt Mask 

FDIAG_STAT_IN_PROGRESS(3): test #36 Salsa ToFab BMA Interrupt Mask 

FDIAG_STAT_IN_PROGRESS(3): test #37 Salsa FrFab BMA Interrupt Mask 

FDIAG_STAT_IN_PROGRESS(3): test #38 Salsa - To Fabric BMA Packet - Early Clear 

FDIAG_STAT_IN_PROGRESS(3): test #39 Salsa - From Fabric BMA Packet - Early Clear 

FDIAG_STAT_IN_PROGRESS(3): test #40 Salsa To Fabric SOP Interrupt Mask 

FDIAG_STAT_IN_PROGRESS(3): test #41 Salsa From Fabric SOP Interrupt Mask 

FDIAG_STAT_IN_PROGRESS(3): test #42 SALSA ECC Generation 

FDIAG_STAT_IN_PROGRESS(3): test #43 SALSA ECC Correction 

FDIAG_STAT_IN_PROGRESS(3): test #44 To Fabric FIA48 ASIC Registers 

FDIAG_STAT_IN_PROGRESS(3): test #45 To Fabric FIA48 Packet 

FDIAG_STAT_IN_PROGRESS(3): test #46 To Fabric FIA48 Asic BMA Bus Parity Error 

FDIAG_STAT_IN_PROGRESS(3): test #47 To Fabric FIA48 Asic CiscoCell Fifo Parity Er 

FDIAG_STAT_IN_PROGRESS(3): test #48 From Fabric FIA48 ASIC Registers 

FDIAG_STAT_IN_PROGRESS(3): test #50 SLI Packet Loopback 

FDIAG_STAT_IN_PROGRESS(3): test #51 Fabric Packet Loopback 

FD 3> INT_CAUSE_REG = 0x00000620
FD 3> Unexpected L3FE Interrupt occurred.
FD 3> ERROR: TX FIA48 Asic Interrupt Occurred
FD 3> *** 0-INT: External Interrupt ***
FD 3> Dumping out TX FIA Status Registers, Disabling
FD 3> TX FIA Interrupt, resetting Asics, continuing...
FDIAG_STAT_DONE_FAIL(3) test_num 51, error_code 3 

Field Diagnostic: ****TEST FAILURE**** slot 3: last test run 51, 

Fabric Packet Loopback, error 3 
Field Diag eeprom values: run 3 fail mode 1 (TEST FAILURE) slot 3 
   last test failed was 51, error code 3 

Shutting down diags in slot 3
slot 3 done, will not reload automatically
Router#

These results are then stored in an Electrically Erasable Programmable Read-Only Memory (EEPROM) on the line card. You can view the results of the last diagnostic performed on the line card with the diag <slot> previous command. Here is a sample output:

Router#diag 3 previous 

Field Diag eeprom values: run 0 fail mode 0 (PASS) slot 3 
   last test failed was 0, error code 0

If no previous field diagnostics have been performed on the card, the output looks like this:

Router#diag 3 previous 

Field Diags have not been run on this board previously - 

EE prom results uninitialized. 

Field Diag eeprom values: run 16777215 fail mode 0 (PASS) slot 9 
   last test failed was 65535, error code 65535

There have been some bugs in the past that have caused the diagnostic tests to fail even though the card is not faulty, so, as a precaution, if the line card fails and it has already been replaced previously, it would be useful to check this output with the Technical Assistance Center (TAC).

Information to Collect if You Open a TAC Service Request

If you have identified a component that needs to be replaced, contact your Cisco partner or reseller to request a replacement for the hardware component that is causing the issue. If you have a support contract directly with Cisco, use the TAC Service Request Tool (registered customers only) to open a TAC service request for a hardware replacement. Make sure you attach the following information:
  • Console captures showing the error messages
  • Console captures showing the troubleshooting steps taken and the boot sequence during each step
  • The hardware component that failed and the serial number for the chassis
  • Troubleshooting logs
  • Output from the show technical-support command

Related Information

Updated: Mar 09, 2009
Document ID: 15926