Cisco 10700 Series Routers

Field Notice: *Expired* FN - 18667 - TM DATA PARITY ERROR System Crash


Revised April 18, 2008

March 22, 2002


NOTICE:

THIS FIELD NOTICE HAS BEEN EXPIRED AND IS NO LONGER MAINTAINED OR UPDATED BY CISCO.

THIS FIELD NOTICE IS PROVIDED ON AN "AS IS" BASIS AND DOES NOT IMPLY ANY KIND OF GUARANTEE, WARRANTY OR SUPPORT. USE OF THE INFORMATION ON THIS FIELD NOTICE OR MATERIALS LINKED FROM THIS FIELD NOTICE IS AT YOUR OWN RISK. CISCO RESERVES THE RIGHT TO CHANGE OR UPDATE THIS FIELD NOTICE AT ANY TIME.

Products Affected

Product              Top Assembly             Printed Circuit Assembly
                     Part Number     Rev.     Part Number     Rev.
CISCO10720-AC-A(=)   800-15309-02    All      73-5349-03      All
CISCO10720-DC-A(=)   800-15310-02    All      73-5349-03      All

Problem Description

The 10720 main card can experience an internal parity error which results in a system crash.

Background

An internal timing problem on an ASIC can cause a parity error, which in turn causes the system to crash. This problem is now being screened in manufacturing. Cisco estimates that fewer than 10 percent of units in the field will exhibit this problem.

Problem Symptoms

The 10720 router will crash with one of the following MISTRAL ERRORS: MISTRAL_TM_NPP_PAR_ERR_REG_MASK_HI: 41, MISTRAL_IBL_CRC_ERR_INT: 20, or MISTRAL_TM_DATA_PAR_ERR_REG_MASK_HI, which is shown in the example below.

Feb 27 07:19:15.862: %Camr-3-MISTRAL_ERROR:
MISTRAL_TM_DATA_PAR_ERR_REG_MASK_HI: 42
-Traceback= 50222240 501D3690
Feb 27 07:19:15.866: %Camr-3-INTPROC: Process Traceback= 50456BF8 501D7CA8 501D4868 50010A1C 
-Traceback= 50221E90 5022233C 501D3690
Feb 27 07:19:17.430:
%SYS-3-LOGGER_FLUSHED: System was paused for 00:00:00 to ensure console debugging output.
=== Start of Crashinfo Collection (07:19:17 MET Wed Feb 27 2002) ===
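When reviewing saved console logs or crashinfo files, it can help to search for the three error names listed above. The following is a minimal sketch in Python, not a Cisco tool. The function name find_fn18667_errors is hypothetical, and the script assumes each error name is followed by a colon and a number, as in the example above.

```python
import re

# Error names copied from the Problem Symptoms text of this field notice.
FN_ERROR_NAMES = (
    "MISTRAL_TM_NPP_PAR_ERR_REG_MASK_HI",
    "MISTRAL_IBL_CRC_ERR_INT",
    "MISTRAL_TM_DATA_PAR_ERR_REG_MASK_HI",
)

# Match any of the names, optionally followed by ": <number>".
_ERROR_RE = re.compile(
    r"\b(" + "|".join(FN_ERROR_NAMES) + r")\b(?::\s*(\d+))?"
)


def find_fn18667_errors(log_text):
    """Return a list of (error_name, number_or_None) found in log_text.

    Hypothetical helper: the number is whatever follows the error name
    in the log line; it is not interpreted here.
    """
    hits = []
    for match in _ERROR_RE.finditer(log_text):
        number = int(match.group(2)) if match.group(2) else None
        hits.append((match.group(1), number))
    return hits


if __name__ == "__main__":
    sample = (
        "Feb 27 07:19:15.862: %Camr-3-MISTRAL_ERROR:\n"
        "MISTRAL_TM_DATA_PAR_ERR_REG_MASK_HI: 42\n"
        "-Traceback= 50222240 501D3690\n"
    )
    for name, number in find_fn18667_errors(sample):
        print(name, number)
```

A match only indicates that the log contains one of the listed error names. It does not by itself confirm that the hardware falls within the affected range.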

  

Depending on the settings in the Cisco 10720 configuration register, the 10720 may return to service on its own or it may crash and stay in ROMMON, requiring user intervention to bring it back into service. A value of 0x2102 (which is the default value) will allow the 10720 to reboot without user intervention. A value of 0x0 will cause the 10720 to stay in ROMMON once it has crashed. To view the current value of the configuration register, use the show version command. In the example shown below, the configuration register has a value of 0x0 which will cause the 10720 to stay in ROMMON once it has crashed:

lab1#show version
Cisco Internetwork Operating System Software 
IOS (tm) 10700 Software (C10700-P-M), Experimental Version 12.0(20011101:144431) [tpaiemen-yb_isp 196]
Copyright (c) 1986-2001 by cisco Systems, Inc.
Compiled Fri 14-Dec-01 16:20 by tpaiemen
Image text-base: 0x50010960, data-base: 0x50672000

ROM: System Bootstrap, Version 12.0(20010529:144545) [yuwang-rommon1 149], DEVELOPMENT SOFTWARE

lab1 uptime is 2 days, 3 hours, 19 minutes
System returned to ROM by power-on
Running default software

cisco C10720 (R5000) processor (revision 0xFF) with 256000K/6144K bytes of memory.
R527x CPU at 200Mhz, Implementation 40, Rev 10.0
Last reset from power-on
Toaster processor tmc0 is running.
Toaster processor tmc1 is running.
1 one-port OC48 SONET based SRP controller.
1 24 Port 100 Mbps Fast Ethernet TX controller.
24 FastEthernet/IEEE 802.3 interface(s)
1 SRP network interface(s)
509K bytes of non-volatile configuration memory.

16384K bytes of Flash internal SIMM (Sector size 512KB).
49152K bytes of Flash internal SIMM (Sector size 512KB).
Configuration register is 0x0
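For sites that collect show version output from many routers, the check described above can be scripted. The following is a minimal sketch in Python. The function name crash_behavior is hypothetical, and the script only distinguishes the two register values discussed in this field notice; any other value is reported as not covered rather than interpreted.

```python
import re

# Matches the last line of the "show version" example above.
_CONFREG_RE = re.compile(
    r"^Configuration register is (0x[0-9A-Fa-f]+)", re.MULTILINE
)


def crash_behavior(show_version_output):
    """Describe post-crash behavior based on the configuration register.

    Hypothetical helper. Returns None if no configuration register
    line is found in the supplied text.
    """
    match = _CONFREG_RE.search(show_version_output)
    if match is None:
        return None
    value = int(match.group(1), 16)
    if value == 0x2102:
        # Default value per this field notice.
        return "reboots without user intervention"
    if value == 0x0:
        return "stays in ROMMON"
    # Other values are outside what this field notice describes.
    return "not covered by this field notice"


if __name__ == "__main__":
    sample = (
        "49152K bytes of Flash internal SIMM (Sector size 512KB).\n"
        "Configuration register is 0x0\n"
    )
    print(crash_behavior(sample))
```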

  

Workaround/Solution

Replace the affected hardware.

New products are now being manufactured under deviation D055938 and are guaranteed to be free of this problem. Refer to How to Identify Hardware Levels below for instructions on how to view the deviation number of in-service product. Product shipped from Service Logistics may still exhibit this problem. Coding return materials authorizations (RMAs) as shown below guarantees a replacement which is not affected by this problem.

Service Level: Mfg New - 3rd Bus Day NON BILLABLE

Failure Class: Administrative Request

Failure Code: Field Notice Alert

How To Identify Hardware Levels

With Cisco IOS® Software Releases 12.0(21)SP, 12.0(21)ST, or later, use the show diags command to view the uplink card 800-level part number. In the example shown below, the uplink card has an 800-level part number of 800-15309-02 and a revision of B0, which falls within the affected range listed in the Products Affected section. However, the card has been manufactured under deviation D055938, which indicates that it is not affected by the problem described in this field notice.

yb#sh diags
Mainboard:
MAIN: type 0x0001, 800-15309-02 rev B0 dev D055938
SW key: 00-00-00 S/N CAT054701S6
Test hist: 0x00(no failure) RMA#: 000000
RMA hist: 0 upgrades 0 field failures
PCA: 73-5349-03 rev A0 fab ver 3
DIAG: Test count: 0x00000000 Test results: 0x00000000
Van Allen Memory Size: 64 MB
Toaster Memory Size - Column 1: 32 MB
Toaster Memory Size - Column 2: 128 MB
Toaster Memory Size - Column 3: 32 MB
Toaster Memory Size - Column 4: 32 MB
Toaster Memory Size - Column 5: 32 MB
Toaster Memory Size - Column 6: 32 MB
Toaster Memory Size - Column 7: 32 MB
Toaster Memory Size - Column 8: 32 MB
Main FPGA ver: 0x0003
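The comparison described above (part number within the affected range, deviation D055938 present or absent) can be automated for captured show diags output. The following is a minimal sketch in Python. The function name check_main_board is hypothetical, and the script assumes the MAIN line has the layout shown in the example; how the line appears on a board without a deviation is not documented here, so the dev field is treated as optional.

```python
import re

# Top assembly part numbers from the Products Affected section.
AFFECTED_TOP_ASSEMBLIES = {"800-15309-02", "800-15310-02"}
# Deviation under which screened product is manufactured.
FIXED_DEVIATION = "D055938"

# Assumed layout: "MAIN: type 0x0001, 800-15309-02 rev B0 dev D055938"
_MAIN_RE = re.compile(
    r"MAIN:\s+type\s+\S+,\s+(\d+-\d+-\d+)\s+rev\s+(\S+)(?:\s+dev\s+(\S+))?"
)


def check_main_board(show_diags_output):
    """Parse the MAIN line and flag boards that may be affected.

    Hypothetical helper. Returns None if no MAIN line is found.
    """
    match = _MAIN_RE.search(show_diags_output)
    if match is None:
        return None
    part_number, revision, deviation = match.groups()
    in_affected_range = part_number in AFFECTED_TOP_ASSEMBLIES
    screened = deviation == FIXED_DEVIATION
    return {
        "part_number": part_number,
        "revision": revision,
        "deviation": deviation,
        "potentially_affected": in_affected_range and not screened,
    }


if __name__ == "__main__":
    sample = (
        "Mainboard:\n"
        "MAIN: type 0x0001, 800-15309-02 rev B0 dev D055938\n"
    )
    print(check_main_board(sample))
```

A result with potentially_affected set to True only means the board is in the affected part number range and does not show deviation D055938. It does not mean the board will exhibit the problem.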

  

For More Information

If you require further assistance, or if you have any further questions regarding this field notice, please contact the Cisco Systems Technical Assistance Center (TAC).

Receive Email Notification For New Field Notices

Product Alert Tool - Set up a profile to receive email updates about reliability, safety, network security, and end-of-sale issues for the Cisco products you specify.