Cisco IOS XR Interface and Hardware Component Configuration Guide for the Cisco CRS-1 Router
Performing Cisco IOS XR Diagnostics



This module describes the configuration of Cisco IOS XR online diagnostics on the Cisco CRS-1 Router.

Cisco IOS XR online diagnostics allow you to test and verify hardware functionality while connected to a live network.

Feature History for Cisco IOS XR Diagnostics

Release / Modification

Release 3.3.0
    This feature was introduced on the Cisco CRS-1 Router.

Release 3.4.0
    Support was added for configuring health-monitoring diagnostics, including monitor syslog, monitor intervals, and per-test failure count thresholds.
    The output of the show diagnostic result and show diagnostic content commands was modified to include diagnostic monitoring support.
    The show diagnostic content command output for nodes in the FDIAG RUNNING state was expanded to provide more control over execution of offline test suites.
    The following online diagnostic tests were added:
        Control Ethernet Inactive Link Test
        Self-Ping over Fabric
        RommonRevision
        Fabric Diagnostic Test

Release 3.5.0
    Support was added for the following commands:
        diagnostic ondemand action-on-failure
        diagnostic ondemand iterations
        show diagnostic ondemand settings
        show diagnostic status
    Physical layer interface module (PLIM) testing support was added.
    Support was added for the following online diagnostic tests:
        File System Functionality Verification Test
        Scratch Register Test

Release 3.6.0
    No modification.

Release 3.6.2
    Support was added for the Fabric Multicast Diagnostic Test, and the Fabric Diagnostic Test was enhanced.

Release 3.7.0
    For the Cisco CRS-1 Router, the online diagnostics were moved from the Diagnostics package to the Base package. (The offline diagnostics remain in the Diagnostics package.)

Release 3.8.0
    Support was added for the following features on the Cisco CRS-1 Router:
        Test names instead of test numbers in the health-monitoring diagnostics.
        Minimal bootup diagnostics for the modular service card (MSC).
        Maintenance mode for the MSC.

Release 3.9.0
    No modification.


Contents

Prerequisites for Performing Diagnostics

Restrictions for Performing Diagnostics

Available Online Diagnostic Tests

Prerequisites for Performing Diagnostics

You must be in a user group associated with a task group that includes the proper task IDs. The command reference guides include the task IDs required for each command. If you suspect user group assignment is preventing you from using a command, contact your AAA administrator for assistance.

To perform online diagnostics on a Cisco CRS-1 Router, you must have the Base package loaded and activated on your router.

To perform offline diagnostics on a Cisco CRS-1 Router, you must have the Diagnostics package loaded and activated on your router.

Restrictions for Performing Diagnostics

The Cisco XR 12000 Series Router supports the following diagnostic command: diag. Although the diagnostic and show diagnostic commands appear in the help string on the Cisco XR 12000 Series Router, this router does not support these commands. If you attempt to execute the diagnostic and show diagnostic commands on a Cisco XR 12000 Series Router, they display an error message or do not provide the requested output.

The diag commands appear in the help string on the Cisco XR 12000 Series Router only if c12k-diags.pie is loaded and activated. If c12k-diags.pie is not loaded, entering "diag" at the prompt maps to the diagnostic commands, which are not supported on the Cisco XR 12000 Series Router.

Cisco CRS-1 online diagnostics do not support shared port adapters (SPAs).

For Cisco CRS-1 offline diagnostics, SPA, physical layer interface module (PLIM), and service processor (SP) support is available from an MSC node only. In other words, to run offline diagnostics on a SPA, PLIM, or SP, you must load the diagnostics on an R/S/CPU0 node.

Available Online Diagnostic Tests

The following online diagnostic tests are supported on the Cisco CRS-1 Router:

Control Ethernet Ping Test—A nondisruptive test that "pings" each Control Ethernet node in a chassis from the node where the test is started. The test sends a ping to one node at a time, waiting for a response or until the maximum timeout of 2 seconds is reached, before proceeding to send a ping to the next node. The returned ping response is verified by comparing it byte by byte with the sent ping. Pings are sent only to active nodes within the same chassis as the node from which the test is run. Only one ping is sent per node. Each ping has a 100-byte payload. The test result is PASS if all nodes return the ping and the response matches the sent ping.
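The sequential ping-and-verify procedure described above can be sketched as follows. This is an illustrative model only, not the router's implementation: the send_ping transport, the node names, and the result labels TIMEOUT and MISMATCH are assumptions introduced for the example. The payload size and timeout are taken from the description.

```python
import os
from typing import Callable, Dict, Iterable, Optional

PAYLOAD_SIZE = 100      # bytes per ping, per the description above
TIMEOUT_SECONDS = 2.0   # maximum wait per node, per the description above

# Hypothetical transport: sends the payload to a node and returns the echoed
# bytes, or None if nothing came back within the timeout.
SendPing = Callable[[str, bytes, float], Optional[bytes]]


def ping_sweep(nodes: Iterable[str], send_ping: SendPing) -> Dict[str, str]:
    """Ping each active node once, one node at a time, and classify the reply."""
    results: Dict[str, str] = {}
    for node in nodes:
        payload = os.urandom(PAYLOAD_SIZE)
        reply = send_ping(node, payload, TIMEOUT_SECONDS)
        if reply is None:
            results[node] = "TIMEOUT"
        elif reply == payload:          # bytes equality compares every byte
            results[node] = "PASS"
        else:
            results[node] = "MISMATCH"
    return results


def overall_result(results: Dict[str, str]) -> str:
    """PASS only if every node returned a ping that matches the one sent."""
    return "PASS" if all(r == "PASS" for r in results.values()) else "FAIL"
```

A single node that times out or returns altered bytes is enough to make the overall result FAIL, which matches the pass criterion stated above.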

Fabric Ping Test—A nondisruptive test that "pings" each node over fabric in a chassis from the node where the test is started. The test sends a ping to one node at a time, waiting for a response or until the maximum timeout of 2 seconds is reached, before proceeding to send a ping to the next node. The returned ping response is verified by comparing it byte by byte with the sent ping. Pings are sent only to active nodes within the same chassis as the node from which the test is run. Because the path that traffic takes through the fabric is nondeterministic, 72 pings are sent per node to maximize coverage. Each ping has a 1-kilobyte payload. The test result is PASS if all nodes return all the pings and all the responses match the sent pings. Because of the 2-second wait timeout for a lost ping, a node that is unreachable or intermittently working increases the total run time for the test. In the worst-case scenario, where a node has lost all fabric connectivity, the test time can increase by 2.5 minutes for that node. The total test time depends on how many active nodes there are in the tested rack and how many of the nodes have failing fabric connections.

Control Ethernet Inactive Link Test—A nondisruptive test that verifies the inactive Control Ethernet links between a standby route processor (RP) and all other nodes in the same rack. This test is available only on RPs and can be started only on the standby RP. The test sends a "ping" packet to a test target node, using the inactive Control Ethernet (CE) link. It expects three responses from each node:

One response that travels along the inactive CE return link, which verifies the inactive CE link from test target to standby RP

One response that travels along the active CE link through the active RP CPU to the standby RP, which verifies the external inactive CE link from the standby RP to the test target

One that travels along the active CE link to the standby RP, which verifies the internal test target CE path

Each returned response is verified by comparing it byte by byte with the sent ping. When either all three responses from the test target have returned, or a timeout of 2 seconds has expired, the test repeats this procedure for the next node in the rack until all nodes connected to inactive CE links are tested. The test result is PASS if all nodes return all the pings and all the responses match the sent pings. This test can be used to "qualify" the CE connectivity of the standby RP prior to a switchover.
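The relationship between the three expected responses and the path each one verifies can be sketched as a lookup. The response labels below are hypothetical names introduced for the example, and the path descriptions paraphrase the list above. This is not the router's implementation.

```python
from typing import Dict, Iterable, List

# Keys are hypothetical labels for the three expected responses; values
# paraphrase the path that each response verifies, per the list above.
EXPECTED_RESPONSES: Dict[str, str] = {
    "inactive_return": "inactive CE link from test target to standby RP",
    "active_via_active_rp": "external inactive CE link from standby RP to test target",
    "active_direct": "internal test target CE path",
}


def unverified_paths(received: Iterable[str]) -> List[str]:
    """Return the paths for which no matching response arrived in time."""
    seen = set(received)
    return [path for kind, path in EXPECTED_RESPONSES.items() if kind not in seen]


def node_result(received: Iterable[str]) -> str:
    """A node passes only if all three responses were received and verified."""
    return "FAIL" if unverified_paths(received) else "PASS"
```

In this model, a missing response identifies the path that remains unverified for that node, which is why three responses are expected rather than one.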

Self-Ping over Fabric Test—A nondisruptive test that lets a node "ping" itself over the fabric. The test sends a fabric ping to the node itself, waiting for a response or until the maximum timeout of 2 seconds is reached. The returned ping response is verified by comparing it byte by byte with the sent ping. Each ping has a 100-byte payload. This single ping is repeated 100 times with an interval of 300 ms between each ping. The test result is PASS if all the pings are returned and all the responses match the sent pings. The normal run time for this test is 30 seconds. Due to the 2-second wait timeout for a lost ping, the test time for a node that has failing fabric connectivity may be increased by up to 3.5 minutes.
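The run-time figures quoted for the Fabric Ping Test and the Self-Ping over Fabric Test are consistent with simple arithmetic on the per-ping timeout. The sketch below assumes that each lost ping costs exactly one full 2-second timeout and nothing else. That is a simplification for illustration, and the function name is hypothetical.

```python
PING_TIMEOUT_MS = 2000        # 2-second wait for a lost ping (from the text)
SELF_PING_COUNT = 100         # pings per Self-Ping over Fabric run (from the text)
SELF_PING_INTERVAL_MS = 300   # interval between self-pings (from the text)
FABRIC_PINGS_PER_NODE = 72    # pings per node in the Fabric Ping Test (from the text)


def added_wait_seconds(lost_pings: int) -> float:
    """Extra time spent waiting if each lost ping hits the full timeout."""
    return lost_pings * PING_TIMEOUT_MS / 1000


# Normal Self-Ping run time: 100 pings spaced 300 ms apart.
self_ping_normal_s = SELF_PING_COUNT * SELF_PING_INTERVAL_MS / 1000   # 30.0

# Worst case, with every ping lost.
fabric_ping_worst_s = added_wait_seconds(FABRIC_PINGS_PER_NODE)       # 144.0
self_ping_worst_s = added_wait_seconds(SELF_PING_COUNT)               # 200.0
```

Under this assumption, the worst-case additions come to about 2.4 minutes per failed node for the Fabric Ping Test and about 3.3 minutes for the Self-Ping over Fabric Test, slightly below the quoted limits of 2.5 and 3.5 minutes.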

RommonRevision Test—A nondisruptive test that verifies that a node is running the minimally acceptable version of ROM Monitor (ROMMON). When a node is rebooted, its current version of ROMMON is retrieved and saved in a shared memory space. This shared memory space is queried for the running version of ROMMON and that version is compared to the minimally acceptable version of ROMMON. If the running version is not greater than or equal to the minimal version, the test fails.
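The pass criterion above, running version greater than or equal to the minimum version, can be sketched as a numeric comparison. The sketch assumes dotted numeric version strings. The actual ROMMON version format and the minimum value are not given here, so the function names and sample values are hypothetical.

```python
from typing import Tuple


def parse_version(text: str) -> Tuple[int, ...]:
    """Turn a dotted numeric version string such as '1.54' into (1, 54)."""
    return tuple(int(part) for part in text.strip().split("."))


def rommon_revision_ok(running: str, minimum: str) -> bool:
    """True if the running version is greater than or equal to the minimum."""
    a, b = parse_version(running), parse_version(minimum)
    width = max(len(a), len(b))
    # Pad with zeros so that '1.5' and '1.5.0' compare as equal.
    a += (0,) * (width - len(a))
    b += (0,) * (width - len(b))
    return a >= b
```

Comparing the fields as integers rather than as text matters: as plain strings, "1.54" would sort before "1.9", although 54 is the later revision.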

File System Functionality Verification Test—A nondisruptive test that verifies a basic ability to create, write, read, and delete files on the hard disk or the flash disks. The test generates a PASS result if no fault is detected on the disks under test. If a fault is detected, the location is identified to a single disk device. The test is run on route processor (RP) and distributed route processor (DRP) nodes.
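A create, write, read, and delete check of the kind described above can be sketched on a general-purpose system as follows. This illustrates the technique only and is not the router's test. The function name, scratch-file prefix, and 4-KB data size are assumptions.

```python
import os
import tempfile


def filesystem_check(directory: str) -> str:
    """Create, write, read back, and delete a scratch file in `directory`."""
    data = os.urandom(4096)          # arbitrary test data; the size is an assumption
    path = None
    try:
        fd, path = tempfile.mkstemp(dir=directory, prefix="fsdiag_")
        with os.fdopen(fd, "wb") as handle:
            handle.write(data)
            handle.flush()
            os.fsync(handle.fileno())    # push the data to the device
        with open(path, "rb") as handle:
            if handle.read() != data:
                return "FAIL"
        os.remove(path)
        path = None
        return "PASS"
    except OSError:
        return "FAIL"
    finally:
        # Best-effort cleanup if the check stopped before the delete step.
        if path is not None and os.path.exists(path):
            try:
                os.remove(path)
            except OSError:
                pass
```

Running the check once per disk, with a directory on that disk as the argument, is one way to isolate a fault to a single device.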

Scratch Register Test—A nondisruptive test that checks whether the ASIC under test can correctly write values to, and read them back from, selected internal locations. The locations selected are those whose values have no effect on the current proper functioning of the ASIC. The test is run on RPs, DRPs, and line cards (LCs). The test generates a PASS result if no fault is detected in the registers under test. If a fault is detected, the location is identified to a single ASIC.
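The write-and-read-back check described above can be sketched as follows. The register accessors, the 32-bit width, and the bit patterns are assumptions for the example. Restoring the original value afterward is also an assumption, not something the description states.

```python
from typing import Callable, Iterable, Sequence

# Assumed 32-bit test patterns: all zeros, all ones, and alternating bits.
PATTERNS: Sequence[int] = (0x00000000, 0xFFFFFFFF, 0xAAAAAAAA, 0x55555555)


def scratch_register_check(
    write: Callable[[int, int], None],   # hypothetical accessor: write(address, value)
    read: Callable[[int], int],          # hypothetical accessor: read(address) -> value
    addresses: Iterable[int],
    patterns: Sequence[int] = PATTERNS,
) -> str:
    """Write each pattern to each scratch location and verify the readback."""
    for address in addresses:
        original = read(address)
        try:
            for pattern in patterns:
                write(address, pattern)
                if read(address) != pattern:
                    return "FAIL"
        finally:
            write(address, original)     # restore the previous value (assumption)
    return "PASS"
```

Alternating-bit patterns are a common choice for this kind of check because a bit stuck at 0 or 1 produces a mismatch on at least one of them.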

Fabric Diagnostic Test—A nondisruptive, fault isolation test that unicast pings each RP/LC/DRP node in a chassis from the standby RP node using a unicast address. The test steers unicast ping test packets through different fabric planes, aggregates unicast ping (PASS or FAIL) results with fabric plane information, analyzes these results, and points out the most logical point of failure (if any) in the chassis. The test can be executed only from the standby RP as one of the following test types:

On demand

Scheduled

Configured to monitor health

Executing this test in a Cisco CRS-1 multishelf system may help determine which fabric stage (S1, S2, or S3) is the most logical point of failure for unicast in the system.

This test must be run on the standby RP of each LC rack in the system. For example, if the test reports failures on multiple LC racks, and the failure information points to the same fabric plane, then the most likely point of failure is the S2 stage, which is located on a card in the fabric chassis of the system.
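The reasoning in the example above can be sketched as a simple aggregation: group failed pings by fabric plane, then flag any plane on which more than one LC rack reported failures. The record layout, names, and rule below are a simplified illustration of that one example, not the analysis the router actually performs.

```python
from collections import defaultdict
from typing import Dict, Iterable, List, Set, Tuple

# Hypothetical record layout: (rack, destination node, fabric plane, passed)
PingResult = Tuple[str, str, int, bool]


def racks_failing_per_plane(results: Iterable[PingResult]) -> Dict[int, Set[str]]:
    """Map each fabric plane to the set of racks that reported a failure on it."""
    failing: Dict[int, Set[str]] = defaultdict(set)
    for rack, _node, plane, passed in results:
        if not passed:
            failing[plane].add(rack)
    return dict(failing)


def planes_suspect_s2(results: Iterable[PingResult]) -> List[int]:
    """Planes on which more than one LC rack reported failures."""
    per_plane = racks_failing_per_plane(results)
    return sorted(plane for plane, racks in per_plane.items() if len(racks) > 1)
```

In this model, a plane that fails from only one rack is not flagged, because the fault may lie in that rack rather than in the stage shared between racks.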

Fabric Multicast Diagnostic Test—A nondisruptive, fault isolation test that "multicast pings" each RP/LC/DRP node in a chassis from the standby RP node using a multicast address. The test steers multicast ping test packets through different fabric planes, aggregates multicast ping (PASS or FAIL) results with fabric plane information, analyzes these results, and points out the most logical point of failure (if any) in the chassis. The test can be executed only from the standby RP as one of the following test types:

On demand

Scheduled

Configured to monitor health

Executing this test in a Cisco CRS-1 multishelf system may help to determine which fabric stage (S1, S2, or S3) is the most logical point of failure for multicast in the system.

This test must be run on the standby RP of each LC rack in the system. For example, if the test reports failures on multiple LC racks, and the failure information points to the same fabric plane, then the most likely point of failure is the S2 stage, which is located on a card in the fabric chassis of the system.

Additional References

The following sections provide references related to Cisco IOS XR diagnostics.

Related Documents

Related Topic: Diagnostics commands (complete command syntax, command modes, command history, defaults, usage guidelines, and examples)

Document Title: Cisco IOS XR Interface and Hardware Component Command Reference


Standards

No new or modified standards are supported by this feature, and support for existing standards has not been modified by this feature.


MIBs

There are no applicable MIBs for this module.

To locate and download MIBs for selected platforms using Cisco IOS XR Software, use the Cisco MIB Locator found at the following URL:

http://cisco.com/public/sw-center/netmgmt/cmtk/mibs.shtml


RFCs

No new or modified RFCs are supported by this feature, and support for existing RFCs has not been modified by this feature.


Technical Assistance

Description: The Cisco Technical Support website contains thousands of pages of searchable technical content, including links to products, technologies, solutions, technical tips, and tools. Registered Cisco.com users can log in from this page to access even more content.

Link: http://www.cisco.com/techsupport