Field Notice: FN - 63362 - Mis-programmed N20-AC0002 and N20-AQ0002 Mezzanine Cards Causing Blade Discovery Failure

November 2, 2010


NOTICE:

THIS FIELD NOTICE IS PROVIDED ON AN "AS IS" BASIS AND DOES NOT IMPLY ANY KIND OF GUARANTEE OR WARRANTY, INCLUDING THE WARRANTY OF MERCHANTABILITY. YOUR USE OF THE INFORMATION ON THE FIELD NOTICE OR MATERIALS LINKED FROM THE FIELD NOTICE IS AT YOUR OWN RISK. CISCO RESERVES THE RIGHT TO CHANGE OR UPDATE THIS FIELD NOTICE AT ANY TIME.

Revision History

Revision  Date         Comment
1.0       02-Nov-2010  Initial Public Release

Products Affected

N20-AC0002
N20-AC0002=
N20-AQ0002

Problem Description

Some Cisco M81KR Virtual Interface and QLogic M71KR-Q CNA mezzanine cards prevent discovery of UCS blades because their card type was programmed incorrectly.

Affected units can be identified using the procedure in the How to Identify Hardware Levels section of this Field Notice.

Background

Several N20-AC0002 and N20-AQ0002 mezzanine cards shipped during July 2010 were mis-programmed with an incorrect card type. Depending on the UCS Manager version that is running, this can make a blade undiscoverable.

Problem Symptoms

UCS Manager does not discover a UCS blade that has an affected mezzanine adapter installed. The failure occurs only with certain combinations of UCS Manager release and affected hardware.

UCSM release 1.2 and earlier:

  • N20-AQ0002 will cause the problem
  • N20-AC0002 will cause the problem

UCSM release 1.3:

  • N20-AQ0002 will cause the problem
  • N20-AC0002 will not cause the problem
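
The release and card combinations above can be captured in a small lookup. The following is a minimal sketch, not a Cisco tool: the function names `parse_release` and `blocks_discovery` are hypothetical, the card is assumed to be one of the mis-programmed units, and releases later than 1.3 are rejected because this notice does not list symptoms for them.

```python
from typing import Tuple

# PIDs named in this Field Notice.
AFFECTED_PIDS = {"N20-AC0002", "N20-AQ0002"}


def parse_release(release: str) -> Tuple[int, int]:
    """Turn a release string such as '1.2(1d)' or '1.3' into (major, minor)."""
    major, minor = release.split("(")[0].split(".")[:2]
    return int(major), int(minor)


def blocks_discovery(pid: str, ucsm_release: str) -> bool:
    """Return True if a mis-programmed card with this PID is expected to
    prevent blade discovery on the given UCSM release, per the lists above.

    Assumption: the card is one of the mis-programmed units. A correctly
    programmed card of the same PID is not covered by this check.
    """
    pid = pid.rstrip("=")  # treat the spare PID N20-AC0002= like N20-AC0002
    if pid not in AFFECTED_PIDS:
        return False
    version = parse_release(ucsm_release)
    if version <= (1, 2):
        return True  # both card types cause the problem
    if version == (1, 3):
        return pid == "N20-AQ0002"  # only the QLogic card causes the problem
    # The notice does not describe symptoms for releases after 1.3.
    raise ValueError(f"release {ucsm_release} is not covered by this notice")
```

For example, `blocks_discovery("N20-AC0002", "1.3(1n)")` returns `False`, while `blocks_discovery("N20-AQ0002", "1.3(1n)")` returns `True`.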

[Image not available: Cisco M81KR Virtual Interface with UCSM version 1.2 failing discovery]

[Image not available: QLogic M71KR-Q CNA failing discovery]

Workaround/Solution

First, confirm whether you have affected N20-AC0002 or N20-AQ0002 mezzanine cards using the procedure in the How to Identify Hardware Levels section.

If N20-AC0002 (UCS M81KR Virtual Interface) is the only affected mezzanine card, you may use either of the following options:

  1. Use UCS Manager release 1.3 or later.
  2. Use UCS Manager release 1.2 after running the FixFrus Debug Plug-in procedure described in this document.

If N20-AQ0002 (M71KR-Q QLogic CNA) is among the affected mezzanine cards, you should use UCS Manager release 1.3, or use UCS Manager release 1.2 after running the FixFrus Debug Plug-in procedure.

If you are running UCS Manager release 1.1 and cannot upgrade, the affected mezzanine card must be fixed manually. Contact the Cisco Technical Assistance Center (TAC) if this is required.

FixFrus Debug Plug-in

This debug tool checks for and fixes the mis-programmed card_type on the M71KR-Q QLogic CNA and M81KR Cisco Virtual Interface mezzanine cards identified by this Field Notice.

Supported UCSM Version

This debug plug-in works with UCS Manager software versions 1.2(1x) and 1.3(1x).

Caveat

To fix a mis-programmed mezzanine interface on a server, the server must first be decommissioned manually. This is a service-disruptive activity.

How to Use
  1. Download ucs-fixfrus-dplug.1.0.1.0.gbin onto the Fabric-Interconnect. The debug plug-in can be downloaded using the following steps:
    1. Navigate to http://www.cisco.com, and log in with your Cisco.com user id.
    2. Navigate to Support > Download Software.
    3. Select:
      Products > Unified Computing > Cisco UCS Manager > Unified Computing System (UCS) Complete Software Bundle
    4. The FixFrus debug plug-in can be found in either the 1.2(1d) folder or the 1.3(1n) folder. The files are named differently in each folder, but the image is the same, so either file works with either release. The FixFrus debug plug-in is named:
      ucs-fixfrus-dplug.1.0.1.1-1.2.1.gbin
      or
      ucs-fixfrus-dplug.1.0.1.1-1.3.1.gbin
    5. Download one of these files and place it in a location accessible to UCS Manager.
  2. Load the plug-in. When it runs, it detects affected mezzanine cards and fixes them after prompting the user for verification.

Example download/copy operation:

SAM-APTOS-A#
SAM-APTOS-A# connect local-mgmt
Cisco UCS 6100 Series Fabric Interconnect

TAC support: http://www.cisco.com/tac

Copyright (c) 2009, Cisco Systems, Inc. All rights reserved.

The copyrights to certain works contained herein are owned by
other third parties and are used and distributed under license.
Some parts of this software may be covered under the GNU Public
License or the GNU Lesser General Public License. A copy of
each such license is available at
http://www.gnu.org/licenses/gpl.html and
http://www.gnu.org/licenses/lgpl.html
SAM-APTOS-A(local-mgmt)#
SAM-APTOS-A(local-mgmt)# copy scp://user1@10.193.175.2/homesw2/ssahoo/ucs-fixfrus-dplug.1.0.1.0.gbin .
user1@10.193.175.2's password:
ucs-fixfrus-dplug.1.0.1.0.gbin 100% 2782KB 2.7MB/s 00:00
SAM-APTOS-A(local-mgmt)#
SAM-APTOS-A(local-mgmt)# copy workspace:ucs-fixfrus-dplug.1.0.1.0.gbin volatile:x

Example Plug-in Load/Run operation:

SAM-APTOS-A(local-mgmt)# load-debug-plugin volatile:x
Loading plugin version 1.0(1.0)
###############################################################
Warning: debug-plugin is for engineering internal use only!
For security reason, plugin image has been deleted.
###############################################################

This is a maintenance plug-in designed to fix badly programmed blade adapters.
This plug-in will only modify adapters on decommissioned servers.
(V)erify only, Verify and (F)ix, or (E)xit ? ==> V
Checking FRU's for all servers...

Chassis/Blade MezzanineId PID SERIAL CARD_TYPE/CHECKSUM
1/1 1 N20-AC0002 QCI133201I3 OK
1/2 1 N20-AQ0002 QCI1405A4PC OK
1/3 1 N20-AC0002 QCI133201FP NOT-OK 06(03)
1/3 3 N20-AC0002 QCI1421A3QO NOT-OK 06(03)
1/5 1 N20-AQ0002 QCI13060020 OK
1/6 1 N20-AE0002 EXM132300E8 OK

This is a maintenance plug-in designed to fix badly programmed blade adapters.
This plug-in will only modify adapters on decommissioned servers.
(V)erify only, Verify and (F)ix, or (E)xit ? ==> F
Checking and fixing FRU's for decommissioned servers...

Chassis/Blade MezzanineId PID SERIAL CARD_TYPE/CHECKSUM

This is a maintenance plug-in designed to fix badly programmed blade adapters.
This plug-in will only modify adapters on decommissioned servers.

Note: This debug tool fixes mis-configured FRUs only on servers that are decommissioned. To decommission a server, select the server in the UCSM GUI, then choose Server Maintenance > Decommission. Once the mis-configured FRU has been fixed, the user needs to "Reacknowledge slot" for that server.

(V)erify only, Verify and (F)ix, or (E)xit ? ==> F
Checking and fixing FRU's for decommissioned servers...

Chassis/Blade MezzanineId PID SERIAL CARD_TYPE/CHECKSUM
1/3 1 N20-AC0002 QCI123456FP NOT-OK 06(03)
1/3 3 N20-AC0002 QCI1234A3QO NOT-OK 06(03)
(Fixed 2 devices)

This is a maintenance plug-in designed to fix badly programmed blade adapters.
This plug-in will only modify adapters on decommissioned servers.
(V)erify only, Verify and (F)ix, or (E)xit ? ==>E
Exiting...
SAM-APTOS-A(local-mgmt)#
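
When many chassis are involved, it can help to pull the NOT-OK rows out of a saved copy of the Verify output so that the blades to decommission are known before the Fix pass. The following is a rough sketch under stated assumptions: the names `FruRow`, `parse_fru_rows`, and `blades_to_decommission` are hypothetical and not part of the plug-in, the columns are assumed to be whitespace-separated in the order shown in the transcript above, and the trailing value on NOT-OK rows (for example `06(03)`) is kept as an opaque string because this notice does not define it.

```python
import re
from typing import List, NamedTuple, Set, Tuple


class FruRow(NamedTuple):
    chassis: int
    blade: int
    mezz_id: int
    pid: str
    serial: str
    status: str   # "OK" or "NOT-OK"
    detail: str   # trailing value on NOT-OK rows; meaning not defined here


# Assumed row layout: chassis/blade, mezzanine id, PID, serial, status, detail.
ROW_RE = re.compile(
    r"^(\d+)/(\d+)\s+(\d+)\s+(\S+)\s+(\S+)\s+(OK|NOT-OK)(?:\s+(\S+))?\s*$"
)


def parse_fru_rows(text: str) -> List[FruRow]:
    """Extract FRU rows from saved plug-in output, skipping all other lines."""
    rows = []
    for line in text.splitlines():
        match = ROW_RE.match(line.strip())
        if match:
            c, b, mz, pid, serial, status, detail = match.groups()
            rows.append(
                FruRow(int(c), int(b), int(mz), pid, serial, status, detail or "")
            )
    return rows


def blades_to_decommission(rows: List[FruRow]) -> Set[Tuple[int, int]]:
    """Return (chassis, blade) pairs that have at least one NOT-OK adapter."""
    return {(r.chassis, r.blade) for r in rows if r.status == "NOT-OK"}


# Sample rows copied from the Verify output in the transcript above.
SAMPLE = """\
Chassis/Blade MezzanineId PID SERIAL CARD_TYPE/CHECKSUM
1/1 1 N20-AC0002 QCI133201I3 OK
1/2 1 N20-AQ0002 QCI1405A4PC OK
1/3 1 N20-AC0002 QCI133201FP NOT-OK 06(03)
1/3 3 N20-AC0002 QCI1421A3QO NOT-OK 06(03)
"""
```

With the sample above, `blades_to_decommission(parse_fru_rows(SAMPLE))` yields `{(1, 3)}`, matching the single blade that the Fix pass in the transcript repaired.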

How To Identify Hardware Levels

Affected units can be identified by serial number. Use the link provided below to check if your serial number is affected:
FN 63362 Serial Number Validation

For More Information

If you require further assistance, or if you have any further questions regarding this field notice, please contact the Cisco Systems Technical Assistance Center (TAC).
