Monitoring Hardware
This chapter includes the following sections:
- Monitoring a Fabric Interconnect
- Monitoring a Chassis
- Monitoring a Blade Server
- Monitoring a Rack-Mount Server
- Monitoring an I/O Module
- Monitoring Management Interfaces
Monitoring a Fabric Interconnect
Procedure
Monitoring a Chassis
Procedure
Step 1 In the Navigation pane, click the Equipment tab.
Step 2 On the Equipment tab, expand .
Step 3 Click the chassis that you want to monitor.
Step 4 Click one of the following tabs to view the status of the chassis:
Option Description
General tab
Provides an overview of the status of the chassis, including a summary of any faults, a summary of the chassis properties, and a physical display of the chassis and its components.
Servers tab
Displays the status and selected properties of all servers in the chassis.
Service Profiles tab
Displays the status of the service profiles associated with servers in the chassis.
IO Modules tab
Displays the status and selected properties of all IO modules in the chassis.
Fans tab
Displays the status of all fan modules in the chassis.
PSUs tab
Displays the status of all power supply units in the chassis.
Hybrid Display tab
Displays detailed information about the connections between the chassis and the fabric interconnects. The display has an icon for the following:
Slots tab
Displays the status of all slots in the chassis.
Installed Firmware tab
Displays the current firmware versions on the IO modules and servers in the chassis. You can also use this tab to update and activate the firmware on those components.
Management Logs tab
Displays and provides access to the system event logs for the servers in the chassis.
Faults tab
Provides details of faults generated by the chassis.
Events tab
Provides details of events generated by the chassis.
FSM tab
Provides details about the FSM tasks related to the chassis, including the status of each task. You can use this information to diagnose errors with those tasks.
Statistics tab
Provides statistics about the chassis and its components. You can view these statistics in tabular or chart format.
Temperatures tab
Provides temperature statistics for the components of the chassis. You can view these statistics in tabular or chart format.
Power tab
Provides power statistics for the components of the chassis. You can view these statistics in tabular or chart format.
Monitoring a Blade Server
Procedure
Step 1 In the Navigation pane, click the Equipment tab.
Step 2 On the Equipment tab, expand .
Step 3 Click the server that you want to monitor.
Step 4 In the Work pane, click one of the following tabs to view the status of the server:
Option Description
General tab
Provides an overview of the status of the server, including a summary of any faults, a summary of the server properties, and a physical display of the server and its components.
Inventory tab
Provides details about the properties and status of the components of the server on the following subtabs:
Motherboard—Information about the motherboard and information about the server BIOS settings. You can also recover corrupt BIOS firmware from this subtab.
CIMC—Information about the CIMC and its firmware, and access to the SEL for the server. You can also assign a static or pooled management IP address, and update and activate the CIMC firmware from this subtab.
CPU—Information about each CPU in the server.
Memory—Information about each memory slot in the server and the DIMM in that slot.
Interface cards—Information about each adapter installed in the server.
HBAs—Properties of each HBA and the configuration of that HBA in the service profile associated with the server.
NICs—Properties of each NIC and the configuration of that NIC in the service profile associated with the server. You can expand each row to view information about the associated VIFs and vNICs.
Storage—Properties of the storage controller, of the local disk configuration policy in the service profile associated with the server, and of each hard disk in the server.
Tip If the server contains one or more SATA devices, such as a hard disk drive or solid state drive, Cisco UCS Manager GUI displays the vendor name for the SATA device in the Vendor field.
However, Cisco UCS Manager CLI displays ATA in the Vendor field and includes the vendor information, such as the vendor name, in a Vendor Description field. This second field does not exist in Cisco UCS Manager GUI.
Virtual Machines tab
Displays details about any virtual machines hosted on the server.
Installed Firmware tab
Displays the firmware versions on the CIMC, adapters, and other server components. You can also use this tab to update and activate the firmware on those components.
Management Logs tab
Displays the system event log for the server.
VIF Paths tab
Displays the VIF paths for the adapters on the server.
Faults tab
Displays an overview of the faults generated by the server. You can click any fault to view additional information.
Events tab
Displays an overview of the events generated by the server. You can click any event to view additional information.
FSM tab
Provides details about the current FSM task running on the server, including the status of that task. You can use this information to diagnose errors with that task.
Statistics tab
Displays statistics about the server and its components. You can view these statistics in tabular or chart format.
Temperatures tab
Displays temperature statistics for the components of the server. You can view these statistics in tabular or chart format.
Power tab
Displays power statistics for the components of the server. You can view these statistics in tabular or chart format.
Step 5 In the Navigation pane, expand .
Step 6 In the Work pane, right-click one or more of the following components of the interface card to open the navigator and view the status of the component:
Tip Expand the nodes in the table to view the child nodes. For example, if you expand a NIC node, you can view each VIF created on that NIC.
Monitoring a Rack-Mount Server
Procedure
Step 1 In the Navigation pane, click the Equipment tab.
Step 2 On the Equipment tab, expand .
Step 3 Click the server that you want to monitor.
Step 4 In the Work pane, click one of the following tabs to view the status of the server:
Option Description
General tab
Provides an overview of the status of the server, including a summary of any faults, a summary of the server properties, and a physical display of the server and its components.
Inventory tab
Provides details about the properties and status of the components of the server on the following subtabs:
Motherboard—Information about the motherboard and information about the server BIOS settings. You can also recover corrupt BIOS firmware from this subtab.
CIMC—Information about the CIMC and its firmware, and access to the SEL for the server. You can also assign a static or pooled management IP address, and update and activate the CIMC firmware from this subtab.
CPU—Information about each CPU in the server.
Memory—Information about each memory slot in the server and the DIMM in that slot.
Interface cards—Information about each adapter installed in the server.
HBAs—Properties of each HBA and the configuration of that HBA in the service profile associated with the server.
NICs—Properties of each NIC and the configuration of that NIC in the service profile associated with the server. You can expand each row to view information about the associated VIFs and vNICs.
Storage—Properties of the storage controller, of the local disk configuration policy in the service profile associated with the server, and of each hard disk in the server.
Tip If the server contains one or more SATA devices, such as a hard disk drive or solid state drive, Cisco UCS Manager GUI displays the vendor name for the SATA device in the Vendor field.
However, Cisco UCS Manager CLI displays ATA in the Vendor field and includes the vendor information, such as the vendor name, in a Vendor Description field. This second field does not exist in Cisco UCS Manager GUI.
Virtual Machines tab
Displays details about any virtual machines hosted on the server.
Installed Firmware tab
Displays the firmware versions on the CIMC, adapters, and other server components. You can also use this tab to update and activate the firmware on those components.
Management Logs tab
Displays the system event log for the server.
VIF Paths tab
Displays the VIF paths for the adapters on the server.
Faults tab
Displays an overview of the faults generated by the server. You can click any fault to view additional information.
Events tab
Displays an overview of the events generated by the server. You can click any event to view additional information.
FSM tab
Provides details about the current FSM task running on the server, including the status of that task. You can use this information to diagnose errors with that task.
Statistics tab
Displays statistics about the server and its components. You can view these statistics in tabular or chart format.
Temperatures tab
Displays temperature statistics for the components of the server. You can view these statistics in tabular or chart format.
Power tab
Displays power statistics for the components of the server. You can view these statistics in tabular or chart format.
Step 5 In the Navigation pane, expand .
Step 6 In the Work pane, right-click one or more of the following components of the interface card to open the navigator and view the status of the component:
Tip Expand the nodes in the table to view the child nodes. For example, if you expand a NIC node, you can view each VIF created on that NIC.
Monitoring an I/O Module
Procedure
Step 1 In the Navigation pane, click the Equipment tab.
Step 2 On the Equipment tab, expand .
Step 3 Click the I/O module that you want to monitor.
Step 4 Click one of the following tabs to view the status of the I/O module:
Option Description
General tab
Provides an overview of the status of the I/O module, including a summary of any faults, a summary of the module properties, and a physical display of the module and its components.
Fabric Ports tab
Displays the status and selected properties of all fabric ports in the I/O module.
Backplane Ports tab
Displays the status and selected properties of all backplane ports in the I/O module.
Faults tab
Provides details of faults generated by the I/O module.
Events tab
Provides details of events generated by the I/O module.
FSM tab
Provides details about the FSM tasks related to the I/O module, including the status of each task. You can use this information to diagnose errors with those tasks.
Statistics tab
Provides statistics about the I/O module and its components. You can view these statistics in tabular or chart format.
Management Interfaces Monitoring Policy
This policy defines how the mgmt0 Ethernet interface on the fabric interconnect should be monitored. If Cisco UCS detects a management interface failure, a failure report is generated. If the configured number of failure reports is reached, the system assumes that the management interface is unavailable and generates a fault. By default, the management interfaces monitoring policy is disabled.
If the affected management interface belongs to the fabric interconnect that is the managing instance, Cisco UCS confirms that the subordinate fabric interconnect is up and that no failure reports are currently logged against it, and then changes the managing instance for the endpoints.
If the affected fabric interconnect is currently the primary in a high availability setup, a failover of the management plane is triggered. The data plane is not affected by this failover.
You can set the following properties related to monitoring the management interface:
- Type of mechanism used to monitor the management interface.
- Interval at which the status of the management interface is monitored.
- Maximum number of monitoring attempts that can fail before the system assumes that the management interface is unavailable and generates a fault message.
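The failure-report threshold described in this section can be illustrated with a small Python model. This is only a sketch of the counting behavior, not Cisco UCS code: the class and method names are hypothetical, and the rule that a successful check clears earlier failure reports is an assumption that this section does not state.

```python
from dataclasses import dataclass


@dataclass
class InterfaceMonitor:
    """Toy model of failure-report counting for one management interface."""

    max_fail_report_count: int  # threshold taken from the monitoring policy
    failure_reports: int = 0
    fault_raised: bool = False

    def record_check(self, interface_ok: bool) -> bool:
        """Record one monitoring attempt and return True while a fault is raised."""
        if interface_ok:
            # Assumption: a successful check clears earlier failure reports.
            self.failure_reports = 0
            self.fault_raised = False
        else:
            self.failure_reports += 1
            if self.failure_reports >= self.max_fail_report_count:
                self.fault_raised = True
        return self.fault_raised


monitor = InterfaceMonitor(max_fail_report_count=3)
checks = (False, False, True, False, False, False)
results = [monitor.record_check(ok) for ok in checks]
print(results)
```

In this model the fault appears only on the last check, after three failed attempts in a row.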
Configuring the Management Interfaces Monitoring Policy
Procedure
Step 1 In the Navigation pane, click the Admin tab.
Step 2 In the Admin tab, expand .
Step 3 Click Management Interfaces.
Step 4 In the Work pane, click the Management Interfaces Monitoring Policy tab.
Step 5 Complete the following fields:
Name Description
Admin Status field
Whether the monitoring policy is enabled or disabled for the management interfaces.
Poll Interval field
The number of seconds the system should wait between data recordings.
Enter an integer between 90 and 300.
Max Fail Report Count field
The maximum number of monitoring attempts that can fail before the system assumes that the management interface is unavailable and generates a fault message.
Monitoring Mechanism field
The type of monitoring you want the system to use. You can select:
MII Status—The system monitors the availability of the Media Independent Interface (MII). If you select this option, Cisco UCS Manager GUI displays the Media Independent Interface Monitoring area.
Ping ARP Targets—The system pings designated targets using the Address Resolution Protocol (ARP). If you select this option, Cisco UCS Manager GUI displays the ARP Target Monitoring area.
Ping Gateway—The system pings the default gateway address specified for this Cisco UCS instance on the Management Interfaces tab. If you select this option, Cisco UCS Manager GUI displays the Gateway Ping Monitoring area.
Step 6 If you chose MII Status for the monitoring mechanism, complete the following fields in the Media Independent Interface Monitoring area:
Name Description
Retry Interval field
The number of seconds the system should wait before requesting another response from the MII if a previous attempt fails.
Enter an integer between 3 and 10.
Max Retry Count field
The number of times the system polls the MII until the system assumes the interface is unavailable.
Enter an integer between 1 and 3.
Step 7 If you chose Ping ARP Targets for the monitoring mechanism, complete the following fields in the ARP Target Monitoring area:
Name Description
Target IP 1 field
The first IP address the system pings.
Target IP 2 field
The second IP address the system pings.
Target IP 3 field
The third IP address the system pings.
Number of ARP Requests field
The number of ARP requests to send to the target IP addresses.
Enter an integer between 1 and 5.
Max Deadline Timeout field
The number of seconds to wait for responses from the ARP targets until the system assumes they are unavailable.
Enter an integer between 5 and 15.
To remove an ARP target, type 0.0.0.0 in its Target IP field.
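As an illustration of the 0.0.0.0 convention, the following Python sketch filters a list of Target IP values down to the targets that remain configured. The function name is hypothetical, and treating the fields as IPv4 addresses is an assumption; the sketch does not call any Cisco UCS interface.

```python
import ipaddress

REMOVE_TARGET = "0.0.0.0"  # value the procedure uses to remove an ARP target


def active_arp_targets(target_fields):
    """Return the configured ARP targets, skipping removed or empty entries."""
    active = []
    for value in target_fields:
        value = value.strip()
        if not value or value == REMOVE_TARGET:
            continue
        # IPv4Address raises ValueError for malformed input (IPv4 is assumed here).
        active.append(str(ipaddress.IPv4Address(value)))
    return active


targets = active_arp_targets(["10.0.0.1", "0.0.0.0", "10.0.0.3"])
print(targets)
```

Here the second target is treated as removed, so only the first and third addresses remain.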
Step 8 If you chose Ping Gateway for the monitoring mechanism, complete the following fields in the Gateway Ping Monitoring area:
Name Description
Number of Ping Requests field
The number of times the system should ping the gateway.
Enter an integer between 1 and 5.
Max Deadline Timeout field
The number of seconds to wait for a response from the gateway until the system assumes the address is unavailable.
Enter an integer between 5 and 15.
Step 9 Click Save Changes.
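Before entering values in the GUI, it can help to check planned settings against the ranges listed in this procedure. The following Python sketch does that. The dictionary keys and the function name are hypothetical labels chosen for this example, not Cisco UCS Manager identifiers. The Max Fail Report Count field is omitted because this procedure gives no range for it.

```python
# Allowed ranges copied from the field descriptions in this procedure.
# The keys are hypothetical names, not Cisco UCS Manager identifiers.
FIELD_RANGES = {
    "poll_interval": (90, 300),
    "mii_retry_interval": (3, 10),
    "mii_max_retry_count": (1, 3),
    "arp_requests": (1, 5),
    "arp_max_deadline_timeout": (5, 15),
    "ping_requests": (1, 5),
    "ping_max_deadline_timeout": (5, 15),
}


def out_of_range(settings):
    """Return the sorted names of fields whose values fall outside the documented range."""
    errors = []
    for name, value in settings.items():
        low, high = FIELD_RANGES[name]
        if not isinstance(value, int) or not low <= value <= high:
            errors.append(name)
    return sorted(errors)


planned = {"poll_interval": 60, "mii_retry_interval": 5, "mii_max_retry_count": 4}
problems = out_of_range(planned)
print(problems)
```

In this example the poll interval (60) and the MII retry count (4) are reported, because both fall outside the documented ranges.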