Normal
|
Normal
|
Normal
|
Normal
|
Normal
|
Failing Over
|
On Admin Request
|
Taking Over
|
On Admin Request
|
The administrator initiated a manual failover from node 1 to
node 2. The manual failover is in progress.
|
Idle
|
On Admin Request
|
Running in Backup Mode
|
On Admin Request
|
The manual failover from node 1 to node 2 that the administrator
initiated is complete.
|
Taking Back
|
On Admin Request
|
Falling Back
|
On Admin Request
|
The administrator initiated a manual fallback from node 2 to
node 1. The manual fallback is in progress.
|
Idle
|
Initialization
|
Running in Backup Mode
|
On Admin Request
|
The administrator restarts the SRM service on node 1 while node
1 is in
"Idle"
state.
|
Idle
|
Initialization
|
Running in Backup Mode
|
Initialization
|
The administrator either restarts both nodes in the presence
redundancy group, or restarts the SRM service on both nodes while the presence
redundancy group was in manual failover mode.
|
Idle
|
On Admin Request
|
Running in Backup Mode
|
Initialization
|
The administrator restarts the SRM service on node 2 while node
2 is running in backup mode, but before the heartbeat on node 1 times out.
|
Failing Over
|
On Admin Request
|
Taking Over
|
Initialization
|
The administrator restarts the SRM service on node 2 while node
2 is taking over, but before the heartbeat on node1 times out.
|
Taking Back
|
Initialization
|
Falling Back
|
On Admin Request
|
The administrator restarts the SRM service on node 1 while
taking back, but before the heartbeat on node 2 times out. After the taking
back process is complete, both nodes are in Normal state.
|
Taking
Back
|
Automatic Fallback
|
Falling
Back
|
Automatic Fallback
|
Automatic Fallback has
been initiated from node 2 to node 1 and is currently in progress.
|
Failed Over
|
Initialization or Critical Services Down
|
Running in Backup Mode
|
Critical Service Down
|
Node 1 transitions to Failed Over state when either of the
following conditions occur:
-
Critical services come back up due to a reboot of node 1.
-
The
administrator starts critical services on node 1 while node 1 is in Failed Over
with Critical Services Not Running state.
When node 1 transitions to Failed Over state the node is ready
for the administrator to perform a manual fallback to restore the nodes in the
presence redundancy group to Normal state.
|
Failed Over with Critical Services not Running
|
Critical Service Down
|
Running in Backup Mode
|
Critical Service Down
|
A critical service is down on node 1.
IM and Presence
Service performs an automatic failover to node 2.
Recommended Actions:
-
Check node 1 for any critical services that are down and try to
manually start those services.
-
If
the critical services on node 1 do not start, then reboot node 1.
-
When
all the critical services are up and running after the reboot, perform a manual
fallback to restore the nodes in the presence redundancy group to the Normal
state.
|
Failed Over with Critical Services not Running
|
Database Failure
|
Running in Backup Mode
|
Database Failure
|
A database service is down on node 1.
IM and Presence
Service performs an automatic failover to node 2.
Recommended Actions:
-
Reboot node 1.
-
When
all the critical services are up and running after the reboot, perform a manual
fallback to restore the nodes in the presence redundancy group to the Normal
state.
|
Running in Failed Mode
|
Start of Critical Services Failed
|
Running in Failed Mode
|
Start of Critical Services Failed
|
Critical services fail to start while a node in the presence
redundancy group is taking back from the other node.
Recommended Actions. On the node that is taking back,
perform the following actions:
-
Check the node for critical services that are down. To manually
start these services, click
Recovery in the
Presence Redundancy Group Configuration window.
-
If
the critical services do not start, reboot the node.
-
When
all the critical services are up and running after the reboot, perform a manual
fallback to restore the nodes in the presence redundancy group to the Normal
state.
|
Running in Failed Mode
|
Critical Service Down
|
Running in Failed Mode
|
Critical Service Down
|
Critical services go down on the backup node. Both nodes enter
the failed state.
Recommended Actions:
-
Check the backup node for critical services that are down. To
start these services manually, click
Recovery in the
Presence Redundancy Group Configuration window.
-
If
the critical services do not start, reboot the node.
|
Node 1 is down due to loss of network connectivity or the SRM
service is not running.
|
Running in Backup Mode
|
Peer Down
|
Node 2 has lost the heartbeat from node 1.
IM and Presence
Service performs an automatic failover to node 2.
Recommended Action. If node 1 is up, perform the following
actions:
-
Check and repair the network connectivity between nodes in the
presence redundancy group. When you reestablish the network connection between
the nodes, the node may go into a failed state. Click
Recovery in the
Presence Redundancy Group Configuration window to
restore the nodes to the Normal state.
-
Start the SRM service and perform a manual fallback to restore
the nodes in the presence redundancy group to the Normal state.
-
(If
the node is down) Repair and power up node 1.
-
When
the node is up and all critical services are running, perform a manual fallback
to restore the nodes in the presence redundancy group to the Normal state.
|
Node 1 is down (due to possible power down, hardware failure,
shutdown, reboot)
|
Running in Backup Mode
|
Peer Reboot
|
IM and Presence
Service performs an automatic failover to node 2 due to the following
possible conditions on node 1:
-
hardware failure
-
power down
-
restart
-
shutdown
Recommended Actions:
-
Repair and power up node 1.
-
When
the node is up and all critical services are running, perform a manual fallback
to restore the nodes in the presence redundancy group to the Normal state.
|
Failed Over with Critical Services not Running OR Failed Over
|
Initialization
|
Backup Mode
|
Peer Down During Initialization
|
Node 2 does not see node 1 during startup.
Recommended Action:
When node1 is up and all critical services are running, perform
a manual fallback to restore the nodes in the presence redundancy group to the
Normal state.
|
Running in Failed Mode
|
Cisco Server Recovery Manager Take Over Users Failed
|
Running in Failed Mode
|
Cisco Server Recovery Manager Take Over Users Failed
|
User move fails during the taking over process.
Recommended Action:
Possible database error. Click
Recovery in the
Presence Redundancy Group Configuration window. If
the problem persists, then reboot the nodes.
|
Running in Failed Mode
|
Cisco Server Recovery Manager Take Back Users Failed
|
Running in Failed Mode
|
Cisco Server Recovery Manager Take Back Users Failed
|
User move fails during falling back process.
Recommended Action:
Possible
database error. Click
Recovery in the
Presence Redundancy Group Configuration window. If
the problem persists, then reboot the nodes.
|
Running in Failed Mode
|
Unknown
|
Running in Failed Mode
|
Unknown
|
The SRM on a node restarts while the SRM on the other node is in
a failed state, or an internal system error occurs.
Recommended Action:
Click
Recovery in the
Presence Redundancy Group Configuration window. If
the problem persists, then reboot the nodes.
|
Backup Activated
|
Auto Recover Database Failure
|
Failover Affected Services
|
Auto Recovery Database Failure.
|
The database goes down on the backup node. The peer node is in
failover mode and can take over for all users in the presence redundancy group.
Auto-recovery operation automatically occurs and all users are moved over to
the primary node.
|
Backup Activated
|
Auto Recover Database Failure
|
Failover Affected Services
|
Auto Recover Critical Service Down
|
A critical service goes down on the backup node. The peer node
is in failover mode and can take over for all users in the presence redundancy
group. Auto-recovery operation automatically occurs and all users are moved
over to the peer node.
|
Unknown
|
Unknown
|
Node state is unknown.
A
possible cause is that high availability was not enabled properly on the
IM and Presence
Service node.
Recommended Action:
Restart
the Server Recovery Manager service on both nodes in the presence redundancy
group.
|