Cisco Unified Serviceability alarms and CiscoLog messages
This chapter describes the Cisco Unified Serviceability alarms and error messages and CiscoLog message format. Network alarms tracked by Cisco Unified Serviceability for Cisco Unified Communications Manager generate the error messages.
Note
A History table lists Cisco Unified Serviceability error messages that have been added, changed, or removed beginning in Cisco Unified Communications Manager Release 7.0(1).
Cisco Unified Serviceability alarms and CiscoLog messages
Cisco Unified Serviceability alarms provide information on runtime status and the state of the system, so you can troubleshoot problems that are associated with your system. The alarm or error message information includes the application name, machine name, and recommended action and other critical information to help you troubleshoot.
You configure the alarm interface to send alarm information to multiple locations, and each location can have its own alarm event level (from debug to emergency). You can direct alarms to the Syslog Viewer (local syslog), SNMP traps, Syslog file (remote syslog), SDI trace log file, SDL trace log file (for Cisco Unified CM and CTIManager services only), or to all destinations.
You use the Trace and Log Central option in the Cisco Unified Real-Time Monitoring Tool (RTMT) to collect alarms that get sent to an SDI or SDL trace log file. To view the alarm information sent to the local syslog, use the SysLog Viewer in RTMT.
Note
All the alarms are logged based on their severity and settings of alarm event level. For information on viewing the alarm configuration settings, refer to the Cisco Unified Serviceability Administration Guide.
CiscoLog, a specification for unified logging in Cisco software applications, gets used in the Cisco Unified RTMT. It defines the message format when messages are logged into file or by using the syslog protocol. The output that is provided by Cisco software applications gets used for auditing, fault-management, and troubleshooting of the services that are provided by these applications.
Be aware that CiscoLog message format is compatible with one of the message formats that is produced by Cisco IOS Release 12.3 by using the syslog protocol when Cisco IOS Software is configured with the following commands:
service sequence-numbers—A default sequence number that is produced by Cisco IOS. An additional sequence number can also be enabled with this command. This command forces sequence numbers to be shown in terminal output, but results in two sequence numbers in the syslog output. CiscoLog standardizes on a format with just one sequence number. Thus, the compliant Cisco IOS Software configuration occurs when the second number is disabled by using the no service sequence-numbers command.
logging origin-id hostname—The CiscoLog HOST field remains consistent with that produced by the Cisco IOS Release 12.3 when configured with this command. This command does not get documented in the Cisco IOS Software documentation but is available in Cisco IOS Release 12.3. CiscoLog stays compatible with the results that Cisco IOS Software produces in this field.
service timestamps log datetime localtime msec show-timezone year—The CiscoLog TIMESTAMP field remains consistent with the timestamp format produced by Cisco IOS Release 12.3 when configured with this command.
Note
CiscoLog uses the same field delimiters as Cisco IOS Software Release 12.3.
When CiscoLog messages are written directly into a log file by an application, each message is on a separate line. The line separator should be a standard line separator used on a given platform. On Windows, the line separator must be the sequence of carriage return and line feed characters (ASCII decimal values 13 and 10; often designated as "\r\n" in programming languages). On Solaris and Linux, the line separator is a single line feed character (ASCII decimal value 10 and in programming languages typically "\n"). Two line separators must never appear one after another, for example, you cannot have "\r\n\r\n" on Windows, but "\r\n" is fine because these two characters are a single line separator.
In practical terms, this means that applications should be careful when appending data to an existing log. In some cases an initial line break is required and in others not. For example, if application crashes when writing CiscoLog message, but before it wrote a line break to file, then when the application starts up, it should print an initial line break before printing the next message. An application can determine if an initial line break is necessary during startup by checking the last character sequence in the log file that will be used for appending.
CiscoLog message format is identical for messages written directly to a log file or those generated by using the syslog protocol with two minor exceptions. When CiscoLog messages are written directly into to a file they must be appended with line separators. When CiscoLog messages are sent by using the syslog protocol then the syslog RFC 3164 protocol PRI header must be prepended to each CiscoLog message.
The syslog PRI field encodes syslog message severity and syslog facility. The severity encoded in the PRI field must match the value of the CiscoLog SEVERITY field. Any syslog facility can be used regardless of the content of the message. Typically, a given application is configured to send all its messages to a single syslog facility (usually RFC 3164 facilities local 0 through local 7). Refer to RFC 3164 for details about how to encode the PRI field. Below is an example of a CiscoLog message with the syslog protocol PRI field <165> which encodes the severity level of notice (5) and facility value local4.
<165>11: host.cisco.com: Jun 13 2003 12:11:52.454 UTC: %BACC-5-CONFIG: Configured from
console by vty0 [10.0.0.0]
Messages as shown in the example above can be sent to UDP port 514 if using RFC 3164 logging mechanism.
Syslog RFC 3164 provides additional guidelines for message content formatting beyond the PRI field. However, RFC 3164 is purely information (not on IETF standards track) and actually allows messages in any format to be generated to the syslog UDP port 514 (see section 4.2 of RFC 3164). The RFC provides observation about content structure often encountered in implementations, but does not dictate or recommend its use. CiscoLog format does not follow these observations due to practical limitations of the format defined in the RFC. For example, the time stamp is specified without a year, time zone or milliseconds while the hostname can only be provided without the domain name.
CiscoLog messages must remain unaltered when relayed. The PRI field is not part of a CiscoLog message, but rather a protocol header. It can be stripped or replaced if necessary. Additional headers or footers can be added to and stripped from the CiscoLog message for transport purposes.
Standard syslog server implementations
Standard syslog server implementations can be configured to forward received log messages or to store the messages locally. Most syslog server implementations strip the PRI field from the received messages and prefix additional information to the message before storage. This additional information typically includes two extra fields: the local time stamp and the host identifier (IP or DNS name) of the server, which generated or relayed the message.
The following example of a CiscoLog message shown the output after being logged by the Solaris 8 syslog server:
Jun 13 12:12:09 host.cisco.com 11: host.cisco.com: Jun 13 2003 12:11:52.454 UTC: %BACC-5-CONFIG: Configured from console by vty0 [10.0.0.0]
There is no standard that defines how syslog servers must store messages. Implementations vary greatly. CiscoLog only addresses the format in which messages are sent to the syslog server, not how they are stored by the server that receives them. Specifically, the format and presence of any additional header fields in syslog log files is outside of the scope of this specification.
Note
The CiscoLog specification recommends that the syslog server implementation store CiscoLog messages in exactly the same format as it receives them only stripping the PRI field and without any extra headers. This would provide an identical storage format for CiscoLog messages written directly to the log file by an application or logged through syslog protocol.
Clock synchronization
It is important that the clocks of all hosts of a distributed application be synchronized with one authoritative clock. This can be accomplished by using protocols such as NTP. Clock synchronization is recommended because the time stamps in log messages are required in order to be able to reconstruct the correct sequence of events based on messages originating from multiple processes or multiple hosts. Clock drifts can still occur, but ongoing synchronization should reduce this issue to a minimum.
Multipart messages
ASCII control characters are not permitted in any of the fields of CiscoLog message format. Control characters include characters such as line feed, form feed and carriage returns. This means that multi-line messages are not allowed unless to allow:
Better presentation (for example, a stack trace)
Fragmenting messages which exceed 800 octet limit
Multi-part CiscoLog message consists of a set of multiple valid CiscoLog messages. Messages are grouped together using a special tag key "part", which identifies the part number and the sequence number of the original message.
All messages which are part of a multi-part message must have a "part" tag as well as identical values for the HOST, TIMESTAMP, APPNAME, SEVERITY fields and other TAG values. However, the sequence number of each message has to be incremented as usual.
In this example, the first message has part number 1 and its sequence number, 16, embedded in the part tag. Subsequent messages embed the sequence number of the first message part and provide their own part number. The trailing "/3" in each part tag value means that the message consists of three parts.
All fields gets separated by a single colon character (ASCII decimal value 58) and a single space character (ASCII decimal value 32). The HEADER field is also preceded by a percent character (ASCII decimal value 37).
The TIMESTAMP, HEADER and TAGS fields have internal formatting. Below is a complete format with details for TIMESTAMP and HEADER fields:
MESSAGE – "Bad request received from device [1,6,aa:bb:cc:11:22:33]. Header missing."
Message length limit
The maximum length of a complete CiscoLog message must not exceed 800 octets.The term octet is used for 8-bit data type instead of byte because byte is not 8 bits on some platforms. The words "character" and "octet" are not synonyms in parts of this specification because in places were internationalization is supported a single character may need to be represented with multiple octets. This limit is dictated by RFC 3164. The limit of 1024 octets reserves some extra space for syslog forwarding headers and/or fields that may be formalized in later specifications.
When CiscoLog message includes the syslog PRI field, then the combined CiscoLog messages and PRI field length must not exceed 805 octets.
SEQNUM field
The SEQNUM field contains a sequence number, which can be used to order messages in the time sequence order when multiple messages are produced with the same time stamp by the same process. The sequence number begins at 0 for the first message fired by a process since the last startup and is incremented by 1 for every subsequent logging message originated by the same process. Every time the application process is restarted, its sequence number is reset back to 0. The sequence number of each message must be in the exact order in which messages are fired/logged by the application.
This may mean that in a multi-threaded application there must be some kind of synchronization to ensure this and another consideration may have to be made for Java applications that have some native (C) code in JNI. If log messages originate in both native and Java parts of the same process, the implementation needs to be synchronized to use the same sequence number counter across the two process parts and to fire messages in the order of sequence numbers.
The maximum numeric value of the SEQNUM field is 4,294,967,295 at which point the counter must be reset back to 0. The maximum positive value of a 32-bit unsigned integer as used in Cisco IOS. Cisco IOS uses ulong for the sequence number counter and ulong is a 32-bit unsigned integer on all current Cisco IOS platforms including mips, ppc, and 68k.
Sequence numbers are process specific. If application architecture has multiple application processes on a single host, which share a single logging daemon, the sequence number still has to be process-specific. Thus, each process has it is own sequence number which it increments.
Sequence numbers also help detect lost messages. Therefore, sequence numbers cannot be skipped. In other words, a message must be produced for every number in the sequence order.
HOST field
The HOST field identifies the system originating the message with a Fully Qualified DNS Name (FQDN), hostname or an IPv4/IPv6 address. If the FQDN or hostname is known, one of the two has to appear in the HOST field. It is expected that in most deployments the hostname is sufficient. However, if a deployment spans multiple domains, then using FQDNs is recommend. If an application is expected to be deployed in both scenarios, then it is recommended that the application default to the FQDNs, but make it a configurable option.
If neither FQDN nor hostname can be identified, then the IP address of the host must be used. If the IP address cannot be identified, then a constant "0.0.0.0" (without quotes) must appear in place of the HOST field.
Note
With regards to the compliance with Cisco IOS format. Cisco IOS Release 12.3 supports producing hostname, IP address, or any user-defined string in the HOST field. If it is configured to provide a hostname and it is not set on the device, it will use a string such as "Router."
The length of the HOST field must not exceed 255 octets.
If multiple FQDNs or hostnames are known for a given system, applications must use the primary FQDN/hostname or an arbitrary one if no primary is designated. However, applications must use the same HOST field value until some relevant configuration change takes place. In other words, the FQDN/hostname value should not arbitrarily change from message to message if system is configured with multiple FQDNs/hostnames.
Only printable US ASCII characters (those with decimal values 32-126) and foreign language characters are allowed in the HOST field when encoding an FQDN or hostname. The appropriate character set and encoding for HOST should be compliant with RFC 1123 / STD-3.
The acceptable character set per these standards includes US ASCII letters, numbers, dash and dot separator characters (although not starting or ending with a dash). The reason that these are only recommendations of adhering to these standards is that, in practice, many hosts do not follow the convention and use characters such as underscore in the hostname. However, the HOST field cannot contain a character sequence of ": " (colon and space) as this sequence is used as a field delimiter in the CiscoLog format.
Foreign language characters outside of the printable US ASCII characters have to be encoded according to internationalization rules.
Use of non-printable (control) ASCII characters is not allowed in the HOST field. Control characters include characters with ASCII decimal values 0-31 and 127. If an application provides a CiscoLog-compliant library with a host string, which includes one or more control characters, the logging library must do the following. If the horizontal tab character (ASCII decimal value 9) is encountered, it must be replaced with one or more space characters (ASCII decimal value 32). Eight spaces per tab are recommended because this is a convention on most Unix and Windows platforms. Other control characters must each be replaced with a question mark character (ASCII decimal value 63).
While DNS is letter-case agnostic, CiscoLog places an additional recommendation of using only lower-case characters in the HOST field for ease of readability. The use of the trailing dot at the end of the FQDN is optional. The following examples are valid HOST fields:
host123
host-123
host123.cisco.com
host123.cisco.com.
IP addresses
The IP address value used in the HOST field can be either an IPv4 or IPv6 address. If a device has multiple IP addresses, the primary IP address of the device must be used regardless of the interface through which the CiscoLog message is sent to syslog server. If no primary IP address is designated, a fixed/static IP address is preferred to a dynamically assigned one. If multiple static IP addresses exist, any one can be used, but it must be used consistently in all messages until a relevant configuration event occurs on the system.
IPv4 Address—IPv4 address should be represented in dot notation "x.x.x.x", where x is a decimal value from 0 to 255 encoded as ASCII text. If an IP address is unknown, "0.0.0.0" (without quotes) must be used as a place holder. Examples of valid IPv4 addresses are 0.0.0.0 and 212.1.122.11.
Below is an example of a message with an IPv4 address in the HOST field:
11: 212.1.122.11: Jun 13 2003 23:11:52.454 UTC:
%BACC-3-BAD_REQUEST: Bad request received from device [1.2.3.4]. Missing header.
Below is an example of a CiscoLog message when FQDN, hostname or IP are all unknown:
11: 0.0.0.0: Jun 13 2003 23:11:52.454 UTC:
%BACC-3-BAD_REQUEST: Bad request received from device [1.2.3.4]. Missing header.
IPv6 Address—IPv6 address representation must follow conventions outlined in RFC 3513, sections 2.2.1, 2.2.2 and 2.2.3. Specifically, all three conventions are supported. Both lower-case and upper-case letters can be used in the IPv6 address, but the lower-case letters are recommended. If an IP address is unknown, "0.0.0.0" (without quotes) should be used as the IP address. Examples of valid IPv6 addresses:
1080:0:0:800:ba98:3210:11aa:12dd (full notation)
1080::800:ba98:3210:11aa:12dd (use of "::" convention)
0:0:0:0:0:0:13.1.68.3 (last 4 octets expanded as in IPv4)
0.0.0.0 (unknown FQDN, hostname and IP address )
Below is an example of a message with an IPv6 address in the HOST field:
11: 1080:0:0:800:ba98:3210:11aa:12dd: Jun 13 2003 23:11:52.454 UTC:
%BACC-3-BAD_REQUEST: Bad request received from device [1.2.3.4]. Missing header.
TIMESTAMP field
The TIMESTAMP field provides date with year, time with milliseconds and a time zone identifier in the following format:
Jun 13 2003 23:11:52.454 UTCJun 3 2003 23:11:52.454 UTC
Jun 22 2003 05:11:52.525 -0300
*Feb 14 2003 01:02:03.005 EST
In some cases, it is possible that a device may not have the knowledge of the date and/or time due to hardware or software limitations. In such circumstances, the following string must be produced TIMESTAMP field: "--- 00 0000 00:00:00.000 ---". Below is an example of a CiscoLog message from a device which has no knowledge of date and/or time:
11: host.domain.com: --- 00 0000 00:00:00.000 ---: %BACC-3-BAD_REQUEST: Bad request received from device [1.2.3.4]. Missing header.
Devices which are not aware of their clock, may choose to provide an uptime as a relative measure of time. If device is capable of providing uptime, it is recommended that does so as a substitute for unavailable time stamp. If uptime is provided it must be provided with a standard uptime tag as outlined in the CiscoLog Standard Tags specification.
The following table details each field specification.
Table 1 TIMESTAMP field specifications
Field
Specification
ACCURACY
This is an optional field. If present, it must be either a single asterisk character (ASCII decimal value 42), or a single dot character (ASCII decimal value 46). No separator character is used after this field. This field indicates the status of clock synchronization.
Cisco IOS uses a special convention for time prefixes to indicate the accuracy of the time stamp. If dot character appears before the date, it means that the local time was synchronized at some point via NTP, but currently no NTP servers are available. The asterisk character in front of the date means that the local time is not authoritative, i.e. NTP servers are not setup.
CiscoLog supports the use of this convention, but does not require it. If an application is integrated with NTP client software, and knows that its time is out of sync, then it can optionally prefix the message with asterisk character. However, because applications may choose not to use this scheme, the lack of "." or "*" in CiscoLog messages should not be interpreted to mean that the local time is synchronized.
MONTH
Must be one of the following three-character month designations followed by a single space (ASCII decimal value 32) as a delimiter character: Jan, Feb, Mar, Apr, May, Jun, Jul, Sep, Oct, Nov or Dec.
DAY
Must consist of two characters. If day is a single digit, it must be prefixed with a single space character. The acceptable range of values is from 1 to 31. The day value must be followed by a single space as a delimiter character.
YEAR
Must consist of exactly 4 digit characters followed by a space as a delimiter character.
HOUR
Must consist of exactly two number characters. The hour value is based on a 24-hour clock. Values range from 00 to 23. If hour value is a single digit, it must be prefixed with a single zero character. The hour value must be followed by a single colon as a delimiter character.
MINUTES
Must consist of exactly two number characters. Values range from 00 to 59. If minute value is a single digit, it must be prefixed with a single zero character. The minutes value must be followed by a single colon as a delimiter character
SECONDS
Must consist of exactly two number characters. Values range from 00 to 59. If seconds value is a single digit, it must be prefixed with a single zero character. The seconds value must be followed by a period as a delimiter character.
MILLISECONDS
Must consist of exactly 3 digit characters. Values range from 000 to 999. If milliseconds value is less then 3 digits in length it must be prefixed with extra zeros to make it a 3-character field. The milliseconds value is followed by a space as a delimiter character.
TIMEZONE
Must consist of at least one, but no more than 7 characters in the following ASCII decimal value range: 32-126. The value must not include a combination of colon-space-percent of characters – ": %" (ASCII decimal values 58, 32, 37) – as this character combination is reserved as a field delimiter that follows the time stamp.
There is no standard set of acronyms for time zones1. A list of common time zone acronyms and corresponding time offsets from UTC is provided in the UTC specification.
Uppercase letters are recommended for time zone acronym values. CiscoLog recommends the use of time offset instead of time zone identifier in this field. The offset, if provided, must follow the following format "-hhmm" or "+hhmm" to indicate hour and minute offset from UTC.
In this format time zone field must always contain 5 characters, with the last 4 characters being constrained to numbers only. Unlike a textual time zone identifier, this format provides a specific time offset from universal standard time.
Cisco IOS Release 12.3 supports any 7-character string as a time zone identifier, so it can be configured in a way which is compatible with this recommendation. Multiple messages may and sometimes must be produced with exactly the same time stamp. This can happen naturally on a non-preemptive operating system or may need to be deliberately induced as in the case of multi-part messages. Sequence numbers then become helpful for establishing message order. Time stamp should always be accurate to the millisecond unless it can significantly hinder performance of the application.
In either case, applications must always provide the administrator with an option to output messages with exact time stamp in milliseconds. If an application uses time stamp with accuracy to the second (instead of a millisecond), it must put the last known milliseconds value or 000 in place of the milliseconds. Whatever convention is chosen by the application, it should be followed consistently.
The APPNAME field in the HEADER defines the name of the application producing the message. Cisco IOS uses FACILITY in place of APPNAME that names the logical component producing the message. Cisco IOS 12.3 defines approximately 287 facilities for 3950 messages. Example of some easily recognizable facilities: AAAA, SYS, ATM, BGP, CRYPTO, ETHERNET, FTPSERVER, CONFIG_I, IP, ISDN, RADIUS, SNMP, SYS, TCP, UBR7200, X25. A complete list of defined facilities is available in Cisco IOS documentation at http://.
Outside of the Cisco IOS, there can be multiple applications on the same host originating log messages. Therefore, it is necessary that APPNAME field identify the specific application. Additional source identifiers are available in the HOST field as well as various standard TAGS field values (pname, pid, comp, etc).
The APPNAME field must consist of at least two uppercase letters or digits and may include underscore characters. More precisely, the acceptable character set is limited to characters with the following ASCII decimal values: 48-57 (numbers), 65-90 (upper-case letters) and 95 (underscore).
The length of the APPNAME field must not exceed 24 characters.
Application names cannot conflict with other Cisco software applications and with Cisco IOS facilities.
On the Solaris platform, it is recommended (not required) that the application name values used in the APPNAME field be consistent with those used for the application installation package name, only in upper case and without the CSCO prefix. For example, an application registering as "CSCObacc" on Solaris should use "BACC" as the value of the APPNAME field.
Some applications may choose to specify a version as part of the APPNAME field. This is acceptable and may be useful in cases where the meaning of certain messages is redefined from one release to another. For example, an APPNAME value could be "BACC_2_5" for BACC version 2.5. The use the version within an application name is optional and may be introduced by applications in any release.
SEVERITY field
The SEVERITY field is a numeric value from 0 to 7, providing eight different severities. The severities defined below match Cisco IOS severity levels. They are also standard syslog severities.
It is important that messages use the correct severity. An error in a certain component may be severe as far as the component is concerned, but if the overall application handles it gracefully, then the severity may be lower for the application as a whole. The following table lists guidelines that should be followed in determining the severity of a message.
Table 2 Name and Severity Level and Descriptions in Error Messages
Name/Severity Level
Description
Emergency (0)
System or service is unusable. Examples:
Service repeatedly fails to startup
System ran out of disk space while disk space is essential for this system to operate
Application requires root privileges to run but does not have them
Alert (1)
Action must be taken immediately. Examples:
Application is about to run out of licenses
Application is about to run out of disk space
Too many unauthorized access attempts detected
Denial of service attack is detected
Critical (2)
Critical condition. Similar to alert, but not necessarily requiring an immediate action. Examples:
Received an invalid authentication request
Service crashed due to an error that could not be handled, like an out of memory condition, (provided it has a watchdog process to restart it, it does not necessarily require immediate action)
Unexpected code error that could not be handled
Error (3)
An error condition, which does not necessarily impact the ability of the service to continue to function. Examples:
Problem parsing/processing a particular request which does not prevent the application from handling other requests
Unexpected, but handled code exception
Warning (4)
A warning about some bad condition, which is not necessarily an error. Examples:
Lost network connection to some resource
Timed out waiting for a response
Notice (5)
Notifications about system-level conditions, which are not error conditions. Examples:
Configuration was updated (not audit level information)
Process has started
Process is shutting down gracefully on request
Informational (6)
Informational messages are distinguished from notification in that they provide information for internal flows of the application or per-request information instead of system-wide notifications. Informational messages are used for troubleshooting by users who are familiar with the basic flows of the application. Examples:
Request received
Request was parsed successfully
Request being processed
Response sent back
Acknowledgement received
Detailed audit information
Debug (7)
Debugging messages are similar to informational messages, but provide more detail and require the user to have better knowledge of system internal processing. These messages are typically reserved for very advanced users or Cisco technical support. Examples:
Complete details for a request packet
Internal state machine state changes
Internal profiling statistics
Internal events
If an application uses a default severity level to determine which messages should be logged, then it is recommended that this level be set at 5 (notice). This ensures that all messages of severity 5 or higher are logged by default.
MSGNAME field
The MSGNAME field of the HEADER uniquely identifies the message within the context of a given APPNAME. A fixed severity and logical meaning is associated with a specific MSGNAME within a specific APPNAME. In other words, the same message name cannot appear with different severity or a completely different logical meaning for the same APPNAME value even if the message is originated by a different process.
Message names are only unique within a given application (a given APPNAME value) unless the message is one of the standard messages. Thus, applications interpreting CiscoLog messages should be careful not to assume that a message with a given name has the same meaning for all applications that may use this message name. Indeed, if the message is not one of the standard messages, it may have a different severity and meaning in a different application.
The MSGNAME field must consist of at least two characters. Acceptable characters are limited to the following ASCII decimal values: 48-57 (numbers), 65-90 (upper-case letters) and 95 (underscore). While IOS allows lower-case letters as well, the vast majority of IOS messages use only the upper-case letters. In order to be consistent with established conventions we opted to restrict the character set to upper-case letters, numbers and underscore characters.
Both numeric-only or alphanumeric message names are acceptable. However, per IOS convention, it is recommended that a user-friendly alphanumeric label be preferred to a numeric-only label. For example, "NO_MEMORY" message name is preferred to a "341234" identifier.
A special tag mid is defined in the CiscoLog Standard Tags specification for identifying a numeric id corresponding to a message name. This tag can be used to provide a numeric message is in addition to the MSGNAME. When this tag is used, a given MSGNAME must always correspond to a single message id value. CiscoLog defines mid tag values for each standard message.
The length of the MSGNAME field must not exceed 30 characters, but most message names should be more concise. MSGNAME value may not conflict with the names defined in this standard.
A separate message name must be defined for each logically different message. In other words, while the message text for a given message name can vary by virtue of some substitutable parameters, logically different messages must have different message names.
The following is an example of correct use of message name:
11: host.cisco.com: Jun 13 2003 23:11:52.454 UTC: %BACC-4-CONNECTION_LOST: %[pname.orig=rdu]: Server lost connection to host [1.1.1.1]12: host.cisco.com: Jun 13 2003 23:11:52.458 UTC: %BACC-4-CONNECTION_LOST: %[pname.orig=rdu]: Server lost connection to host [2.2.2.2
Notice that while the IP address of the host changes, it is still logically the same type of message. The following is an example of an INCORRECT use of the message name:
15: host.cisco.com: Jun 13 2003 23:11:52.458 UTC: %BACC-4-CONNECTION: %[pname.orig=rdu]: Server lost connection to host [2.2.2.2]16: host.cisco.com: Jun 13 2003 23:11:52.468 UTC: %BACC-4-CONNECTION: %[pname.orig=rdu]: Server re-established connection to host [2.2.2.2]
The use of a single message name for two different events in the above example is wrong and unacceptable. This is referred to as a "catch-all" message name and they must be avoided. Another extreme example is defining a message named "ERROR" and providing all error log messages under the same message name. This defeats the purpose of having the message name field, which is to enable external filtering of messages or easily trigger actions.
The only exception to the "no-catch-all" rule is when message cannot be identified ahead of time with anything better than a generic description or the users will not benefit from distinguishing the various subtypes of the message.
Although some applications may choose to do so, there is generally no need to define a separate message name for all debugging messages because debugging messages are not intended for automated filtering and action triggering based on message name. The sheer number of debugging messages and the highly dynamic nature of what is produced in them makes it very hard to define separate messages.
This specification proposes establishing a mailing list that could be used by groups for consulting purposes when in doubt about how to define certain messages. Currently, the mailing list alias used for this purpose is "cmn-logging".
TAGS field
The TAGS field is optional in the message format. It provides a standard mechanism for applications to provide structured content in the form of key-value pairs which can be used to categorize or filter a set of messages externally.
Tags can be used to identify virtual logging channels. A set of messages flagged with the same tag can later be grouped together. For example, an application may flag messages belonging to a particular thread by supplying the corresponding tag. This would then allow filtering and viewing messages based on threads.
Virtual logging channels can also be established across multiple applications. For example, if all applications could tag requests from a device with device id (mac, ip, etc), then it would be easy to filter all messages related to that device even thought it communicates with multiple components.
Each application may define its own set of supported tags. A single tag consists of key and value pair separated by the equals sign and surrounded by square bracket characters as in the following format: [KEY=VALUE]. This is an example of a valid tag key-value pair [ip=123.23.22.22].
The TAGS field is prefixed with a percent character (ASCII decimal value 37) and ends with a sequence of colon and space characters (ASCII decimal values 58 and 32). When multiple tags are assembled together, no characters should appear between the tags as separators. The following example has a complete CiscoLog message with four tags:
12: host.cisco.com: Jun 13 2003 23:11:52.454 UTC: %BACC-4-BAD_REQUEST: %[pname.orig=rdu][comp=parser][mac=1,6,aa:bb:cc:11:22:33][txn=mytxn123]: Bad request received from device [1,6,aa:bb:cc:11:22:33]. Missing header.
If TAGS field is missing, the percent character prefix and the trailing colon and space must be omitted. Thus, when the TAGS field is missing, the HEADER and MESSAGE fields must be separated by just a single colon and a space which follows the HEADER field. For example:
12: host.cisco.com: Jun 13 2003 23:11:52.454 UTC: %BACC-4-BAD_REQUEST: Bad request received from device [1,6,aa:bb:cc:11:22:33]. Missing header.
Multiple tags with the same tag key can be provided in the same message. This essentially provides the capability for handling multi-valued keys. Below is an example of a message produced from a device which has two IP addresses where the application chose to provide both IP addresses in the TAGS field as well as the process name:
12: host.cisco.com: Jun 13 2003 23:11:52.454 UTC: %BACC-4-BAD_REQUEST: %[pname.orig=rdu][ip.orig=1.1.1.1][ip.orig=1.1.1.2]: Bad request received from device [1,6,aa:bb:cc:11:22:33]. Missing header.
Any number of tags can be provided in a given message. The only limit is the overall length limit of the CiscoLog message of 800 octets.
If multiple tags are present, it is recommended that they appear in the alphanumeric order of the keys. This insures that tags are always produced in the same order. However, a different order may be chosen by an application if the order of tags is used to communicate some semantic value.
Tag key must contain at least one character. The characters are limited to ASCII characters with decimal values 48-57 (numbers), 65-90 (upper-case letters), 95 (underscore), 97-122 (lower case letters). Use of lower-case letters is recommended. There is no strict limit on tag key length, although a general message limit of 800 octets applies and dictates that one should attempt to define short tag key names.
Tag semantic extensions
In some cases, a tag can have a standard value syntax, but different meaning depending on the content in which it is used. Tag semantic extensions are used to differentiate the contextual meaning of tags.
The semantic extension tags are created by appending the tag key with a single dot character (ASCII decimal value 46) and a text string consisting of characters from a proper character set.
For example, an "ip" tag defines syntax for an IP address representation, but no semantic value. An "ip" tag found in a CiscoLog message generally means only that this IP address is somehow related to the message. In some cases, such vague association is sufficient. However, sometimes, communicating semantic value could be useful.
A message may have two IP address tags associated with it, for example, from and to IP addresses. In this case, using tags "ip.from" and "ip.to" would communicate both the syntax of the tags and some semantic value. Another example, is a standard tag "ip.orig", which specifies the IP address of the host which originated the message. The following is an example of all three tags appearing together:
Multiple levels of semantic extension tags are allowed with each extension providing meaning that is more specific. For example, tag key "ip.to.primary" is valid and could mean the primary IP address of the destination host.
The semantic value is much harder to standardize than the syntax because there can an infinite number of meanings for a given value depending on the context. Thus, it is anticipated that defining tag semantics extensions will be largely application specific.
Tag values
Tag values may contain zero or more characters. The empty (zero characters) value is interpreted as unknown or undetermined value. The value must only include printable US ASCII characters (those in the ASCII decimal value range 32-126) and foreign language characters
There is a restriction on the use of three characters: "["," ]" and "\". The bracket characters (ASCII decimal values 91 & 93) must be escaped with a back slash character (ASCII decimal value 92) . This helps to avoid confusion with the brackets that signify the start/end of the tag. Thus, when the tag value needs to represent characters "[" or "]", a sequence of "\[" or "\]" is used instead respectively. When the escape character itself needs to be represented in the tag value, then instead of the "\" character a sequence of "\\" is used.
Use of non-printable (control) ASCII characters is not allowed in the TAG value field. Control characters include characters with ASCII decimal values 0-31 and 127. If application provides to a CiscoLog-compliant library a tag value string, which includes one or more control characters, the logging library must do the following. If the horizontal tab character (ASCII decimal value 9) is encountered, it must be replaced with one or more space characters (ASCII decimal value 32). Eight spaces per tab are recommended because this is a convention on most Unix and Windows platforms. Other control characters must each be replaced with a question mark character (ASCII decimal value 63). Technically, we only need to require escaping a closing bracket. However, requiring escaping both open and closing brackets simplifies parser code and provides for a more consistent display in raw form.
There is no strict limit on tag value length; although a general message length limit of 800 octets applies and dictates that one must be conservative.
Tag guidelines
The TAGS field is optional in the CiscoLog message format. Tags do not replace substitutable parameters in the message body. Tags merely provide an additional way to identify and categorize messages.
Since tags are optional, they can be enabled or disabled by the application/user as required. There is no requirement for the same message to always be produced with the same set of tags. If the application supports a given tag, it does not necessarily mean that it must always produce it. This can be configurable. Indeed, it is recommended that applications provide the administrator with at least limited control over which tags get produces.
Application developers have a choice as to what information to make available in the tags and what in the message body. In some cases, the information may be duplicated between the two. This is acceptable.
The general guideline is to put all required information in the message body and make appropriate information available via tags. In other words, the message should provide sufficient meaning even when all tags are disabled. Tags merely provide additional useful information and a way to present it in a standard, easily filtered, form.
The following are two valid examples of a message where both the message and the message tags contain a MAC address. Example with tags disabled:
11: host.cisco.com: Jun 13 2003 23:11:52.454 UTC: %BACC-3-BAD_REQUEST: Bad request received from device [1,6,aa:bb:11:22:33:aa]. Missing header.
In the above example, the MAC address appears as part of the message field – it is not a tag. In the following example, the tags are enabled. Even though MAC address is duplicated between the tag and the message, it is acceptable.
11: host.cisco.com: Jun 13 2003 23:11:52.454 UTC: %BACC-3-BAD_REQUEST: %[mac=1,6,aa:bb:11:22:33:aa][tid=thread1][txn=mytxn123]: Bad request received from device [1,6,aa:bb:11:22:33:aa]. Missing header.
Process identification tag
One of the standard tags, pname.orig, is used to identify the logical process name which originates the message. Any application that seeks to provide originating process information must do so using the "pname.orig" tag.
This tag is extremely valuable in addition to information in the APPNAME field because some applications consist of multiple processes, each of which may originate logging messages. It is recommended that any application which consists of multiple processes always provide the "pname.orig" tag.
MESSAGE field
The MESSAGE field provides a descriptive message about the logging event. This field may consist of one or more characters. The character set is limited to printable US ASCII characters (ASCII decimal values 32-126) and foreign language characters.
Use of non-printable (control) ASCII characters is not permitted in the MESSAGE field. Control characters include characters with ASCII decimal values 0-31 and 127. If application provides a CiscoLog-compliant library with message string, which includes one or more control characters, the logging library must do the following. If the horizontal tab character (ASCII decimal value 9) is encountered, it must be replaced with one or more space characters (ASCII decimal value 32). Eight spaces per tab are recommended because this is a convention on most Unix and Windows platforms. Other control characters must each be replaced with a question mark character (ASCII decimal value 63).
The maximum length of the MESSAGE field is constrained only by the maximum length of the entire message. The maximum length of the CiscoLog message must not exceed 800 octets. Another practical limitation is a potentially highly variable length of the TAGS field.
Message text may contain substitutable parameters, which provide necessary details about the message. For example, the IP address in the following example is a substitutable parameter.
11: host.cisco.com: Jun 13 2003 23:11:52.454 UTC: %BACC-3-INVALID_REQUEST: Invalid request received from device [1.22.111.222]. Missing header.
It is recommended (but not required) that substitutable parameters be surrounded by bracket characters "[" and "]" as in the above example. It is further recommended that the message text and values of substitutable parameters do not include bracket characters. When it is not possible to avoid brackets characters in the values of substitutable parameters, it is recommended that the value at least does not include unbalances brackets (like an opening bracket without a closing one). When these recommendations are followed, it would be possible to programmatically extract substitutable parameter values out of a CiscoLog message. However, this recommendation is not a strict requirement.
Message text should be spell-checked. Editorial review is recommended. This includes all messages that can be seen by the customers, even debugging messages.
If the first word of the message is an English word, the first letter should be capitalized. Single sentence messages do not require a period at the end.
Internationalization
Foreign language characters are defined as characters with ASCII decimal values 0-126. Foreign language characters are supported in the HOST field, the value part of the TAGS field and the MESSAGE field.
Foreign language characters must be encoded using the Unicode standard UTF-8. UTF-8 provides encoding for any language without requiring the application to know local encoding/decoding rules for a particular language. In fact, the application encoding the message does not even need to know the language of the message. UTF-8 can encode any Unicode character.
UTF-8 encodes US ASCII characters exactly as they would normally be encoded in a 7-bit ASCII convention. This means that applications interpreting CiscoLog messages can assume that entire messages are encoded in UTF-8. On the other hand, applications producing CiscoLog messages can encode the entire message using US-ASCII 7-bit convention if they are known not to support foreign languages in their products.
Since UTF-8 can encode characters in any language, it is possible to mix and match languages. For example, it is anticipated that a one use-case would be the inclusion of just some parameters in foreign language in an otherwise English message. For example, an English message about user authentication could have a username in Japanese. Similarly, any number of languages can be combined in a CiscoLog message.
In order to take advantage of messages, which include a foreign language, a log viewer capable of interpreting UTF-8 would be necessary. Most likely, the log viewer would also require that the appropriate language fonts be installed on a given system. In a US-ASCII only editor, the user will see garbage for non-US-ASCII characters encoded in UTF-8, but should be able to see all US-ASCII text.
Internationalization support can be readily used with CiscoLog messages written to a local file. Syslog RFC 3164, however, does not currently define foreign language support. Thus, in order to take advantage of internationalization with a syslog server, one would need to use a server implementation, which was tested to correctly relay or store all 8-bits of each octet unchanged. This would ensure that UTF-8 encoded parts of the message retain all their information when foreign languages are used.
In UTF-8, a single character is encoded with one or more octets. The CiscoLog message length limit is specified as 800 octets. Developers must be aware that with foreign languages, the 800-octet length limit may mean fewer than 800 characters. When a message is split into a multi-part message, octets belonging to a single character must never be split into separate lines.
CiscoLog does not provide any versioning information in the message format. Extensions to the format must be made within the restrictions of the format. CiscoLog message formats provides for extensions by way of defining additional tags.
If applications require changes to existing messages, the value of APPNAME can redefine message within the new space. For example, the application version can be appended to the application name as BACC_2_5 for BACC 2.5.
Preconfigured system alarm notifications
There are preconfigured system alerts in RTMT. Refer to the Real-Time Monitoring Tool Administration Guide for information on configuration.
Authentication validates the user ID and password that are submitted during log in. An alarm gets raised when an invalid user ID and/or the password gets used.
Table 3 Default configuration for the AuthenticationFailed RTMT alert
Value
Default Configuration
Enable Alert
Selected
Severity
Critical
Enable/Disable this alert on the following servers
Enabled on listed servers
Threshold
Number of AuthenticationFailed events exceeds:
1 time in the last 1 minute
Duration
Trigger alert immediately
Frequency
Trigger alert on every poll
Schedule
24 hours daily
Enable e-mail
Selected
Trigger Alert Action
Default
CiscoDRFFailure
This alert occurs when the DRF backup or restore process encounters errors.
Table 4 Default configuration for the CiscoDRFFailure RTMT alert
Value
Default Configuration
Enable Alert
Selected
Severity
Critical
Enable/Disable this alert on the following servers
Enabled on listed servers
Threshold
Trigger alert when following condition met:
CiscoDRFFailure event generated
Duration
Trigger alert immediately
Frequency
Trigger alert on every poll
Schedule
24 hours daily
Enable E-mail
Selected
Trigger Alert Action
Default
CoreDumpFileFound
This alert occurs when the CoreDumpFileFound event gets generated. This indicates that a core dump file exists in the system.
Table 5 Default configuration for the CoreDumpFileFound RTMT alert
Value
Default Configuration
Enable Alert
Selected
Severity
Critical
Enable/Disable this alert on the following servers
Enabled on listed servers
Threshold
Trigger alert when following condition met:
CoreDumpFileFound event generated
Duration
Trigger alert immediately
Frequency
Trigger alert on every poll
Schedule
24 hours daily
Trace download Parameters
Not Selected
Enable E-mail
Selected
Trigger Alert Action
Default
CpuPegging
CPU usage gets monitored based on configured thresholds. If the usage goes above the configured threshold, this alert gets generated.
Table 6 Default configuration for the CpuPegging RTMT alert
Value
Default Configuration
Enable Alert
Selected
Severity
Critical
Enable/Disable this alert on the following servers
Enabled on listed servers
Threshold
Trigger alert when following condition met:
99%
Duration
Trigger alert only when value remains constantly below or over threshold for 60 seconds
Frequency
Trigger up to 3 alerts within 30 minutes
Schedule
24 hours daily
Enable E-mail
Selected
Trigger Alert Action
Default
CriticalServiceDown
The CriticalServiceDown alert gets generated when the service status equals down (not for other states).
Table 7 Default configuration for the CriticalServiceDown RTMT alert
Value
Default Configuration
Enable Alert
Selected
Severity
Critical
Enable/Disable this alert on the following servers
Enabled on listed servers
Threshold
Trigger alert when following condition met:
Service status is DOWN
Duration
Trigger alert immediately
Frequency
Trigger alert on every poll
Schedule
24 hours daily
Trace download Parameters
Enable Trace Download not selected
Enable E-mail
Selected
Trigger Alert Action
Default
HardwareFailure
This alert occurs when a hardware failure event (disk drive failure, power supply failure, and others) triggers.
Table 8 Default configuration for the HardwareFailure RTMT alert
Value
Default Configuration
Enable Alert
Selected
Severity
Critical
Enable/Disable this alert on the following servers
Enabled on listed servers
Threshold
Trigger alert when following condition met:
HardwareFailure event generated
Duration
Trigger alert immediately
Frequency
Trigger alert on every poll
Schedule
24 hours daily
Enable E-mail
Selected
Trigger Alert Action
Default
LogFileSearchStringFound
This alert occurs when the LogFileSearchStringFound event gets generated. This indicates that the search string was found in the log file.
Table 9 Default configuration for the LogFileSearchStringFound RTMT alert
Value
Default Configuration
Enable Alert
Selected
Severity
Warning
Enable/Disable this alert on the following servers
Enabled on listed servers
Threshold
Trigger alert when following condition met:
LogFileSearchStringFound event generated
Duration
Trigger alert immediately
Frequency
Trigger alert on every poll
Schedule
24 hours daily
Enable E-mail
Selected
Trigger Alert Action
Default
LogPartitionHighWaterMarkExceeded
This alert occurs when the percentage of used disk space in the log partition exceeds the configured high water mark. When this alert gets generated, LPM deletes files in the log partition (down to low water mark) to avoid running out of disk space.
Note
LPM may delete files that you want to keep. You should act immediately when you receive the LogPartitionHighWaterMarkExceeded alert.
Table 10 Default configuration for the LogPartitionHighWaterMarkExceededRTMT alert
Value
Default Configuration
Enable Alert
Selected
Severity
Critical
Enable/Disable this alert on the following servers
Enabled on listed servers
Threshold
Trigger alert when following condition met:
Log Partition Used Disk Space Exceeds High Water Mark (95%)
Duration
Trigger alert immediately
Frequency
Trigger alert on every poll
Schedule
24 hours daily
Enable E-mail
Selected
Trigger Alert Action
Default
LogPartitionLowWaterMarkExceeded
This alert occurs when the LogPartitionLowWaterMarkExceeded event gets generated. This indicates that the percentage of used disk space in the log partition exceeded the configured low water mark.
Note
Be aware that this alert is an early warning. The administrator should start freeing up disk space. Using RTMT/TLC, you can collect trace/log files and delete them from the server. The administrator should adjust the number of trace files that are kept to avoid hitting the low water mark again.
Table 11 Default configuration for the LogPartitionLowWaterMarkExceeded RTMT alert
Value
Default Configuration
Enable Alert
Selected
Severity
Critical
Enable/Disable this alert on the following servers
Enabled on listed servers
Threshold
Trigger alert when following condition met:
Log Partition Used Disk Space Exceeds Low Water Mark (90%)
Duration
Trigger alert immediately
Frequency
Trigger alert on every poll
Schedule
24 hours daily
Enable E-mail
Selected
Trigger Alert Action
Default
LowActivePartitionAvailableDiskSpace
This alert occurs when the percentage of available disk space on the active partition is lower than the configured value.
Table 12 Default configuration for the LowActivePartitionAvailableDiskSpaceRTMT alert
Value
Default Configuration
Enable Alert
Selected
Severity
Critical
Enable/Disable this alert on the following servers
Enabled on listed servers
Threshold
Trigger alert when following condition met:
Active Partition available diskspace below (4%)
Duration
Trigger alert immediately
Frequency
Trigger up to 3 alerts within 30 minutes
Schedule
24 hours daily
Enable E-mail
Selected
Trigger Alert Action
Default
LowAvailableVirtualMemory
RTMT monitors virtual memory usage. When memory runs low, a LowAvailableVirtualMemory alert gets generated.
Table 13 Default configuration for the LowAvailableVirtualMemory RTMT alert
Value
Default Configuration
Enable Alert
Selected
Severity
Critical
Enable/Disable this alert on the following servers
Enabled on listed servers
Threshold
Trigger alert when following condition met:
Available virtual memory below (30%)
Duration
Trigger alert immediately
Frequency
Trigger up to 3 alerts within 30 minutes
Schedule
24 hours daily
Enable E-mail
Selected
Trigger Alert Action
Default
LowInactivePartitionAvailableDiskSpace
This alert occurs when the percentage of available disk space of the inactive partition equals less than the configured value.
Table 14 Default configuration for the LowInactivePartitionAvailableDiskSpace RTMT alert
Value
Default Configuration
Enable Alert
Selected
Severity
Critical
Enable/Disable this alert on the following servers
Enabled on listed servers
Threshold
Trigger alert when following condition met:
Inactive Partition available disk space below (4%)
Duration
Trigger alert immediately
Frequency
Trigger up to 3 alerts within 30 minutes
Schedule
24 hours daily
Enable E-mail
Selected
Trigger Alert Action
Default
LowSwapPartitionAvailableDiskSpace
This alert indicates that the available disk space on the swap partition is low.
Note
The swap partition makes up part of virtual memory, so low available swap partition disk space means low virtual memory as well.
Table 15 Default configuration for the LowSwapPartitionAvailableDiskSpaceRTMT alert
Value
Default Configuration
Enable Alert
Selected
Severity
Critical
Enable/Disable this alert on the following servers
Enabled on listed servers
Threshold
Trigger alert when following condition met:
Swap Partition available disk space below (105)
Duration
Trigger alert immediately
Frequency
Trigger up to 3 alerts within 30 minutes
Schedule
24 hours daily
Enable E-mail
Selected
Trigger Alert Action
Default
ServerDown
This alert occurs when a remote node cannot be reached.
Note
Cisco Unified CM clusters only—The ServerDown alert gets generated when the currently "active" AMC (primary AMC or the backup AMC, if the primary is not available) cannot reach another server in a cluster. This alert identifies network connectivity issues in addition to a server down condition.
Table 16 Default configuration for the ServerDown RTMT alert
Value
Default Configuration
Enable Alert
Selected
Severity
Critical
Enable/Disable this alert on the following servers
Enabled on listed servers
Threshold
Trigger alert when following condition met:
ServerDown occurred
Duration
Trigger alert immediately
Frequency
Trigger up to 1 alert within 60 minutes
Schedule
24 hours daily
Enable E-mail
Selected
Trigger Alert Action
Default
SparePartitionHighWaterMarkExceeded
This alert occurs when the SparePartitionHighWaterMarkExceeded event gets generated. It indicates that the percentage of used disk space in the spare partition exceeds the configured high water mark. Some core file or log files are purged until the percentage of used disk space in the spare partition is below the configured low water mark. Check if the configured high water mark for used disk space in the spare partition is too low.
Cisco Log Partition Monitoring Tool (LPM) starts purging trace log files in the spare partition and keeps deleting trace log files in the spare partition until spare partition disk usage is just below the low water mark.
Name of the service generating this alarm is Cisco Log Partition Monitoring Tool.
Check if the configured high water mark for used disk space in the spare partition is too low; if it is, change the high water mark setting to a higher value. Also examine each application trace log files under spare partition and delete those trace log files that are too old or too big.
Note
Spare Partition is not used for Intercompany Media Engine server. So this alert will not be triggered for Intercompany Media Engine.
Table 17 Default configuration for the SparePartitionHighWaterMarkExceeded RTMT alert
Value
Default Configuration
Enable Alert
Selected
Severity
Critical
Enable/Disable this alert on the following servers
Enabled on listed servers
Threshold
Trigger alert when following condition met:
Spare Partition Used Disk Space Exceeds High Water Mark (95%)
Duration
Trigger alert immediately
Frequency
Trigger alert on every poll
Schedule
24 hours daily
Enable E-mail
Selected
Trigger Alert Action
Default
SparePartitionLowWaterMarkExceeded
This alert occurs when the SparePartitionLowWaterMarkExceeded event gets generated. It indicates that the percentage of used disk space in the spare partition has exceeded the configured low water mark threshold. There are files to be purged by Cisco Log Partition Monitoring Tool (LPM). If the spare partition disk usage keeps increasing until it exceeded the configured high water mark, Cisco LPM starts purging the trace log files in the spare partition. Cisco LPM sends the alarm periodically if the spare partition disk usage has not changed.
Name of the service generating this alarm is Cisco Log Partition Monitoring Tool.
Check if the configured low water mark for used disk space in the spare partition is too low; if, change the low/high water mark settings to the higher values. Also examine each application trace log files under spare partition and clean up those trace log files that are too old or too big before the used disk space exceeds the high water mark.
Note
Spare Partition is not used for Intercompany Media Engine server. So this alert will not be triggered for Intercompany Media Engine.
Table 18 Default configuration for the SparePartitionLowWaterMarkExceededRTMT alert
Value
Default Configuration
Enable Alert
Selected
Severity
Critical
Enable/Disable this alert on the following servers
Enabled on listed servers
Threshold
Trigger alert when following condition met:
Spare Partition Used Disk Space Exceeds Low Water Mark (90%)
Duration
Trigger alert immediately
Frequency
Trigger alert on every poll
Schedule
24 hours daily
Enable E-mail
Selected
Trigger Alert Action
Default
SyslogSeverityMatchFound
This alert occurs when the SyslogSeverityMatchFound event gets generated. This indicates that a syslog message with the matching severity level exists.
Table 19 Default configuration for the SyslogSeverityMatchFound RTMT alert
Value
Default Configuration
Enable Alert
Selected
Severity
Critical
Enable/Disable this alert on the following servers
Enabled on listed servers
Threshold
Trigger alert when following condition met:
SyslogSeverityMatchFound event generated
Duration
Trigger alert immediately
Frequency
Trigger alert on every poll
Schedule
24 hours daily
Syslog Severity Parameters
Critical
Enable E-mail
Selected
Trigger Alert Action
Default
SyslogStringMatchFound
This alert occurs when the SyslogStringMatchFound event gets generated. The alert indicates that a syslog message with the matching search string exists.
Table 20 Default configuration for the SyslogStringMatchFound RTMT alert
Value
Default Configuration
Enable Alert
Selected
Severity
Critical
Enable/Disable this alert on the following servers
Enabled on listed servers
Threshold
Trigger alert when following condition met:
SyslogStringMatchFound event generated
Duration
Trigger alert immediately
Frequency
Trigger alert on every poll
Schedule
24 hours daily
Syslog Alert Parameters
(Text box for search string)
Enable E-mail
Selected
Trigger Alert Action
Default
SystemVersionMismatched
This alert occurs when a mismatch in system version exists.
Table 21 Default configuration for the SystemVersionMismatched RTMT alert
Value
Default Configuration
Enable Alert
Selected
Severity
Critical
Enable/Disable this alert on the following servers
Enabled on listed servers
Threshold
Trigger alert when following condition met:
SystemVersionMismatched occurred
Duration
Trigger alert immediately
Frequency
Trigger up to 1 alert within 60 minutes
Schedule
24 hours daily
Enable E-mail
Selected
Trigger Alert Action
Default
TotalProcessesAndThreadsExceededThreshold
This alert occurs when the TotalProcessesAndThreadsExceededThreshold event gets generated. The alert indicates that the current total number of processes and threads exceeds the maximum number of tasks that are configured for the Cisco RIS Data Collector Service Parameter. This situation could indicate that a process is leaking or that a process has thread leaking.
Table 22 Default configuration for the TotalProcessesAndThreadsExceededThreshold RTMT alert
Value
Default Configuration
Enable Alert
Selected
Severity
Critical
Enable/Disable this alert on the following servers
The following list comprises the preconfigured CallManager alerts in RTMT. Refer to the Real-Time Monitoring Tool Administration Guide for information on configuration.
This alert occurs when the BeginThrottlingCallListBLFSubscriptions event gets generated. This indicates that the Cisco Unified Communications Manager initiated a throttling of the CallList BLF Subscriptions to prevent a system overload.
Table 23 Default configuration for the BeginThrottlingCallListBLFSubscriptions RTMT alert
Value
Default Configuration
Enable Alert
Selected
Severity
Critical
Enable/Disable this alert on the following servers
This alert occurs when the percentage of CPU load on a call processing server exceeds the configured percentage for the configured time.
Note
If the administrator takes no action, high CPU pegging can lead to a crash, especially in CallManager service. CoreDumpFound and CriticalServiceDown alerts might also get issued.
The CallProcessingNodeCpuPegging alert gives you time to work proactively to avoid a Cisco Unified Communications Manager crash.
Table 24 Default configuration for the CallProcessingNodeCpuPegging RTMT alert
Value
Default Configuration
Enable Alert
Selected
Severity
Critical
Enable/Disable this alert on the following servers
Enabled on listed servers
Threshold
Trigger alert when following condition met:
Processor load over (90%)
Duration
Trigger alert only when value constantly below or over threshold for 60 seconds
Frequency
Trigger up to 3 alerts within 30 minutes
Schedule
24 hours daily
Enable E-mail
Selected
Trigger Alert Action
Default
CDRAgentSendFileFailed
This alert gets raised when the CDR Agent cannot send CDR files from a Cisco Unified Communications Manager node to a CDR repository node within the Cisco Unified Communications Manager cluster.
Table 25 Default configuration for the CDRAgentSendFileFailed RTMT alert
Value
Default Configuration
Enable Alert
Selected
Severity
Critical
Enable/Disable this alert on the following servers
Enabled on listed servers
Threshold
Trigger alert when following condition met:
CDRAgentSendFileFailed event generated
Duration
Trigger alert immediately
Frequency
Trigger alert on every poll
Schedule
24 hours daily
Enable E-mail
Selected
Trigger Alert Action
Default
CDRFileDeliveryFailed
This alert gets raised when(s) FTP delivery of CDR files to the outside billing server fails.
Table 26 Default configuration for the CDRFileDeliveryFailed RTMT alert
Value
Default Configuration
Enable Alert
Selected
Severity
Critical
Enable/Disable this alert on the following servers
Enabled on listed servers
Threshold
Trigger alert when following condition met:
CDRFileDeliveryFailed event generated
Duration
Trigger alert immediately
Frequency
Trigger alert on every poll
Schedule
24 hours daily
Enable E-mail
Selected
Trigger Alert Action
Default
CDRHighWaterMarkExceeded
This alert gets raised when the high water mark for CDR files gets exceeded. It also indicates that some successfully delivered CDR files got deleted.
Table 27 Default configuration for the CDRHighWaterMarkExceeded RTMT alert
Value
Default Configuration
Enable Alert
Selected
Severity
Critical
Enable/Disable this alert on the following servers
Enabled on listed servers
Threshold
Trigger alert when following condition met:
CDRHighWaterMarkExceeded event generated
Duration
Trigger alert immediately
Frequency
Trigger alert on every poll
Schedule
24 hours daily
Enable E-mail
Selected
Trigger Alert Action
Default
CDRMaximumDiskSpaceExceeded
This alarm gets raised when the CDR files disk usage exceeds the maximum disk allocation. It also indicates that some undeliverable files got deleted.
Table 28 Default Configuration for the CDRMaximumDiskSpaceExceeded RTMT Alert
Value
Default Configuration
Enable Alert
Selected
Severity
Critical
Enable/Disable this alert on the following servers
Enabled on listed servers
Threshold
Trigger alert when following condition met:
CDRMaximumDiskSpaceExceeded event generated
Duration
Trigger alert immediately
Frequency
Trigger alert on every poll
Schedule
24 hours daily
Enable E-mail
Selected
Trigger Alert Action
Default
CodeYellow
The AverageExpectedDelay counter represents the current average expected delay to handle any incoming message. If the value exceeds the value that is specified in Code Yellow Entry Latency service parameter, the CodeYellow alarm gets generated. You can configure the CodeYellow alert to download trace files for troubleshooting purposes.
Table 29 Default Configuration for the CodeYellow RTMT Alert
Value
Default Configuration
Enable Alert
Selected
Severity
Critical
Enable/Disable this alert on the following servers
Enabled on listed servers
Threshold
Trigger alert when following condition met:
Cisco CallManager CodeYellowEntry event generated
Duration
Trigger alert immediately
Frequency
Trigger alert on every poll
Schedule
24 hours daily
Trace Download Parameters
Enable Trace Download not selected
Enable E-mail
Selected
Trigger Alert Action
Default
DBChangeNotifyFailure
This alert occurs when the Cisco Database Notification Service experiences problems and might stop. This condition indicates change notification requests that are queued in the database got stuck and changes made to the system will not take effect. Ensure that the Cisco Database Layer Monitor is running on the node where the alert exists. If it is, restart the service. If that does not return this alert to safe range, collect the output of show tech notify and show tech dbstateinfo and contact TAC for information about how to proceed.
Table 30 Default configuration for the DBChangeNotifyFailure RTMT alert
Value
Default Configuration
Enable Alert
Selected
Severity
Critical
Enable/Disable this alert on the following servers
Enabled on listed servers
Threshold
Trigger alert when following condition met:
DBChangeNotify queue delay over 2 minutes
Duration
Trigger alert immediately
Frequency
Trigger up to 1 alert within 30 minutes
Schedule
24 hours daily
Enable E-mail
Selected
Trigger Alert Action
Default
DBReplicationFailure
This alarm indicates a failure in IDS replication and requires database administrator intervention.
Note
Be aware that DBReplicationFailure is based on the replication status perfmon counter (instead of DBReplicationFailure alarm as was previously the case). This alert gets triggered whenever the corresponding replication status perfmon counter specifies a value of 3 (Bad Replication) or 4 (Replication Setup Not Successful).
Table 31 Default Configuration for the DBReplicationFailure RTMT Alert
Value
Default Configuration
Enable Alert
Selected
Severity
Critical
Enable/Disable this alert on the following servers
Enabled on listed servers
Threshold
Trigger alert when following condition met:
DBReplicationFailure occurred
Duration
Trigger alert immediately
Frequency
Trigger up to 1 alert within 60 minutes
Schedule
24 hours daily
Enable E-mail
Selected
Trigger Alert Action
Default
DDRBlockPrevention
This alert gets triggered when the IDSReplicationFailure alarm with alarm number 31 occurs, which invokes a proactive procedure to avoid denial of service. This procedure does not impact call processing; you can ignore replication alarms during this process.
The procedure takes up to 60 minutes to finish. Check that RTMT replication status equals 2 on each node to make sure that the procedure is complete. Do not perform a system reboot during this process.
Table 32 Default Configuration for the DDRBlockPrevention RTMT Alert
Value
Default Configuration
Enable Alert
Selected
Severity
Critical
Enable/Disable this alert on the following servers
Enabled on listed servers
Threshold
Trigger alert when following condition met:
IDSReplicationFailure alarm with alarm number 31 generated
Duration
Trigger alert immediately
Frequency
Trigger up to 1 alert within 60 minutes
Schedule
24 hours daily
Enable E-mail
Selected
Trigger Alert Action
Default
DDRDown
This alert gets triggered when the IDSReplicationFailure alarm with alarm number 32 occurs. An auto recover procedure runs in the background, and no action is needed.
The procedure takes about 15 minutes to finish. Check that RTMT replication status equals 2 on each node to make sure the procedure is complete.
Table 33 Default Configuration for the DDRDown RTMT Alert
Value
Default Configuration
Enable Alert
Selected
Severity
Critical
Enable/Disable this alert on the following servers
Enabled on listed servers
Threshold
Trigger alert when following condition met:
IDSReplicationFailure alarm with alarm number 32 generated
Duration
Trigger alert immediately
Frequency
Trigger up to 1 alert within 60 minutes
Schedule
24 hours daily
Enable E-mail
Selected
Trigger Alert Action
Default
ExcessiveVoiceQualityReports
This alert gets generated when the number of QRT problems that are reported during the configured time interval exceed the configured value. The default threshold specifies 0 within 60 minutes.
Table 34 Default Configuration for the ExcessiveVoiceQualityReports RTMT Alert
Value
Default Configuration
Enable Alert
Selected
Severity
Critical
Enable/Disable this alert on the following servers
Enabled on listed servers
Threshold
Trigger alert when following condition met:
Number of quality reports exceeds 0 times within the last 60 minutes
Duration
Trigger alert immediately
Frequency
Trigger alert on every poll
Schedule
24 hours daily
Enable E-mail
Selected
Trigger Alert Action
Default
IMEDistributedCacheInactive
This alarm gets generated when a Cisco Unified Communications Manager attempts to connect to the Cisco IME server, but the IME distributed cache is not currently active.
Ensure that the certificate for the Cisco IME server is provisioned and that the IME distributed cache has been activated via the CLI.
Default Configuration
Table 35 Default Configuration for the IMEDistributedCacheInactive RTMT Alert
Value
Default Configuration
Enable Alert
Selected
Severity
Error
Enable/Disable this alert on the following servers
Enabled on listed servers
Threshold
Trigger alert when following condition met:
Inactive IME Distributed Cache
Duration
Trigger alert immediately
Frequency
Trigger alert on every poll
Schedule
24 hours daily
Enable E-mail
Selected
Trigger Alert Action
Default
IMEOverQuota
This alert indicates that the Cisco Unified Communications Manager servers that use this Cisco IME service have exceed the quota for published direct inward dialing numbers (DIDs) to the IME distributed cache. The alert includes the name of the Cisco IME server as well as the current and target quota values.
Ensure that you have correctly provisioned the DID prefixes on all of the Cisco Unified Communications Manager servers that use this Cisco IME service.
If you have provisioned the prefixes correctly, you have exceeded the capacity of your Cisco IME service, and you need to configure another service and divide the DID prefixes across the Cisco IME client instances (Cisco Unified Communications Managers) on different Cisco IME services.
Default Configuration
Table 36 Default Configuration for the IMEOverQuota Alert
Value
Default Configuration
Enable Alert
Selected
Severity
Alert
Enable/Disable this alert on the following servers
Enabled on listed servers
Threshold
Trigger alert when following condition met:
VAP Over Quota
Duration
Trigger alert immediately
Frequency
Trigger alert on every poll
Schedule
24 hours daily
Enable E-mail
Selected
Trigger Alert Action
Default
IMEQualityAlert
This alert gets generated when Cisco Unified Communications Manager determines that a substantial number of Cisco IME calls fail back to PSTN or fail to be set up due to IP network quality problems. Two types of events trigger this alert:
A large number of the currently active Cisco IME calls have all requested fallback or have fallen back to the PSTN.
A large number of the recent call attempts have gone to the PSTN and not been made over IP.
When you receive this alert, check your IP connectivity. If no problems exist with the IP connectivity, you may need to review the CDRs, CMRs, and logs from the firewalls to determine why calls have fallen back to the PSTN or have not been made over IP.
Default Configuration
Table 37 Default Configuration for the IMEQualityAlert Alert
Value
Default Configuration
Enable Alert
Selected
Severity
Error
Enable/Disable this alert on the following servers
Enabled on listed servers
Threshold
Trigger alert when following condition met:
Cisco IME link quality problem
Duration
Trigger alert immediately
Frequency
Trigger alert on every poll
Schedule
24 hours daily
Enable E-mail
Selected
Trigger Alert Action
Default
InsufficientFallbackIdentifiers
This alert gets generated when too many Cisco IME calls that are currently in progress use the same fallback DID and no more DTMF digit sequences exist to allocate to a new Cisco IME call that Cisco Unified Communications Manager is processing. The new call continues, but the call cannot fallback to the PSTN if voice-quality deteriorates.
If this alert gets generated, note the fallback profile that associates with this call. Check that profile in Cisco Unified Communications Manager Administration, and examine the current setting for the "Fallback Number of Correlation DTMF Digits" field. Increase the value of that field by one, and check whether the new value eliminates these alerts. In general, this parameter should be large enough so that the number of simultaneous Cisco IME calls that are made to enrolled numbers that associate with that profile is always substantially less than 10 raised to the power of this number. For example, if you always have fewer than 10,000 simultaneous Cisco IME calls for the patterns that associate with this fallback profile, setting this value to 5 (10 to the power of 5 equals 100,000) should keep Cisco Unified Communications Manager from generating this alert.
However, increasing this value results in a small increase in the amount of time it takes to perform the fallback. As such, you should set the "Fallback Number of Correlation DTMF Digits" field to a value just large enough to prevent this alert from getting generated.
Instead of increasing the value of the DTMF digits field, you can add another fallback profile with a different fallback DID and associate that fallback profile with a smaller number of enrolled patterns. If you use this method, you can use a smaller number of digits.
Default Configuration
Table 38 Default Configuration for the InsufficientFallbackIdentifiers Alert
Value
Default Configuration
Enable Alert
Selected
Severity
Error
Enable/Disable this alert on the following servers
Enabled on listed servers
Threshold
Trigger alert when following condition met:
Cannot allocate fallback identifier
Duration
Trigger alert immediately
Frequency
Trigger up to 1 alerts within one minute
Schedule
24 hours daily
Enable E-mail
Selected
Trigger Alert Action
Default
IMEServiceStatus
This alert indicates the overall health of the connection to the Cisco IME services for a particular Cisco IME client instance (Cisco Unified Communications Manager). The alert indicates the following states:
0—Unknown. Likely indicates that the Cisco IME service has not been activated.
1—Healthy. Indicates that the Cisco Unified Communications Manager has successfully established a connection to its primary and backup servers for the Cisco IME client instance, if configured.
2—Unhealthy. Indicates that the Cisco IME has been activated but has not successfully completed handshake procedures with the Cisco IME server. Note that this counter reflects the handshake status of both the primary and the secondary IME servers.
Default Configuration
Table 39 Default Configuration for the IMEServiceStatus Alert
Value
Default Configuration
Enable Alert
Selected
Severity
Critical
Enable/Disable this alert on the following servers
Enabled on listed servers
Threshold
Trigger alert when following condition met:
VAP Connection Problem
Duration
Trigger alert immediately
Frequency
Trigger up to 1 alert every 60 minutes
Schedule
24 hours daily
Enable E-mail
Selected
Trigger Alert Action
Default
InvalidCredentials
The alert indicates that the Cisco Unified Communications Manager cannot connect to the Cisco IME server because the username and/or password configured on Cisco Unified Communications Manager do not match those configured on the Cisco IME server.
The alert includes the username and password that were used to connect to the Cisco IME server as well as the IP address and name of the target Cisco IME server. To resolve this alert, log into the Cisco IME server and check that the configured username and password match the username and password that are configured in Cisco Unified Communications Manager.
Default Configuration
Table 40 Default Configuration for the InvalidCredentials Alert
Value
Default Configuration
Enable Alert
Selected
Severity
Alert
Enable/Disable this alert on the following servers
Enabled on listed servers
Threshold
Trigger alert when following condition met:
Credential Failure to Cisco IME server
Duration
Trigger alert immediately
Frequency
Trigger alert on every poll
Schedule
24 hours daily
Enable E-mail
Selected
Trigger Alert Action
Default
LowCallManagerHeartbeatRate
This alert occurs when the CallManager heartbeat rate equals less than the configured value.
Table 41 Default Configuration for the LowCallManagerHeartbeatRate RTMT Alert
Value
Default Configuration
Enable Alert
Selected
Severity
Warning
Enable/Disable this alert on the following servers
Enabled on listed servers
Threshold
Trigger alert when following condition met:
CallManager Server heartbeat rate below 24 beats per minute.
Duration
Trigger alert immediately
Frequency
Trigger alert on every poll
Schedule
24 hours daily
Enable E-mail
Selected
Trigger Alert Action
Default
LowTFTPServerHeartbeatRate
This alert occurs when TFTP server heartbeat rate equals less than the configured value.
Table 42 Default Configuration for the LowTFTPServerHeartbeatRate RTMT Alert
Value
Default Configuration
Enable Alert
Selected
Severity
Warning
Enable/Disable this alert on the following servers
Enabled on listed servers
Threshold
Trigger alert when following condition met:
TFTP server heartbeat rate below 24 beats per minute
Duration
Trigger alert immediately
Frequency
Trigger alert on every poll
Schedule
24 hours daily
Enable E-mail
Selected
Trigger Alert Action
Default
MaliciousCallTrace
This indicates that a malicious call exists in Cisco Unified Communications Manager. The malicious call identification (MCID) feature gets invoked.
Table 43 Default Configuration for the MaliciousCallTrace RTMT Alert
Value
Default Configuration
Enable Alert
Selected
Severity
Critical
Enable/Disable this alert on the following servers
Enabled on listed servers
Threshold
Trigger alert when following condition met:
Malicious call trace generated
Duration
Trigger alert immediately
Frequency
Trigger alert on every poll
Schedule
24 hours daily
Enable E-mail
Selected
Trigger Alert Action
Default
MediaListExhausted
This alert occurs when the number of MediaListExhausted events exceeds the configured threshold during the configured time interval. This indicates that all available media resources that are defined in the media list are busy. The default specifies 0 within 60 minutes.
Table 44 Default Configuration for the MediaListExhausted RTMT Alert
Value
Default Configuration
Enable Alert
Selected
Severity
Warning
Enable/Disable this alert on the following servers
Enabled on listed servers
Threshold
Trigger alert when following condition met:
Number of MediaListExhausted events exceeds 0 times within the last 60 minutes
Duration
Trigger alert immediately
Frequency
Trigger alert on every poll
Schedule
24 hours daily
Enable E-mail
Selected
Trigger Alert Action
Default
MgcpDChannelOutOfService
This alert gets triggered when the MGCP D-Channel remains out of service.
Table 45 Default Configuration for the MgcpDChannelOutOfService RTMT Alert
Value
Default Configuration
Enable Alert
Selected
Severity
Critical
Enable/Disable this alert on the following servers
Enabled on listed servers
Threshold
Trigger alert when following condition met:
MGCP D-Channel is out-of-service
Duration
Trigger alert immediately
Frequency
Trigger alert on every poll
Schedule
24 hours daily
Enable E-mail
Selected
Trigger Alert Action
Default
NumberOfRegisteredDevicesExceeded
This alert occurs when the NumberOfRegisteredDevicesExceeded event gets generated.
Table 46 Default Configuration for the NumberOfRegisteredDevicesExceeded RTMT Alert
Value
Default Configuration
Enable Alert
Selected
Severity
Critical
Enable/Disable this alert on the following servers
Enabled on listed servers
Threshold
Trigger alert when following condition met:
NumberOfRegisteredDevicesExceeded event generated
Duration
Trigger alert immediately
Frequency
Trigger alert on every poll
Schedule
24 hours daily
Enable E-mail
Selected
Trigger Alert Action
Default
NumberOfRegisteredGatewaysDecreased
This alert occurs when the number of registered gateways in a cluster decreases between consecutive polls.
Table 47 Default Configuration for the NumberOfRegisteredGatewaysDecreased RTMT Alert
Value
Default Configuration
Enable Alert
Selected
Severity
Critical
Enable/Disable this alert on the following servers
Enabled on listed servers
Threshold
Trigger alert when following condition met:
Number of registered gateway decreased
Duration
Trigger alert immediately
Frequency
Trigger alert on every poll
Schedule
24 hours daily
Enable E-mail
Selected
Trigger Alert Action
Default
NumberOfRegisteredGatewaysIncreased
This alert occurs when the number of registered gateways in the cluster increased between consecutive polls.
Table 48 Default Configuration for the NumberOfRegisteredGatewaysIncreased RTMT Alert
Value
Default Configuration
Enable Alert
Selected
Severity
Critical
Threshold
Trigger alert when following condition met:
Number of registered gateways increased
Duration
Trigger alert immediately
Frequency
Trigger alert on every poll
Schedule
24 hours daily
Enable E-mail
Selected
Trigger Alert Action
Default
NumberOfRegisteredMediaDevicesDecreased
This alert occurs when the number of registered media devices in a cluster decreases between consecutive polls.
Table 49 Default Configuration for the NumberOfRegisteredMediaDevicesDecreased RTMT Alert
Value
Default Configuration
Enable Alert
Selected
Severity
Critical
Threshold
Trigger alert when following condition met:
Number of registered media devices decreased
Duration
Trigger alert immediately
Frequency
Trigger alert on every poll
Schedule
24 hours daily
Enable E-mail
Selected
Trigger Alert Action
Default
NumberOfRegisteredMediaDevicesIncreased
This alert occurs when the number of registered media devices in a cluster increases between consecutive polls.
Table 50 Default Configuration for the NumberOfRegisteredMediaDevicesIncreased RTMT Alert
Value
Default Configuration
Enable Alert
Selected
Severity
Critical
Threshold
Trigger alert when following condition met:
Number of registered media devices increased
Duration
Trigger alert immediately
Frequency
Trigger alert on every poll
Schedule
24 hours daily
Enable E-mail
Selected
Trigger Alert Action
Default
NumberOfRegisteredPhonesDropped
This alert occurs when the number of registered phones in a cluster drops more than the configured percentage between consecutive polls.
Table 51 Default Configuration for the NumberOfRegisteredPhonesDropped RTMT Alert
Value
Default Configuration
Enable Alert
Selected
Severity
Critical
Threshold
Trigger alert when following condition met:
Number of registered phones in the cluster drops (10%)
Duration
Trigger alert immediately
Frequency
Trigger alert on every poll
Schedule
24 hours daily
Enable E-mail
Selected
Trigger Alert Action
Default
RouteListExhausted
An available route could not be found in the indicated route list.
Table 52 Default Configuration for the RouteListExhausted RTMT Alert
Value
Default Configuration
Enable Alert
Selected
Severity
Warning
Enable/Disable this alert on the following servers
Enabled on listed servers
Threshold
Trigger alert when following condition met:
Number of RouteListExhausted exceeds 0 times within the last 60 minutes
Duration
Trigger alert immediately
Frequency
Trigger alert on every poll
Schedule
24 hours daily
Enable E-mail
Selected
Trigger Alert Action
Default
SDLLinkOutOfService
This alert occurs when the SDLLinkOutOfService event gets generated. This event indicates that the local Cisco Unified Communications Manager cannot communicate with the remote Cisco Unified Communications Manager. This event usually indicates network errors or a nonrunning, remote Cisco Unified Communications Manager.
Table 53 Default Configuration for the SDLLinkOutOfService RTMT Alert
Value
Default Configuration
Enable Alert
Selected
Severity
Critical
Enable/Disable this alert on the following servers
Enabled on listed servers
Threshold
Trigger alert when following condition met:
SDLLinkOutOfService event generated
Duration
Trigger alert immediately
Frequency
Trigger alert on every poll
Schedule
24 hours daily
Enable E-mail
Selected
Trigger Alert Action
Default
TCPSetupToIMEFailed
This alert occurs when Cisco Unified Communications Manager cannot establish a TCP connection to a Cisco IME server. This alert typically occurs when the IP address and port of the Cisco IME server are misconfigured in Cisco Unified Communications Manager Administration or when an Intranet connectivity problem exists and prevents the connection from being set up.
Ensure that the IP address and port of the Cisco IME server in the alert are valid. If the problem persists, test the connectivity between the Cisco Unified Communications Manager servers and the Cisco IME server.
Default Configuration
Table 54 Default Configuration for the TCPSetupToIMEFailed Alert
Value
Default Configuration
Enable Alert
Selected
Severity
Critical
Enable/Disable this alert on the following servers
Enabled on listed servers
Threshold
Trigger alert when following condition met:
Connection Failure to Cisco IME server
Duration
Trigger alert immediately
Frequency
Trigger alert on every poll
Schedule
24 hours daily
Enable E-mail
Selected
Trigger Alert Action
Default
TLSConnectionToIMEFailed
This alert occurs when a TLS connection to the Cisco IME service could not be established because the certificate presented by the Cisco IME service has expired or is not in the Cisco Unified Communications Manager CTL.
Ensure that the Cisco IME service certificate has been configured into the Cisco Unified Communications Manager.
Default Configuration
Table 55 Default Configuration for the TLSConnectionToIMEFailed Alert
Value
Default Configuration
Enable Alert
Selected
Severity
Alert
Enable/Disable this alert on the following servers
Enabled on listed servers
Threshold
Trigger alert when following condition met:
TLS Failure to Cisco IME service
Duration
Trigger alert immediately
Frequency
Trigger alert on every poll
Schedule
24 hours daily
Enable E-mail
Selected
Trigger Alert Action
Default
Emergency-level alarms
The emergency-level alarm equals zero (0) and means that your system or service is unusable. These alarms generally indicate platform failures. Examples follow:
Service repeatedly fails to startup
System ran out of disk space while disk space is essential for this system to operate
System ran out of memory
Motherboard failure occurred
This level is not suitable for events associated with an individual end point.
Ensure that the primary file path is valid and the corresponding drive has sufficient disk space. Also, make sure that the path has security permissions similar to default log file path.
GlobalSPUtilsCreationError
There was an error during the GlobalSPUtils creation.
No feature license found. Cisco Unified Communications Manager (Unified CM) requires a license to function. Also, Unified CM licenses are version-specific so be certain that the license is for the version you are trying to run. You can run a license unit report in Cisco Unified CM Administration (System > Licensing > License Unit Report).
Request license generation for Cisco Unified Communications Manager SW FEATURE for your version of Unified CM and upload the license in Cisco Unified CM Administration (System > Licensing > License File Upload).
OutOfMemory
The process has requested memory from the operating system, and there was not enough memory available.
An executable is trying to start but cannot because it is not configured as a service in the service control manager. The service is %s. Service is not installed.
The alert-level alarm equals 1 and action must take place immediately. A system error occurred and will not recover without manual intervention. Examples follow:
Application is about to run out of licenses
Application is about to run out of disk space
Application is almost out of memory
100% CPU occurs for long period of time
Be aware that this level is not suitable for events that are associated with an individual end point.
Regenerate the certificate that is about to expire by accessing the Cisco Unified Operating System and go to Certificate Management. If the certificate is issued by a CA, generate a CSR, submit the CSR to CA, obtain a fresh certificate from CA, and upload it to Cisco Unified CM.
CMIException
Error while reading the database.
This alarm is always associated with other alarms, which are triggered due to configuring CMI service parameter with invalid values or due to invalid handle value returned by the serial port.
Refer to the associated alarm for further information.
CMOverallInitTimeExceeded
Initialization of the Cisco Unified Communications Manager system has taken longer than allowed by the value specified in the System Initialization Timer service parameter; as a result, the system will automatically restart now to attempt initialization again. Initialization may have failed due a database error, or due to a large amount of new devices added to the system, or any number of other potential causes. The required time to initialize Cisco Unified Communications Manager has exceeded the time allowed by the Cisco CallManager service parameter, System Initialization Timer. This could be due to an increase in system size.
Cisco Unified Communications Manager Overall Initialization Time (in minutes) [Int]
Recommended Action
Try increasing the value of the Cisco CallManager service parameter, System Initialization Timer, in the Service Parameters Configuration window in Cisco Unified CM Administration. Use RTMT to discover the number of devices and number of users in the system and evaluate whether the numbers seem accurate. Try increasing the value of the Cisco CallManager service parameter, System Initialization Timer, in the Service Parameters Configuration window in Cisco Unified CM Administration. If increasing the time in the System Initialization Timer service parameter does not correct this issue, contact the Cisco Technical Assistance Center (TAC).
ConfigThreadChangeNotifyServerInstanceFailed
Failed to allocate resources to handle configuration change notification from database. This usually indicates a lack of memory when there is a system issue such as running out of resources.
History
Cisco Unified CommunicationsRelease
Action
7.0(1)
Name changed from kConfigThreadChangeNotifyServerInstanceFailed
Use RTMT to monitor the system memory resources and consumption and correct any system issues that might be contributing to a reduced amount of system resources.
ConfigThreadChangeNotifyServerSingleFailed
Failed to allocate resources to handle configuration change notification from database. This usually indicates a lack of memory when there is a system issue such as running out of resources.
History
Cisco Unified CommunicationsRelease
Action
7.0(1)
Name changed from kConfigThreadChangeNotifyServerSingleFailed.
Use RTMT to monitor the system memory resources and consumption and correct any system issues that might be contributing to a reduced amount of system resources.
ConfigThreadChangeNotifyServerStartFailed
Failed to start listening to configuration change notification from database. This usually indicates a lack of memory when there is a system issue such as running out of resources.
History
Cisco Unified CommunicationsRelease
Action
7.0(1)
Name changed from kConfigThreadChangeNotifyServerStartFailed.
Use RTMT to monitor the system memory resources and consumption and correct any system issues that might be contributing to a reduced amount of system resources.
CiscoLicenseApproachingLimit
License units consumption approaching its authorized limit.
One or more Unified CM nodes in a cluster are running different Cisco CallManager versions.
This alarm indicates that the local Unified CM is unable to establish communication with the remote Unified CM due to a software version mismatch. This is generally a normal occurrence when you are upgrading a Unified CM node.
Failed to create a new thread. See Reason string for where it failed. This usually happens when there are system issues such as running out of memory resources.
Use RTMT to monitor the system memory resources and consumption and correct any system issues that might be contributing to a reduced amount of system resources.
DBLException
An error occurred while performing database activities. A severe database layer interface error occurred. Possible causes for this include the database being unreachable or down or a DNS error.
Review the System Reports provided in the Cisco Unified Reporting tool, specifically the Cisco Unified CM Database Status report, for any anomalous activity. Check network connectivity to the server that is running the database. If your system uses DNS, check the DNS configuration for any errors.
InvalidCredentials
Credential Failure to IME server.
The connection to the IME server could not be completed, because the username and/or password configured on Unified CM do not match those configured on the IME server.
The alarm will include the username and password which were used to connect to the IME server, along with the IP address of the target IME server and its name. Log into the IME server and check that the username and password configured there match those configured in Unified CM.
In Cisco Unified Serviceability, enable Detailed level traces in the Trace Configuration window for the Cisco Database Layer Monitor service. Check network connectivity and operation of SQL Server services.
ParityConfigurationError
The CMI service parameter, Parity, has an invalid configuration.
An invalid parity has been configured for the serial port that CMI uses to connect to the voice messaging system. It is possible that the parity value has been updated via AXL or a CLI command where validation of the value was not performed. For this reason, it is best to set this value in the Service Parameter Configuration window in Cisco Unified CM Administration and the value can be validated against the accepted range of values for this field.
Verify that the Cisco Messaging Interface service parameter Parity is set to a valid (allowable) value.
SerialPortOpeningError
When CMI tries to open the serial port, the operating system returns an error.
For a system running CMI, the serial port through which the voice messaging system is connected is always USB0, and that value is configured in the Cisco Messaging Interface service parameter, Serial Port. It is possible that the Serial Port value has been updated via AXL or a CLI command where validation of the value was not performed. CMI triggers this alarm if the value in the Serial Port service parameter is anything other than USB0.
Ensure that USB0 is configured in the Cisco Messaging Interface service parameter Serial Port. Also, physically confirm that the cable is firmly connected to the USB0 port.
SDIControlLayerFailed
Failed to update trace logging or alarm subsystem for new settings. This usually indicates a lack of system resources or a failure in database access by the trace logging or alarm subsystem.
In Cisco Unified Serviceability, enable Detailed level traces in the Trace Configuration window for TFTP and Cisco Database Layer Monitor services. Also, use RTMT to look for errors that may have occurred around the time of the alarm. Ensure that the database server is running, and that the Cisco Database Layer Monitor service is running without problems. If this alarm persists, contact the Cisco Technical Assistance Center (TAC) with TFTP service and database trace files.
SDLLinkOOS
SDL link to remote application out of service. This alarm indicates that the local Unified CM has lost communication with the remote Unified CM. This alarm usually indicates that a node has gone out of service (whether intentionally for maintenance or to install a new load for example; or unintentionally due to a service failure or connectivity failure).
Remote IP address of remote application [String] Unique Link ID. [String] Local node ID [UInt] Local Application ID. [Enum]RemoteNodeID [UInt] Remote application ID. [Enum]
Recommended Action
In the Cisco Unified Reporting tool, run a CM Cluster Overview report and check to see if all servers can talk to the Publisher. Also check for any alarms that might have indicated a CallManager failure and take appropriate action for the indicated failure. If the node was taken out of service intentionally, bring the node back into service.
LocalApplicationID and RemoteApplicationID Enum definitions
Code
Reason
100
CallManager
200
CTI
SocketError
Failed to open network connection for receiving file requests. This usually happens when the IP address that the TFTP service uses to open the network connection is invalid.
Verify that the TFTP service parameter, TFTP IP Address, accurately specifies the IP address of the NIC card to use for serving files via TFTP. See the help for the (advanced) TFTP IP Address service parameter for more information. If the problem persists, go to Cisco Unified Serviceability and enable Detailed level traces in the Trace Configuration window for the TFTP service and contact the Cisco Technical Assistance Center (TAC).
StopBitConfigurationError
The Cisco Messaging Interface service parameter, Stop Bits, has an invalid configuration.
An invalid stop bit has been configured for the serial port that CMI uses to connect to the voice messaging system. It is possible that the Stop Bits value has been updated via AXL or a CLI command where validation of the value was not performed. For this reason, it is best to set this value in the Service Parameter Configuration window in Cisco Unified CM Administration and the value can be validated against the accepted range of values for this field.
Verify that the Cisco Messaging Interface service parameter Stop Bits is set to a valid (allowable) value.
TFTPServerListenSetSockOptFailed
Failed to increase the size of the network buffer for receiving file requests. This usually indicates a lack of memory when there is a system issue such as running out of resources.
History
Cisco Unified CommunicationsRelease
Action
7.0(1)
Name changed from kTFTPServerListenSetSockOptFailed.
Use RTMT to monitor the system memory resources and consumption and correct any system issues that might be contributing to a reduced amount of system resources.
TFTPServerListenBindFailed
Fail to connect to the network port through which file requests are received. This usually happens if the network port is being used by other applications on the system or if the port was not closed properly in the last execution of TFTP server.
Verify that the port is not in use by other application. After stopping the TFTP server, at the command line interface (CLI) on the TFTP server, execute the following command—show network status listen. If the port number specified in this alarm is shown in this CLI command output, the port is being used. Restart the Cisco Unified Communications Manager system, which may help to release the port. If the problem persists, go to Cisco Unified Serviceability and enable Detailed level traces in the Trace configuration window for the TFTP service and contact the Cisco Technical Assistance Center (TAC).
A TLS connection to the IME server could not be established because of a problem with the certificate presented by the IME server. (For example, not in the Unified CM CTL, or is in the CTL but has expired).
Check to see that the certificate of the IME server is configured properly in the Unified CM.
Routing List
SDL
SDI
Sys Log
Event Log
Alert Manager
Parameter(s)
SSLErrorCode(UInt)
SSLErrorText(String)
TVSServerListenBindFailed
Fail to connect to the network port through which file requests are received. This usually happens if the network port is being used by other applications on the system or if the port was not closed properly in the last execution of TVS server.
Cisco Unified Serviceability Alarm Catalog
System/TVS
Severity
ALERT
Routing List
SDI
Event Log
Data Collector
Sys Log
Parameter(s)
nError(Int)
IPAddress(String)
Port(Int)
Recommended Action
Verify that the port is not in use by other application. After stopping the TVS server, at the command line interface (CLI) on the TVS server, execute the following command: show network status listen. If the port number specified in this alarm is shown in this CLI command output, the port is being used. Restart the Cisco Unified Communications Manager system, which may help to release the port. If the problem persists, go to Cisco Unified Serviceability and enable Detailed level traces in the Trace Configuration window for the TVS service and contact the Cisco Technical Assistance Center (TAC).
TVSServerListenSetSockOptFailed
Failed to increase the size of the network buffer for receiving file requests. This usually indicates a lack of memory when there is a system issue such as running out of resources.
Cisco Unified Serviceability Alarm Catalog
System/TVS
Severity
ALERT
Routing List
SDI
Event Log
Data Collector
Sys Log
Parameter(s)
nError(Int)
IPAddress(String)
Port(Int)
Recommended Action
Use RTMT to monitor the system memory resources and consumption and correct any system issues that might be contributing to a reduced amount of system resources.
UnknownException
Unknown error while connecting to database.
When CMI service is started, it tries to read CMI service parameters from DB. During this, if there is an unknown error, CMI triggers this alarm.
CMI cannot register with Cisco Unified Communications Manager because of an invalid Voice Mail DN. This alarm occurs because the Cisco Messaging Interface service parameter, Voice Mail DN, is empty or has invalid characters other than digits (0-9). It is possible that the Voice Mail DN value has been updated via AXL or a CLI command where validation of the value was not performed. For this reason, it is best to set this value in the Service Parameter Configuration window in Cisco Unified CM Administration and the value can be validated against the accepted range of values for this field.
Check the CMI service parameter Voice Mail DN to confirm that a valid directory number has been configured.
Critical-Level Alarms
The critical-level alarm equals 2 and action may need to be taken immediately; auto-recovery is expected, but monitor the condition.
This alarm acts similar to the alert-level alarm but not necessarily requiring an immediate action. A system-affecting service had a failure but recovered without intervention. Examples follow:
Service crashed due to an error that could not be handled but a watchdog process exists that will restart the service. The crash does not necessarily require immediate action. Examples are:
Out of memory conditions
Uninitialized variables
Memory scribblers
Unexpected code error occurred that could not be handled but for which the system automatically restarts.
The B-channel is out of service. The B-channel indicated by this alarm has gone out of service. Some of the more common reasons for a B-channel to go out of service include are as follows:
Taking the channel out of service intentionally to perform maintenance on either the near- or far-end
MGCP gateway returns an error code 501 or 510 for a MGCP command sent from Cisco Unified Communications Manager (Cisco Unified CM)
MGCP gateway does not respond to an MGCP command sent by Cisco Unified CM three times
Speed and duplex mismatch exists on the Ethernet port between Cisco Unified CM and the MGCP gateway.
Check the Cisco Unified CM advanced service parameter, Change B-channel Maintenance Status to determine if the B-channel has been taken out of service intentionally; Check the Q.931 trace for PRI SERVICE message to determine whether a PSTN provider has taken the B-channel out of service; Reset the MGCP gateway; Check the speed and duplex settings on the Ethernet port.
CallManagerFailure
Indicates an internal failure in the Cisco Unified Communications system. The service should restart in an attempt to clear the failure.
Reason Code Enum definitions for CallManagerFailure
Code
Reason
1
Unknown—Unified CM has failed for an unknown reason.
2
HeartBeatStopped—An internal heart beat has stopped after the preceding heart beat interval.
3
RouterThreadDied—An internal thread has failed.
4
TimerThreadDied—An internal thread has failed.
5
CriticalThreadDied—An internal thread has failed.
CertExpiryCritical
Certificate is about to expire in less than 7 days. Regenerate or reimport certificate. Name of the service generating this alarm is Cisco Certificate Expiry Monitor. The alarms are generated when any certificate generated by the system or uploaded into the system expires. Cisco Unified CM uses certificates for Tomcat (Web Server), CallManager, IPSEc and Directory. Refer Security guide for more details on various certificates. When a certificate generated by Cisco Unified CM, the default validity of the self-signed certificate is for 5 years. In case of Certificates signed by a CA, the validity is dependent on the Expiry date set by CA while issuing the certificate. Once a certificate is about to expire "Cisco Certificate Expiry Monitor" service generates alarms. The severity of the alarm is dependent on how much time is left for the certificate to expire.
The impact to system operation depends on the which certificate expired. This information is contained in the alarm. If Tomcat certificate expired, while connecting to Cisco Unified CM web pages, browser will throw an error stating certificate has expired. One can still ignore the warning and continue to connect to Cisco Unified CM pages.
In case of Directory-trust, if Directory trust certificate uploaded to Cisco Unified CM expires, Cisco Unified CM may not be able to establish SSL connection with external LDAP server. The overall impact is that SSL connection between Cisco Unified CM and other external Servers will fail.
Login to CUOS page. Go to Security > Certificate Management and regenerate the certificate that has expired (based on the information in alarm). This will generate a new self-signed certificate with a new expiry date. In case the certificate is signed by a CA, Generate a new CSR, send it to the CA, get the certificate signed by CA and upload the new certificate.
CertValidfor7days
Alarm indicates that the certificate has expired or expires in less than seven days.
Regenerate the certificate that is about to expire by accessing the Cisco Unified Operating System and go to Certificate Management. If the certificate is issued by a CA, generate a CSR, submit the CSR to CA, obtain a fresh certificate from CA, and upload it to Cisco Unified CM.
CDRMaximumDiskSpaceExceeded
The CDR files disk usage exceeded maximum disk allocation. Some undeliverable files may have been deleted to bring disk usage down. The CDR files disk usage has exceeded the maximum allocated disk space. CDRM may have deleted some CDR files that have not been sent to the outside billing servers yet, in order to bring the disk usage down to below High Water Mark. The decision whether to delete undeliverable files or not depends on how deletionDisable flag is configured at CDRM Configuration page. E-mail alert will be sent to the admin.
History
Cisco Unified CommunicationsRelease
Action
8.0(1)
Facility and sub-facility changed. Added Routing List and changed Data Collector to Alert Manager.
Unified CM has entered Code Red condition and will restart.
Unified CM has been in Code Yellow state for an extended period and is unlikely to recover on its own. The Cisco CallManager service automatically restarts in an attempt to clear the condition that is causing the Code Yellow state. The amount of time that the system will remain in Code Yellow state is configurable in the Code Yellow Duration service parameter. If the duration of this parameter is set to 99999, Code Red condition will never occur.
Expected Average Delay [UInt] Entry Latency [UInt] Exit Latency [UInt] Sample Size [UInt] Code Yellow Duration [UInt] Number of Calls Rejected Due to Call Throttling [UInt] Total Code Yellow Entry [UInt] Total Code Yellow Exit [UInt]
Recommended Action
You should have attempted the steps in the recommended actions defined in the CodeYellowEntry alarm. If you have not, try those after the system is online. There is no other action for Code Red because the only action is to restart which is performed for you automatically.
CodeYellowEntry
CallManager has initiated call throttling due to unacceptably high delay in handling incoming calls.
Expected Average Delay [UInt] Entry Latency [UInt] Exit Latency [UInt] Sample Size [UInt] Total Code Yellow Entry [UInt]
Recommended Action
Memory problems or high CPU usage are generally at the root of a Code Yellow state. A bad disk could also be the cause. Also, trace level settings can consume tremendous amounts of CPU (especially when the Enable SDL TCP Event Trace checkbox is enabled on the SDL Trace Configuration window in Cisco Unified Serviceability). Check these areas to try to correct the Code Yellow condition. You can also determine the level of fragmentation on the hard disk by issuing the File Fragmentation command from the CLI for the trace directories. Monitor the situation and collect existing trace files. If the CodeYellowExit alarm is not issued in a reasonable amount of time as deemed by your organization, or if the system is frequently entering Code Yellow state, contact TAC and supply the trace information you have collected.
CoreDumpFileFound
The new core dump files have been found in the system. One of the component has crashed and generated a core dump. Use admin cli or RTMT to featch the backtrace.
This serious internal error should be investigated by the Cisco Technical Assistance Center (TAC). Before contacting TAC, Login to cli on CCM serve and run "active analyze core file name" to generate the backtrace of the core dump. The core file name is listed in the alert details. After the analyze command is executed, collect the backtrace using cli command "file get activelog analyze" or "Collect Traces" option from RTMT. Send these backtraces to Cisco TAC for further analysis.
DChannelOOS
The D-channel is out of service. D-channel indicated by this alarm has gone out of service. Common reasons for a D-channel going out of service include losing T1/E1/BRI cable connectivity; losing the gateway data link (Layer 2) due to an internal or external problem; or gateway reset.
Channel Id. [UInt] Unique channel Id [String] Device Name. [String] Device IP address [String] Reason. [Enum]
Enum Definitions
0—None Defined
Recommended Action
Check the connection of the T1/E1/BRI cable; reset the gateway to restore Layer 2 connectivity; investigate whether the gateway reset was intentional. If the reset was not intentional, take steps to restrict access to the Gateway Configuration window in Cisco Unified Communications Manager Administration and the gateway terminal.
DUPLEX_MISMATCH
This alarm is generated by Cisco CDP whenever there is a duplex mismatch between local interface and switch interface.
Ensure that duplex settings are set to auto or full on local interface as well as switch interface.
ErrorChangeNotifyClientBlock
A change notification client is busy (blocked). If the change notification client continues to be blocked for 10 minutes, the system automatically clears the block and change notification should resume successfully. Changes made to the database are not being consumed by one of the recipients. This does not always represent an issue. However, if the change notification client continues to be blocked for 10 minutes, the system automatically clears the block for all clients except the blocked one, which means that change notifications should resume successfully for all other clients. To clear the blocked client, you must restart the server.
At the command line interface (CLI) on the database server, execute the following command:
show tech notify
The CLI command output will provide information about the block. Use Cisco Unified Serviceability to restart the server that was indicated in the alarm. You may also want to gather traces to examine them for anomalous activity during the time that client was blocked. In Cisco Unified Serviceability, enable Detailed level traces in the Trace Configuration window for the Cisco Database Layer Monitor service. Also, use RTMT to look for a change that may have occurred around the time of the alarm.
LogPartitionHighWaterMarkExceeded
The percentage of used disk space in the log partition has exceeded the configured high water mark. Some of the core file and / or trace files will be purged until the percentage of used disk space in the log partition gets below the configured low water mark.
Login into RTMT and check the configured threshold value for LogPartitionHighWaterMarkExceeded alert in Alert Central. If the configured value is set to a lower than the default threshold value unintentionally, change the value to default.
If you continue to receive this alert for half an hour after receiving the 1st alert, check for the disk usage for Common partition under "Disk Usage" tab in RTMT. If the disk usage shown under that tab is higher than configured value in LogPartitionLowWaterMarkExceeded alert configuration, contact Cisco TAC to troubleshoot the cause of high disk usage in Common partition.
MaxCallsReached
The maximum number of simultaneous connections in a Cisco Unified Communications Manager (Unified CM) node has been reached. This is an internally-set value and when it is exceeded, Unified CM starts throttling calls to keep the number of calls below the internal threshold.
In the Real-Time Monitoring Tool, check the CallsActive counter in the Cisco CallManager object for an unusually high number of calls. Internal mechanisms will attempt to correct this condition. If this alarm continues to occur, collect existing SDL and CCM trace files and check to be sure that CM Services trace collection in Cisco Unified CM Serviceability is set to Detailed level.
MGCPGatewayLostComm
The MGCP gateway is no longer in communication with Cisco Unified Communications Manager (Cisco Unified CM). This could occur because Cisco Unified CM receives an MGCP unregister signal from the gateway such as RSIP graceful/forced; Cisco Unified CM doesn't receive the MGCP KeepAlive signal from the gateway; the MGCP gateway doesn't response to an MGCP command sent by Cisco Unified CM three times; a speed and duplex mismatch exists on the Ethernet port between Cisco Unified CM and the MGCP gateway; the gateway has reset.
Reset the MGCP gateway in an attempt to restore communication with Cisco Unified CM; check the speed and duplex settings on the Ethernet port. In the case of an unwanted reset of the gateway which caused communication to be lost, take precautions to ensure that no unauthorized personnel resets the gateway from Cisco Unified CM Administration or via the gateway terminal.
Verify the Cisco Unified Communications Manager IP address is configured and is not configured as the loop back address for the IP version. If the IP settings are correct, collect SDL and SDI traces and contact TAC.
TCPSetupToIMEFailed
Connection Failure to IME server.
This alarm occurs when Unified CM is unable to establish a TCP connection to an IME server. It typically occurs when the IP address and port of the IME server are misconfigured or an Intranet connectivity problem is preventing the connection from being set up.
Check to make sure that the IP address and port of the IME server - which are present in the alarm - are valid. If so, this may be due to a network connectivity problem. Test the connectivity between the Unified CM servers and the IME server.
Routing List
SDL
SDI
Sys Log
Event Log
Alert Manager
Parameter(s)
IP address(String)
Port number(UInt)
TimerThreadSlowed
Verification of the Cisco Unified Communications Manager (Unified CM) internal timing mechanism has slowed beyond acceptable limits. This generally indicates an increased load on the system or an internal anomaly.
If this alarm occurs at the same general day or time, or if it occurs with increasing frequency, collect all system performance data in Real-Time Monitoring Tool as well as all trace information for the 30 minutes prior to the time that this alarm occurred and contact TAC.
The error-level alarm is 3 and you should investigate important devices or subsystems and determine if immediate action is needed. Errors that do not necessarily impact the ability of the service to continue to function and do not create a system outage. More related to device or subsystems.
An example would be a device or subsystem failing for an unexpected reason.
ANN device recovery create failure. The ANN device recovery class create failed, possibly due to lack of memory. If the error code is non-zero it may help determine the cause of the error. The announcement device will not be available.
Restart the Cisco IP Voice Media Streaming App service or restart server.
AwaitingResponseFromPDPTimeout
Cisco Unified Communication Manager timed out waiting for the routing response from the policy decision point. Cisco Unified Communications Manager (Unified CM) did not receive a call routing response from the policy decision point (PDP) within the time specified by the Cisco CallManager service parameter, Call Intercept Routing Request Timer, or on the Call Intercept Profile Configuration window in Cisco Unified CM Administration.
Check whether the PDP is in service and working normally. Verify that the PDP is not overloaded; if it is, take appropriate action to reduce the load on the PDP by following some or all of these recommendations:
Consider adding more PDPs and provisioning Unified CM with additional call intercept profiles and call intercept trigger points in the various configuration pages under the Call Routing menu in Cisco Unified CM Administration.
Provision a pair of policy servers per call-intercept profile to enable load balancing.
OR
Verify that the PDP server in your deployment meets or exceed the hardware requirements specified in the documentation for Cisco Enterprise Policy Manager (CEPM) or the third-party PDP solution you have deployed. If necessary, increase the value in the Cisco CallManager service parameter, Call Intercept Routing Request Timer or the value in the Call Intercept Profile for this PDP.
BadCDRFileFound
Bad CDR or CMR flat file found during CDR Load to CAR database. The file could be corrupted. However, CAR loader is able to skip the bad records and load the good ones to CAR database. The name of the service generating this alarm is CAR Loader (DailyCdrLoad) job. Part of Cisco CAR Scheduler service.
Find the bad file from the cdr_repository folders, and check its problematic record based on the information given by the cause and summary. Collect the associated SDI and SDL traces for the bad records found in this file as soon as possible. Collect and check the CAR Scheduler traces for more details.
CAR scheduled job failed. A normal CAR scheduled job failed such as the pre-generated Daily/Weekly/Monthly/Monthly-Bill reports jobs. The particular CAR scheduler job that fails cannot be run properly. This does not cause any significant impact on CAR functions. For pre-generated CAR report, this would result failure to run on a particular report, which leads to missing of CAR report.
Check the contents in tbl_system_preferences table.
Check the number of records in tbl_billing_data, tbl_billing_error, and tbl_error_id_map tables.
Check if the scheduled job configuration is correct from CAR page.
Collect and check the CAR Scheduler traces for more details.
CARSchedulerJobFailed
Critical CAR scheduled job failed. The jobs are PopulateSchedules, DailyCdrLoad, TaskMonitor, or DatabaseMaintenance. The particular CAR scheduler job that failed cannot be run properly. This can cause significant impact on CAR functions.
If PopulateSchedules job fails, CAR scheduler cannot schedule jobs to run for the day; this would result some/all of CAR scheduler jobs cannot start.
If DailyCdrLoad job fails, CAR loader would not be able to load CDR/CMR records from CDR/CMR flat files into CAR database; this would result records found upon running CAR reports, and causes accumulation of CDR/CMR flat files unprocessed.
If TaskMonitor job fails, CAR scheduler will not be able to perform the daily DB IDS memory clean up task; this would result higher DB shared memory usage.
If DatabaseMaintenance job fails, CAR scheduler will not be able to perform the daily optimized database maintenance Update statistics procedures; this would result CAR database not optimized for its operations.
Name of the service generating this alarm is CAR Scheduler service.
Cisco Unified CommunicationsRelease
Action
8.0(1)
Routing list changed from Data Collector to Alert Manager and existing parameters added.
Check the contents in tbl_system_preferences table.
Check the number of records in tbl_billing_data, tbl_billing_error, and tbl_error_id_map tables.
Check if the scheduled job configuration is correct from CAR page.
Collect and check the CAR Scheduler traces for more details.
CCDIPReachableTimeOut
CCD Requesting Service IP Reachable Duration times out.
The CCD requesting service detected that it can no longer reach the learned patterns through IP. All learned patterns from this forward will be marked as unreachable (via IP) and to allow calls to learned patterns to continue to be routed until IP becomes reachable again, all calls to learned patterns will be routed through the PSTN. Calls can be routed through the PSTN for a certain period of time before PSTN failover times out.
Check IP connectivity and resolve any TCP or IP problems in the network.
CCDPSTNFailOverDurationTimeOut
The internal limit on PSTN failover has expired.
When learned patterns are not reachable through IP, Unified CM routes calls through the PSTN instead. Calls can be routed through PSTN for an internally-controlled duration. When this alarm occurs, the PSTN failover duration has expired and calls to learned patterns cannot be routed. All learned patterns will be purged from Unified CM.
Troubleshoot your network to get IP connectivity restored. After IP connectivity is restored, Unified CM will automatically relearn Hosted DN patterns and calls to learned patterns will proceed through IP.
CDRAgentSendFileFailed
CDR Agent cannot send CDR files from CCM node to CDR Repository node within the CCM cluster because of timeout or other reasons. E-mail alert will be sent to the admin.
History
Cisco Unified CommunicationsRelease
Action
8.0(1)
Changed Data Collector Routing List element to Alert Manager.
Check if CDR Repository node (first node in the cluster) is alive.
Check if CDR Repository Manager is activated on the first node.
Check CDRM Configuration under Serviceability > Tools.
Check CDR Agent trace on the specific node where error occurred.
Check CDR Repository Manager trace.
Check if the Publisher is being upgraded. If the CDRAgentSendFileFailureContinues alarm is no longer present, the condition is corrected.
CDRAgentSendFileFailureContinues
CDR Agent cannot send CDR files from CCM node to CDR Repository node on retries. CDR Agent cannot send CDR files on retries after the initial failure from CCM node to CDR Repository node within the cluster.
Check if CDR Repository node (first node in the cluster) is alive.
Check if CDR Repository Manager is activated on the first node.
Check CDRM Configuration under Serviceability > Tools.
Check CDR Agent trace on the specific node where error occurred.
Check CDR Repository Manager trace.
Check if the Publisher is being upgraded.
CDRFileDeliveryFailed
FTP delivery of CDR files to the Billing Server outside of the cluster failed because of timeout or other reasons. E-mail alert will be sent to the admin.
History
Cisco Unified CommunicationsRelease
Action
8.0(1)
Changed Data Collector Routing List element to Alert Manager.
Check if (s)FTP Server on the billing server is running and accepting request.
Check if CDRM Configuration is correct - under Serviceability > Tools.
Check CDR Repository Manager trace.
CFBDeviceRecoveryCreateFailed
The CFB device startup failed, possibly due to lack of memory. If the error code is non-zero it may help determine the cause of the error. The conference bridge device will not be available.
Ensure that the LDAP server is online. If SSL is used, please make sure the required certificate is available on local CM server. The application will automatically retry
See application logs, verify that the license file is proper.
CLM_MsgIntChkError
ClusterMgr message integrity check error. ClusterMgr has received a message which has failed a message integrity check. This can be an indication that another node in the cluster is configured with the wrong security password.
Verify that this IP address is currently configured as a server in this cluster.
ConfigItAllBuildFilesFailed
A complete rebuild of all device configuration files has failed. Probable causes of this alarm could be failure to access the Cisco Unified Communications Manager database, or misconfiguration of some devices.
In Cisco Unified Serviceability, enabled Detailed level traces in the Trace Configuration window for TFTP and Cisco Database Layer Monitor services. Also, use RTMT to look for errors that may have occurred around the time of the alarm.
ConfigItAllReadConfigurationFailed
Failed to retrieve enterprise parameter values from database when rebuilding all configuration files. This is usually caused by a failure to access the Cisco Unified Communications Manager database.
History
Cisco Unified CommunicationsRelease
Action
7.0(1)
Name changed from kConfigItAllReadConfigurationFailed.
In Cisco Unified Serviceability, enable Detailed level traces in the Trace Configuration window for TFTP and Cisco Database Layer Monitor services. Also, use RTMT to look for errors that may have occurred around the time of the alarm.
ConfigThreadBuildFileFailed
Failed to build all device configuration files at TFTP service startup. This is usually caused by database access failure.
In Cisco Unified Serviceability, enable Detailed level traces in the Trace Configuration window for TFTP and Cisco Database Layer Monitor services. Also, use RTMT to look for errors that may have occurred around the time of the alarm.
ConfigThreadCNCMGrpBuildFileFailed
Failed to rebuild configuration files for changes in Cisco Unified Communications Manager Group settings. This is usually caused by a failure to access the Cisco Unified Communications Manager database.
History
Cisco Unified CommunicationsRelease
Action
7.0(1)
Name changed from kConfigThreadCNCMGrpBuildFileFailed.
In Cisco Unified Serviceability, enable Detailed level traces in the Trace Configuration window for TFTP and Cisco Database Layer Monitor services. Also, use RTMT to look for errors that may have occurred around the time of the alarm.
ConfigThreadCNGrpBuildFileFailed
Failed to rebuild configuration files for changes at group level settings such as Device Pool or Common Device Config settings. This is usually caused by a failure to access the Cisco Unified Communications Manager database.
History
Cisco Unified CommunicationsRelease
Action
7.0(1)
Name changed from kConfigThreadCNGrpBuildFileFailed.
In Cisco Unified Serviceability, enable Detailed level traces in the Trace Configuration window for TFTP and Cisco Database Layer Monitor services. Also, use RTMT to look for errors that may have occurred around the time of the alarm.
ConfigThreadReadConfigurationFailed
Failed to retrieve enterprise parameter values from database at TFTP service startup. This is usually caused by database access failure.
History
Cisco Unified CommunicationsRelease
Action
7.0(1)
Name changed from kConfigThreadReadConfigurationFailed.
In Cisco Unified Serviceability, enable Detailed level traces in the Trace Configuration window for TFTP and Cisco Database Layer Monitor services. Also, use RTMT to look for errors that may have occurred around the time of the alarm.
ConfigThreadUnknownExceptionCaught
An exception is caught in the main processing routine. This alarm is sent in conjunction with other alarms for failure when building configuration files or when the TFTP service is attempting to retrieve the values in the system's enterprise parameters.
History
Cisco Unified CommunicationsRelease
Action
7.0(1)
Name changed from kConfigThreadUnknownExceptionCaught.
In Cisco Unified Serviceability, enable Detailed level traces in the Trace Configuration window for TFTP service. Also, use RTMT to look for errors that may have occurred around the time of the alarm.
ConflictingDataIE
A call has been rejected because the incoming PRI/BRI Setup message had an invalid IE.
A call has been rejected because an incoming PRI/BRI Setup message contained an invalid Coding Standard value in the Bearer Capability information element (IE). Unified CM only accepts PRI/BRI Setup messages with Coding Standard values of 0 or 1. When an invalid IE is received, Unified CM rejects the call setup and issues this alarm.
Notify the service provider responsible for sending the Setup message that an IE with Coding Standard values of 0 or 1 must be included in Setup messages.
ConnectionFailure
Cisco CallManager failed to open TLS connection for the indicated device. Possible reasons could be wrong "Device Security Mode" configured, wrong "X.509 Subject Name" configured or unsupported cipher algorithm.
Check the Security profile of the indicated device. Make sure "Device Security Mode" is either "Authenticated" or "Encrypted". Make sure "X.509 Subject Name" field has the right content. It should match the Subject Name in the certificate from the peer. Unified CM only supports AES_128_SHA cipher algorithm. Let the peer regenerate its certificate with the right algorithm.
Verify the network connectivity between Unified CM and the PDP by pinging the policy server host from Cisco Unified OS Administration and take steps to establish connectivity if it has been lost. If the connection failure is due to an authentication problem, verify that the valid certificate of the PDP has been imported to Cisco Unified OS Administration and certificates from every node in the Unified CM cluster have been imported to every node in the PDP. Also, check whether the PDP service is active.
CNFFBuffWriteToFilefopenfailed
Failed to create Config File on disk or update existing Config File on disk. This may happen if disk is full or the file is in use.
Cisco Unified CommunicationsRelease
Action
7.0(1)
Name changed from kCNFFBuffWriteToFilefopenfailed.
Using RTMT, check the disk utilization and correct any issue discovered. If you do not discover a disk space issue, try restarting the TFTP service from Cisco Unified Serviceability (Tools > Control Center - Feature Services). Stopping and restarting the TFTP service is useful because the Config File that the TFTP service is trying to save might be an existing file that is in use. If you still get this error, go to Cisco Unified Serviceability and enable Detailed level traces in the Trace Configuration window for the TFTP service and contact the Cisco Technical Assistance Center (TAC).
CNFFBuffWriteToFilefwritefailed
Failed to save Config File to disk. This may happen if disk is full or the file is in use.
Cisco Unified CommunicationsRelease
Action
7.0(1)
Name changed from kCNFFBuffWriteToFilefwritefailed.
Using RTMT, check the disk utilization and correct any issue discovered. If you do not discover a disk space issue, try restarting the TFTP service from Cisco Unified Serviceability (Tools > Control Center - Feature Services). Stopping and restarting the TFTP service is useful because the Config File that the TFTP service is trying to save might be an existing file that is in use. If you still get this error, go to Cisco Unified Serviceability and enable Detailed level traces in the Trace Configuration window for the TFTP service and contact the Cisco Technical Assistance Center (TAC).
CtiProviderOpenFailure
CTI application is unable to open the provider. The IP address is shown in either IPv4 or IPv6 format depending on the IP addressing mode of the application.
Reason Code Enum definitions for CtiProviderOpenFailure
Value
Definition
0
Unknown
0x8CCC0075 (2362179701)
Login request to authenticate user has timed out. Possible causes include LDAP server misconfiguration such as LDAP server referrals misconfiguration or Unified CM node experiencing high CPU usage. Recommended action is to verify the CPU utilization is in the safe range for Unified CM (this can be monitored using RTMT via CPU Pegging Alert)
0x8CCC0060 (2362179680)
Directory login failed. Verify that credentials are not misconfigured, check the userID and password configured in the application matches with what is configured in Unified CM Admin under (User Management > End User/Application User)
0x8CCC005E (2362179678)
Directory is unavailable. Verify that the LDAP server is reachable from Unified CM node, make sure that the network connectivity between Unified CM and the LDAP server by pinging the LDAP server host from Cisco Unified OS Administration and take steps to establish connectivity if it has been lost
0x8CCC00D1 (2362179793)
Application is connecting to a non secure port but has security privileges enabled for the user associated with the application. Check the user group configuration for the user in Unified CM Admin under (User Management > End User/Application User), select the user and verify the associated permissions information
0x8CCC005F (2362179679)
Standard CTI Use permission is not enabled. Users associated with applications are required to be included in "Standard CTI Enabled" user group. Verify the user group configuration for the user in Unified CM Admin under (User Management > End User/Application User), select the user and review the associated permissions information
0x8CCC00D0 (2362179792)
User is not enabled for a secure connection but the application connecting to secure port. Consider the application configuration and security configuration for the user, for TAPI applications review the Control Panel > Phone and Modem Options > Advanced > select a CiscoTSP > Configure... > Security and disable "Secure Connection to CTIManager". For JTAPI applications from JTPrefs choose Security and disable Enable Secure Connection. Also check the user group configuration for the user in Unified CM Admin under (User Management > End User/Application User), select the user and verify the associated permissions information
Device type mismatch between the information contained in the device TFTP config file and what is configured in the database for that device.
The device type indicated in the device configuration file does not match the database configuration. This could indicate that a change was made in the database configuration that failed to get updated at the device itself.
Database device type [Enum]Device type. [Enum]Name of device. [String]
Recommended Action
Check the Unified CM Database Status report in Cisco Unified Reporting to verify that database replication is working. You can also go to Real-Time Reporting Tool (RTMT) and check the Replication Status in the Database Summary page. If status shows 2, then replication is working. Restart the phone to download a new configuration file from TFTP. Also, refer to the reason code definitions for additional recommended actions.
The TCP socket for the SCCP device has been closed due to excessive events in a 5-second period. Under normal conditions, the device will reregister automatically.
The indicated SCCP device exceeded the maximum number of events allowed per-SCCP device. Events can be phone calls, KeepAlive messages, or excessive SCCP or non-SCCP messages. The maximum number of allowed events is controlled by the Cisco CallManager service parameter, Max Events Allowed. When an individual device exceeds the number configured in that service parameter, Unified CM closes the TCP connection to the device; automatic reregistration generally follows. This action is an attempt to stop malicious attacks on Unified CM or to ward off excessive CPU usage.
Total Events Received [UInt] IP Address [String] TCP Handle [String] Max Events Allowed [UInt] Number Of Skinny Device Throttled [UInt]
Recommended Action
Check the CCM trace data for the indicated SCCP device to determine the reason for the high number of events. Confirm that the value configured in the Cisco CallManager service parameter, Max Events Allowed, is a suitable number for your deployment.
DeviceInitTimeout
Device initialization timeout occurred. This alarm does not occur under normal working conditions; it will only occur if a device fails to respond to an initialize request.
DRF uses ipsec truststore certificate for securing communication between the MA and LA service. This certificate is missing on the node, DRF LA will not be able to connect to MA.
Download ipsec.pem file from Publisher and upload it as ipsec-trust only on the missing node then restart Cisco DRF Local service.
DRFUnknownClient
The DRF Master Agent running on the Publisher has received a Client connection request from an unknown server outside the cluster.The request has been rejected.
Remove the suspect server from the network. Refer to the Reason section for suspect servers: Hostname and IP Address.
DRFSecurityViolation
The DRF System has detected a malicious pattern which could result in a security violation. The DRF Network Message contains a malicious pattern which could result in a security violation like code injection or directory traversal. DRF Network Message has been blocked.
This alarm occurs when CCD requesting service received a duplicate Hosted DN.
The Call Control Discovery (CCD) requesting service received the same hosted DN from multiple call control entities such as Unified CM Express or another Unified CM cluster. The Cisco CallManager service parameter, Issue Alarm for Duplicate Learned Patterns, controls whether this alarm gets issued.
In RTMT, check the Pattern Report (CallManager > Report > Learned Pattern) and look for the duplicate pattern identified in this alarm. Learned patterns must be unique. Determine which call control entity (such as Unified CM or Unified CM Express) needs to be changed so that there is no duplicate pattern. Refer to the call control entity's configuration guide (help text) to learn how to update a hosted DN pattern. In Unified CM, to change the Hosted DN Pattern go to Cisco Unified CM Administration to update the Hosted DN Pattern configuration (Call Routing > Call Control Discovery > Hosted DN Patterns).
EMAppInitializationFailed
EM Application not started. Error occurred while starting application.
Ensure that every remote cluster added for EMCC has valid hostname/IP address for EM and PSTN access in the Remote Cluster administration window (From Unified CM Administration window, go to System > EMCC > Remote Cluster).
Ensure that the entries are enabled.
Ensure that a bundle of all Tomcat certificates (PKCS12) has been imported into the local tomcat-trust keystore (From the OS Administration window, go to Security > Certificate Management and check the certificates in tomcat-trust).
EMServiceConnectionError
EM Service not reachable. EM Service might be down in one or more nodes in the cluster.
Check if Cisco Extension Mobility service is running on all nodes of the cluster where the service is activated.
EndPointTransientConnection
End point transient connection attempt.
A connection was established and immediately dropped before completing registration. Incomplete registration may indicate that a device is rehoming in the middle of registration. The alarm could also indicate a device misconfiguration, database error, or an illegal/unknown device trying to attempt a connection. Network connectivity problems can affect device registration, or the restoration of a primary Unified CM may interrupt registration.
Investigate any network connectivity problems in the system. It's possible that you have reached the maximum number of devices; the Cisco Unified Communications Manager service parameter, Maximum Number of Registered Devices, controls the number of devices allowed in the system. Taking licensing, system hardware and other related concerns into consideration, you could increase the value of the service parameter. Also, refer to the reason code definitions for recommended actions. No action is required if this event was issued as a result of a normal device rehome.
Device type Enum definitions for EndPointTransientConnection
Value
Definition
1
CISCO_30SP+
2
CISCO_12SP+
3
CISCO_12SP
4
CISCO_12S
5
CISCO_30VIP
6
CISCO_7910
7
CISCO_7960
8
CISCO_7940
9
CISCO_7935
12
CISCO_ATA_186
20
SCCP_PHONE
61
H323_PHONE
72
CTI_PORT
115
CISCO_7941
119
CISCO_7971
255
UNKNOWN
302
CISCO_7989
307
CISCO_7911
308
CISCO_7941G_GE
309
CISCO_7961G_GE
335
MOTOROLA_CN622
336
BASIC_3RD_PARTY_SIP_DEVICE
348
CISCO_7931
358
CISCO_UNIFIED_COMMUNICATOR
365
CISCO_7921
369
CISCO_7906
374
ADVANCED_3RD_PARTY_SIP_DEVICE
375
CISCO_TELEPRESENCE
404
CISCO_7962
412
CISCO_3951
431
CISCO_7937
434
CISCO_7942
435
CISCO_7945
436
CISCO_7965
437
CISCO_7975
446
CISCO_3911
468
CISCO_UNIFIED_MOBILE_COMMUNICATOR
478
CISCO_TELEPRESENCE_1000
479
CISCO_TELEPRESENCE_3000
480
CISCO_TELEPRESENCE_3200
481
CISCO_TELEPRESENCE_500
484
CISCO_7925
493
CISCO_9971
495
CISCO_6921
496
CISCO_6941
497
CISCO_6961
20000
CISCO_7905
30002
CISCO_7920
30006
CISCO_7970
30007
CISCO_7912
30008
CISCO_7902
30016
CISCO_IP_COMMUNICATOR
30018
CISCO_7961
30019
CISCO_7936
30035
IP_STE
Reason Code Enum definitions for EndPointTransientConnection
Value
Definition
1
Unknown—(SCCP only) The device failed to register for an unknown reason. If this persists, collect SDL/SDI traces with "Enable SCCP Keep Alive Trace" enabled and contact TAC.
2
NoEntryInDatabase—(MGCP only) The device is not configured in the Cisco Unified CM database and auto-registration is either not supported for the device type or is not enabled. To correct this problem, configure this device via Cisco Unified CM Administration.
3
DatabaseConfigurationError—The device is not configured in the Cisco Unified CM database and auto-registration is either not supported for the device type or is not enabled. To correct this problem, configure this device via Cisco Unified CM Administration.
4
DeviceNameUnresolveable—For SIP third-party devices, this reason code means that Cisco Unified CM could not determine the name of the device from the Authorization header in the REGISTER message. The device did not provide an Authorization header after Cisco Unified CM challenged with a 401 Unauthorized message. Verify the device is configured with digest credentials and is able to respond to 401 challenges with an Authorization header. If this is a Cisco IP phone, the configuration may be out-of-sync. First go to the Cisco Unified Reporting web page, generate a Unified CM Database Status report, and verify "all servers have a good replication status". If DB replications looks good, reset the phone. If that still doesn't fix the problem, restart the TFTP and the Cisco CallManager services. For all other devices, this reason code means that DNS lookup failed. Verify the DNS server configured via the OS Administration CLI is correct and that the DNS name used by the device is configured in the DNS server.
5
maxDevRegExceeded—Maximum number of device registrations have been reached.
6
ConnectivityError - The network connection between the device and Cisco Unified CM dropped before the device was fully registered. Possible causes include device power outage, network power outage, network configuration error, network delay, packet drops and packet corruption. It is also possible to get this error if the Cisco Unified CM node is experiencing high CPU usage. Verify the device is powered up and operating, verify network connectivity between the device and Cisco Unified CM, and verify the CPU utilization is in the safe range (this can be monitored using RTMT via CPU Pegging Alert).
7
InitializationError—An internal error occurred within Cisco Unified CM while processing the device registration. It is recommended to restart the Cisco CallManager service. If this occurs repeatedly, collect SDL/SDI detailed traces with "Enable SIP Keep Alive (REGISTER Refresh) Trace" and "Enable SCCP Keep Alive Trace" under Cisco CallManager services turned on and contact TAC.
8
DeviceInitiatedReset—Indicates that the error was due to device initiated reset.
9
CallManagerReset—Indicates that the error was due to call manager reset.
10
AuthenticationError—The device failed either TLS or SIP digest security authentication. If the device is a SIP phone and is enabled for digest authentication (on the System > Security Profile > Phone Security Profile, check if "Enable Digest Authentication" checkbox is checked), verify the "Digest Credentials" in the End User config page are configured. Also, check the phone config page to see if the phone is associated with the specified end user in the Digest User drop box. If the device is a third-party SIP device, verify the digest credentials configured on the phone match the "Digest Credentials" configured in the End User page.
11
InvalidX509NameInCertificate—Configured "X.509 Subject Name" doesn't match what is in the certificate from the device. Check the Security Profile of the indicated device and verify the "Device Security Mode" is either "Authenticated" or "Encrypted". Verify the "X.509 Subject Name" field has the right content. It should match the Subject Name in the certificate from the peer.
12
InvalidTLSCipher—Unsupported cipher algorithm used by the device; Cisco Unified CM only supports AES_128_SHA cipher algorithm. Recommended action is for the device to regenerate its certificate with the AES_128_SHA cipher algorithm.
13
DirectoryNumberMismatch—Indicates mismatch between the directory number that the SIP device is trying to register with and the directory number configured in the Cisco Unified CM for the SIP device.
14
MalformedRegisterMsg—(SIP only) A SIP REGISTER message could not be processed because of an illegal format. Possible causes include a missing Call-ID header, a missing AoR in the To header, and an expires value too small. Verify the REGISTER message does not suffer from any of these ills.
15
ProtocolMismatch—The protocol of the device (SIP or SCCP) does not match the configured protocol in Cisco Unified CM.
Recommended actions:
Verify the device is configured with the desired protocol.
Verify the firmware load ID on the Device Defaults page is correct and actually exists on the TFTP server
If there is a firmware load ID configured on the device page, verify it is correct and exists on the TFTP server (On Cisco Unified OS Administration page, Software Upgrades > TFTP File Management, look for the file name as specified by load ID).
Restart the TFTP and Cisco CallManager services. Use the Cisco Unified OS Administration TFTP File Management page to verify the configured firmware loads exist.
16
DeviceNotActive—The device has not been activated.
17
AuthenticatedDeviceAlreadyExists—A device with the same name is already registered. If this occurs repeatedly, collect SDL/SDI detailed traces with "Enable SIP Keep Alive (REGISTER Refresh) Trace" and "Enable SCCP Keep Alive Trace" under Cisco CallManager services turned on and contact TAC. There may be an attempt by unauthorized devices to register.
18
ObsoleteProtocolVersion—(SCCP only) A SCCP device registered with an obsolete protocol version. Power cycle the phone. Verify that the TFTP service is activated. Verify that the TFTP server is reachable from the device. If there is a firmware load ID configured on the Phone Config page, verify that the firmware load ID exists on the TFTP server (On Cisco Unified OS Administration page, Software Upgrades > TFTP File Management, look for the file name as specified by load ID).
23
DatabaseTimeout—Cisco Unified CM requested device configuration data from the database but did not receive a response within 10 minutes.
25
RegistrationSequenceError—(SCCP only) A device requested configuration information from the Cisco Unified CM at an unexpected time. The Cisco Unified CM had not yet obtained the requested information.
26
InvalidCapabilities— (SCCP only) Cisco Unified CM detected an error in the media capabilities reported by the device during registration. The device reported the capabilities in the StationCapabilitiesRes message.
27
CapabilityResponseTimeout— (SCCP only) Cisco Unified CM timed out while waiting for the device to respond to a request to report its media capabilities.
28
SecurityMismatch—Cisco Unified CM detected a mismatch in the security settings of the device and/or the Unified CM. The following mismatches are detected:
The device established a secure connection, yet reported that it does not have the ability to do authenticated signaling.
The device did not establish a secure connection, but the security mode configured for the device indicates that it should have done so.
The device established a secure connection, but the security mode configured for the device indicates that it should not have done so.
29
AutoRegisterDBError—(SCCP only) Auto-registration of a device failed for one of the following reasons:
Auto-registration is not allowed for the device type.
An error occurred in the auto-registration stored procedure.
30
DBAccessError—(SCCP only) Auto-registration of a device failed because of an error that occurred while building the station registration profile.
31
AutoRegisterDBConfigTimeout—(SCCP only) Cisco Unified CM timed out during auto-registration of a device. The registration profile of the device did not get inserted into the database in time.
32
DeviceTypeMismatch—(SCCP only) The device type reported by the device does not match the device type configured on the Cisco Unified CM.
33
AddressingModeMismatch—(SCCP only) Cisco Unified CM detected an error related to the addressing mode configured for the device. One of the following errors was detected:
The device is configured to use only IPv4 addressing, but did not specify an IPv4 address.
The device is configured to use only IPv6 addressing, but did not specify an IPv6 address.
IPAddressAttributes Enum definitions for EndPointTransientConnection
Value
Definition
0
Unknown—The device has not indicated what this IPv4 address is used for
1
Administrative only—The device has indicated that this IPv4 address is used for administrative communication (web interface) only
2
Signal only—The device has indicated that this IPv4 address is used for control signaling only
3
Administrative and signal—The device has indicated that this IPv4 address is used for administrative communication (web interface) and control signaling
IPv6AddressAttributes Enum definitions for EndPointTransientConnection
Value
Definition
0
Unknown—The device has not indicated what this IPv6 address is used for
1
Administrative only—The device has indicated that this IPv6 address is used for administrative communication (web interface) only
2
Signal only—The device has indicated that this IPv6 address is used for control signaling only
3
Administrative and signal—The device has indicated that this IPv6 address is used for administrative communication (web interface) and control signaling
EndPointUnregistered
An endpoint that has previously registered with Cisco Unified Communications Manager has unregistered. In cases of normal unregistration with reason code "CallManagerReset", "CallManagerRestart", "DeviceInitiatedReset", "EMLoginLogout", or "EMCCLoginLogout", the severity of this alarm is lowered to INFORMATIONAL. An endpoint can unregister for many reasons, both intentional, such as manually resetting the device after a configuration change, or unintentional, such as loss of network connectivity. Other causes for this alarm could include a phone being registered to a secondary node and then the primary node come back online, causing the phone to rehome to the primary Cisco Unified CM node or lack of a KeepAlive being returned from the Cisco Unified CM node to which this endpoint was registered. Unregistration also occurs if Cisco Unified CM receives a duplicate registration request for this same device.
Actions to take vary depending on the reason specified for the endpoint unregistration. If the reason is ConfigurationMismatch, go to the Device Configuration page in Cisco Unified CM Administration, make a change to the Description field for this device, click Save, then reset the device. In the case of a network connectivity or loss of KeepAlives problem, use network diagnostic tools and the Cisco Unified CM Reporting tool to fix any reported network or Unified CM system errors. In the case of an endpoint rehoming to the primary Unified CM node, watch for a successful registration of the device on the primary node. In the case of a duplicate registration request, it may be a non-malicious occurrence due to timing of an endpoint registering and unregistering; if duplicate registration requests continue or if the same endpoint has different IP addresses, confirm the IP address on the physical device itself by checking the settings on the device (settings button). If unregistration of this device was expected, no action is required. Also, refer to the reason code descriptions for recommended actions.
Device type Enum definitions for EndPointUnregistered
Value
Definition
1
CISCO_30SP+
2
CISCO_12SP+
3
CISCO_12SP
4
CISCO_12S
5
CISCO_30VIP
6
CISCO_7910
7
CISCO_7960
8
CISCO_7940
9
CISCO_7935
12
CISCO_ATA_186
20
SCCP_PHONE
61
H323_PHONE
72
CTI_PORT
115
CISCO_7941
119
CISCO_7971
255
UNKNOWN
302
CISCO_7989
307
CISCO_7911
308
CISCO_7941G_GE
309
CISCO_7961G_GE
335
MOTOROLA_CN622
336
BASIC_3RD_PARTY_SIP_DEVICE
348
CISCO_7931
358
CISCO_UNIFIED_COMMUNICATOR
365
CISCO_7921
369
CISCO_7906
374
ADVANCED_3RD_PARTY_SIP_DEVICE
375
CISCO_TELEPRESENCE
404
CISCO_7962
412
CISCO_3951
431
CISCO_7937
434
CISCO_7942
435
CISCO_7945
436
CISCO_7965
437
CISCO_7975
446
CISCO_3911
468
CISCO_UNIFIED_MOBILE_COMMUNICATOR
478
CISCO_TELEPRESENCE_1000
479
CISCO_TELEPRESENCE_3000
480
CISCO_TELEPRESENCE_3200
481
CISCO_TELEPRESENCE_500
484
CISCO_7925
493
CISCO_9971
495
CISCO_6921
496
CISCO_6941
497
CISCO_6961
20000
CISCO_7905
30002
CISCO_7920
30006
CISCO_7970
30007
CISCO_7912
30008
CISCO_7902
30016
CISCO_IP_COMMUNICATOR
30018
CISCO_7961
30019
CISCO_7936
30035
IP_STE
Reason Code Enum definitions for EndPointUnregistered
Value
Definition
1
Unknown—The device has unregistered for an unknown reason. If the device does not reregister within 5 minutes, verify it is powered-up and verify network connectivity between the device and Cisco Unified CM.
2
NoEntryInDatabase—Device not configured properly in the Cisco Unified CM database.
3
DatabaseConfigurationError—Device configuration error in the Cisco Unified CM database.
4
DeviceNameUnresolveable—The Cisco Unified CM is unable to resolve the device name to an IP Address internally.
5
MaxDevRegExceeded—Maximum number of device registrations have been reached.
6
ConnectivityError—Network communication between the device and Cisco Unified CM has been interrupted. Possible causes include device power outage, network power outage, network configuration error, network delay, packet drops and packet corruption. It is also possible to get this error if the Cisco Unified CM node is experiencing high CPU usage. Verify the device is powered up and operating, verify network connectivity between the device and Cisco Unified CM, and verify the CPU utilization is in the safe range (this can be monitored using RTMT via CPU Pegging Alert).
7
InitializationError—Indicates that an error occurred when the Cisco Unified CM tries to initialize the device.
8
DeviceInitiatedReset—The device has initiated a reset, possibly due to a power cycle or internal error. No action required; the device will reregister automatically.
9
CallManagerReset—A device reset was initiated from Cisco Unified CM Administration, either due to an explicit command from an administrator, or due to internal errors encountered. No action necessary, the device will reregister automatically.
10
DeviceUnregistered—The device has explicitly unregistered. Possible causes include a change in the IP address or port of the device. No action is necessary, the device will reregister automatically.
11
MalformedRegisterMsg—(SIP only) A SIP REGISTER message could not be processed because of an illegal format. Possible causes include a missing Call-ID header, a missing AoR in the To header, and an expires value too small. Verify the REGISTER message does not suffer from any of these ills.
12
SCCPDeviceThrottling—(SCCP only) The indicated SCCP device exceeded the maximum number of events allowed per-SCCP device. Events can be phone calls, KeepAlive messages, or excessive SCCP or non-SCCP messages. The maximum number of allowed events is controlled by the Cisco CallManager service parameter, Max Events Allowed. When an individual device exceeds the number configured in that service parameter, Unified CM closes the TCP connection to the device; automatic reregistration generally follows. This action is an attempt to stop malicious attacks on Unified CM or to ward off excessive CPU usage. No action necessary, the device will reregister automatically.
13
KeepAliveTimeout—A KeepAlive message was not received. Possible causes include device power outage, network power outage, network configuration error, network delay, packet drops and packet corruption. It is also possible to get this error if the Unified CM node is experiencing high CPU usage. Verify the device is powered up and operating, verify network connectivity between the device and Unified CM, and verify the CPU utilization is in the safe range (this can be monitored using RTMT via CPU Pegging Alert). No action necessary, the device will reregister automatically.
14
ConfigurationMismatch—(SIP only) The configuration on the device does not match the configuration in Unified CM. This can be caused by database replication errors or other internal Unified CM communication errors. First go to the Cisco Unified Reporting web page, generate a Unified CM Database Status report, and verify "all servers have a good replication status". If this device continues to unregister with this reason code, go to the Cisco Unified CMAdmin Device web page for the device and click Save. This allows a change notify to be generated to the Unified CM and TFTP services and rebuild a new config file. If the problem still persists, restart the TFTP service and Unified CM service.
15
CallManagerRestart—A device restart was initiated from Cisco Unified CM, either due to an explicit command from an administrator, or due to a configuration change such as adding, deleting or changing a DN associated with the device. No action necessary, the device will reregister automatically.
16
DuplicateRegistration—Cisco Unified CM detected that the device attempted to register to two nodes at the same time. Cisco Unified CM initiated a restart to the phone to force it to rehome to a single node. No action necessary, the device will reregister automatically.
17
CallManagerApplyConfig—An ApplyConfig command was invoked from Unified CM Administration resulting in an unregistration. No action necessary, the device will reregister automatically.
18
DeviceNoResponse—The device did not respond to a reset or restart notification, so it is being forcefully reset. If the device does not reregister within 5 minutes, confirm it is powered-up and confirm network connectivity between the device and Cisco Unified CM.
19
EMLoginLogout —The device has been unregistered due to an Extension Mobility login or logout.
20
EMCCLoginLogout—The device has been unregistered due to an Extension Mobility Cross Cluster login or logout.
21
PowerSavePlus—The device powered off as a result of the Power Save Plus feature that is enabled for this device. When the device powers off, it remains unregistered from Unified CM until the Phone On Time defined in the Product Specific Configuration for this device.
22
CallManagerForcedRestart—(SIP Only) The device did not respond to an Apply Config request and as a result, Unified CM sent a restart request to the device. The device may be offline due to a power outage or network problem. Confirm that the device is powered-up and that network connectivity exists between the device and Unified CM.
23
SourceIPAddrChanged—(SIP Only) The device has been unregistered because the IP address in the Contact header of the REGISTER message has changed. The device will be automatically reregistered. No action is necessary.
24
SourcePortChanged—(SIP Only) The device has been unregistered because the port number in the Contact header of the REGISTER message has changed. The device will be automatically re-registered. No action is necessary.
25
RegistrationSequenceError—(SCCP only) A device requested configuration information from the Unified CM at an unexpected time. The Unified CM no longer had the requested information in memory.
26
InvalidCapabilities—(SCCP only) Unified CM detected an error in the updated media capabilities reported by the device. The device reported the capabilities in one of the StationUpdateCapabilities message variants.
28
FallbackInitiated—The device has initiated a fallback and will automatically reregister to a higher-priority Unified CM. No action is necessary.
29
DeviceSwitch—A second instance of an endpoint with the same device name has registered and assumed control. No action is necessary.
IPAddressAttributes Enum definitions for EndPointUnregistered
Value
Definition
0
Unknown—The device has not indicated what this IPv4 address is used for
1
Administrative only—The device has indicated that this IPv4 address is used for administrative communication (web interface) only
2
Signal only—The device has indicated that this IPv4 address is used for control signaling only
3
Administrative and signal—The device has indicated that this IPv4 address is used for administrative communication (web interface) and control signaling
IPV6AddressAttributes Enum definitions for EndPointUnregistered
Value
Definition
0
Unknown—The device has not indicated what this IPv6 address is used for
1
Administrative only—The device has indicated that this IPv6 address is used for administrative communication (web interface) only
2
Signal only—The device has indicated that this IPv6 address is used for control signaling only
3
Administrative and signal— The device has indicated that this IPv6 address is used for administrative communication (web interface) and control signaling
ErrorChangeNotifyClientTimeout
A change notification client was responding slowly and has been removed. A change notification recipient has not responded to change notification in several minutes and was thus removed. This may delay call processing features, such as call forwarding and so on.
History
Cisco Unified CommunicationsRelease
Action
8.0(1)
Added Routing List elements and deleted Data Collector element.
Rebooting the box will clear this situation. Alternatively, dbnotify trace could be analyzed to find the client that was removed and that service could be restarted in Cisco Unified Serviceability.
ErrorParsingDirectiveFromPDP
Cisco Unified Communications Manager (Unified CM) failed to parse the call routing directive or the diversion destination in the call routing response from the policy decision point (PDP).
A routing response was received but Cisco Unified Communications Manager (Unified CM) failed to parse the mandatory elements in the response. This means that a call routing directive or the call diversion destination could not be parsed correctly, or that the call routing directive was not recognized. The error may due to a syntax error or because the call routing directive is missing or the call diversion destination is missing in the call routing response.
Check the external call control documentation, including any applicable API documentation, to determine whether the call routing directive that was included as part of the policy obligations in the call routing response are correctly entered according to the information defined in the external call control documentation.
ErrorReadingInstalledRPMS
Could not read installed RPMs to populate component version table. The function that reads the RPM version information and populates database failed.
The policy decision point (PDP) returned a 4xx (client) or 5xx (server) status code in the HTTP response.
Cisco Unified Communications Manager (Unified CM) received a 4xx or 5xx response from the policy decision point (PDP). A 4xx response indicates errors in the call routing request from Unified CM, for example: a 400 response indicates the call routing request could not be understood by the PDP; a 404 indicates that the PDP did not find a matching request URI. A 5xx error indicates a PDP server error, for example: a 500 response indicates a PDP internal error; A 501 response indicates that the PDP does not support the functionality to generate a call routing response; a 503 indicates that the PDP is busy and temporarily cannot generate a response; a 505 indicates that the HTTP version number included in the call routing request from Unified CM is not supported. Other such errors may be responsible; please refer to generally available guidelines on HTTP or check the RFC 2616 for detailed explanations about HTTP Status Code definitions.
The status code and reason phrase for the failure(String)
Recommended Action
If a 4xx response caused the alarm, verify that the PDP has been accurately configured for the functionality and call routing that you expect it to perform. If a 500 response causes the alarm, check whether the PDP service is active and check the PDP server's log files for any errors. If a 503 causes the alarm, the PDP may be overloaded by requests. Take appropriate action to reduce the load on the PDP by following some or all of these recommendations: 1) consider adding more PDPs and provisioning Unified CM with additional call intercept profiles and call intercept trigger points in the various configuration pages under the Call Routing menu in Cisco Unified CM Administration; 2) provision a pair of policy servers per call-intercept profile to enable load balancing; or 3) verify that the PDP server in your deployment meets or exceed the hardware requirements specified in the documentation for Cisco Enterprise Policy Manager (CEPM) or the third-party PDP solution you have deployed. If a 505 response causes the alarm, check to be sure that the PDP supports HTTP version 1.1.
FailedToReadConfig
Service Manager failed to read configuration file.
This alarm indicates that Unified CM was unable to contact the firewall in order to make a IME call. As a consequence, outbound calls are being sent over the PSTN, and inbound calls may be routed over the PSTN by your partner enterprises.
Check to see that your firewall is up. Make sure the mapping service is enabled. Check that the IP address and port on the firewall for that mapping service match the configuration in Unified CM Administration. Check general IP connectivity between Unified CM and the firewall.
Routing List
SDL
SDI
Sys Log
Event Log
Parameter(s)
IP address(String)
Port number(UInt)
ICTCallThrottlingStart
Cisco CallManager stops handling calls for the indicated H.323 device due to heavy traffic or a route loop over the H.323 trunk.
Cisco Unified Communications Manager has detected a route loop over the H.323 trunk indicated in this alarm. As a result, Unified CM has temporarily stopped accepting calls for the indicated H.323 trunk. It's also possible that a high volume of calls are occurring over the intercluster trunk, which has triggered throttling.
Device Name [String] IP Address [String] Device type. [Optional] [Enum]Device description [Optional]. [String]
Enum Definitions for DeviceType
125—TRUNK
Recommended Action
In Real-Time Monitoring Tool, check the CallsActive and CallsInProgress counters for unusual activity on the indicated H.323 trunk. If the CallsActive count is significantly higher than usual, a traffic load issue may be occurring where the demand to send calls over the trunk is greater than the trunk's capacity. Monitor the situation and collect existing trace files. If the ICTCallThrottlingEnd alarm is not issued in a reasonable amount of time as deemed by your organization, contact TAC and supply the trace information you have collected. For a routing loop condition, the CallsInProgress counter will be significantly higher than usual. By examining trace files and CDR data for calls that occurred over the indicated trunk, you may be able to detect a translation pattern, route list or other routing mechanism that is part of the loop. Update the routing mechanism that resulted in the loop (generally the same number is configured on both near end and far end devices) and then reset the affected route list in an attempt to clear the route loop and if that fails, reset the affected trunk.
IDSEngineCritical
This alarm does not compromise data or prevent the use of the system but need to be monitored by the Administrator.
Event Class ID [String] Event class message [String] Event Specific Message [String]
Recommended Action
This alarm needs monitoring by the db admin.
IDSEngineFailure
Combined alarm for emergency and error situations. Something unexpected occurred that might compromise data or access to data or cause IDS to fail. This alarm indicates combined alarm for emergency and error situations. Something unexpected occurred that might compromise data or access to data or cause IDS to fail
This alarm is generated when Unified CM is processing a IME call, and is attempting to allocate a PSTN fallback DID and a DTMF digit sequence to associate with this call. However, there are too many IME calls currently in progress which are utilizing this same fallback DID, and as a result, there are no more DTMF digit sequences which could be allocated to this call. As such, this call will proceed, however mid-call fallback will not be possible for this call.
Your first course of action should be to identify the fallback profile associated with this call. Its name will be present in the alarm. Check that profile from the admin interface, and examine the current setting for "Fallback Number of Correlation DTMF Digits". Increase that value by one, and check if that eliminates these alarms. In general, this parameter should be large enough such that the number of simultaneous IME calls made to enrolled numbers associated with that profile is always substantially less than 10 raised to the power of this number. "Substantially" should be at least a factor of ten. For example, if you always have less than 10,000 simultaneous IME calls for the patterns associated with this fallback profile, setting this value to 5 (10 to the power of 5 is 100,000) will give you plenty of headroom and you will not see this alarm.
However, increasing this value also results in a small increase in the amount of time it takes to perform the fallback. As such, it should not be set arbitrarily large; it should be set just large enough to keep clear of this alarm. Another alternative to increasing this parameter is to add another fallback profile with a different fallback DID, and associate that fallback profile with a smaller number of enrolled DID patterns. This will allow you to get by with a smaller number of digits.
InvalidIPNetPattern
An invalid IP address is configured in one or more SIP route patterns in Cisco Unified CM Administration.
In Cisco Unified CM Administration, verify that the route pattern associated with the device that is identified in this alarm has an accurate and working IP address. You can learn more how to ensure that the IP address is valid by reviewing RFC 2373.
InvalidPortHandle
The handle for the opened serial port is invalid.
CMI cannot read/write to the serial port because the serial port returned an invalid handle value to CMI. The serial port may have returned an invalid handle because the system did not properly detect the USB cable.
IPv6 network interface is not installed. IPv6 option is enabled for TFTP service but the IPv6 network interface/address has not been configured on the system. Until the IPv6 network is functioning, devices that have been configured with IPv6-only will not be able to register. Devices that have been configured to use either IPv6 or IPv4 will register using IPv4. When the IPv6 network is online, IPv6-capable devices that have registered as IPv4 will remain IPv4 until they are reset, at which time they will use IPv6 if available.
Install IPv6 network interface and then restart TFTP service.
kANNDeviceRecordNotFound
ANN device record not found. A device record for the announcer device was not found in the database. The ANN device is normally automatically added when the server is added to the database.
To add the ANN device to database you will need to remove/delete the server and read the server. WARNING: This may result in having to manually reconfigure many different settings such as Media Resource Groups, CallManager Groups and many others.
kCFBDeviceRecordNotFound
CFB device record not found. A device record for the conference bridge device was not found in the database. The CFB device is normally automatically added when the server is added to the database.
History
Cisco Unified CommunicationsRelease
Action
3.x and 4.x
Added for Windows.
7.0(1)
Obsoleted.
8.0(1)
This alarm is available in 8.0(1). The severity changed from Informational to Error.
Enable trace for the database layer monitor to get specific error information.
kIPVMSDeviceDriverNotFound
Cisco IP voice media streaming driver not found. The Cisco IP voice media streaming driver was not found or is not installed. The Cisco IP Voice Media Streaming App service cannot run until this error is resolved. All software media devices (ANN, CFB, MOH, MTP) for this server will not be available.
Check the system log for an error when the system attempted to load IpVms driver at the last server startup. A server restart is required to cause the driver to be loaded.
kIpVmsMgrNoLocalHostName
Unable to retrieve the local host server name. The Cisco IP Voice Media Streaming App service will terminate. No software media devices (ANN, CFB, MOH, MTP) will be available while the service is stopped.
Check the configuration settings for the server name, DHCP, or DNS. Monitor the status of Cisco IP Voice Media Streaming App service. The service will not operate without a valid server name.
kIpVmsMgrNoLocalNetworkIPAddr
Unable to retrieve the network IP address for host server. Unable to obtain the network IP (dotted) address. The Cisco IP Voice Media Streaming App service will terminate. The software media devices (ANN, CFB, MOH, MTP) will be unavailable while this service is stopped.
Monitor the status of the Cisco IP Voice Media Streaming App service. It should be automatically restarted. If the error occurs again, check the server IP configuration (DHCP, IP address).
kIPVMSMgrWrongDriverVersion
Wrong version of device driver. An incompatible device driver was found. The Cisco IP Voice Media Streaming App service will terminate. The software media devices (ANN, CFB, MOH, MTP) will be unavailable while the service is stopped.
Restart the server to ensure the most recent driver is started. If the error continues, then reinstall Cisco Unified Communications Manager to get the proper driver version installed.
kMOHTFTPGoRequestFailed
Transfer of MOH source file to working path failed. An error was encountered when trying to copy or update a Music-on-Hold audio source file.
History
Cisco Unified CommunicationsRelease
Action
3.x and 4.x
Added for Windows.
7.0(1)
Obsoleted.
8.0(1)
This alarm is available in 8.0(1).
Following parameters added:
Error Description [String] Source Path [String] Destination Path [String] OS Error Code [Int] OS Error Description [String]
Use the Platform CLI to verify the source path and file exist. If the file does not exist then use Cisco Unified CM Admin to reupload the missing audio source to this specific server. Reinstall the Cisco Unified Communications Manager to have all required paths created.
kPWavMgrThreadxFailed
WAV playing manager thread creation failed. The process component used for playing WAV files failed to start, possibly due to low system resources.
Restart the Cisco IP Voice Media Streaming App service or restart server.
kReadCfgUserLocaleEnterpriseSvcParm
Error reading Enterprise User Locale configuration. A database exception was encountered when reading the default Enterprise User Locale setting. Default of US English will be used.
Verify that the Enterprise parameter setting for User Locale is configured using the CCM Admin web page. Restart the Cisco IP Voice Media Streaming App service.
kRequestedANNStreamsFailed
The requested resources for the configured number of annunciator calls (Call Count service parameter) was not available. If the value gets shown as "Allocated," it is non-zero.
Verify that the ANN Call Count service parameter is correct. A server restart may be needed to recover resources.
LostConnectionToSAFForwarder
Connection to the SAF Forwarder has been lost.
A TCP connection failure caused the connection between the SAF Forwarder and Unified CM to be lost. When the TCP connection is restored, Unified CM attempts to connect to the SAF Forwarder automatically. If IP connectivity is unreachable for longer than the duration of the Cisco CallManager service parameter CCD Learned Pattern IP Reachable Duration, calls to learned patterns will be routed through the PSTN instead. Calls through the PSTN to learned patterns will be maintained for a certain period of time before the PSTN failover times out.
Cisco Unified Serviceability Alarm Catalog
CallManager/CallManager
Severity
Error
Routing List
SDL
SDI
Sys Log
Event Log
Data Collector
Parameters
IP Address(String)
SafClientHandle(UInt)
Recommended Action
Investigate possible causes of a TCP connection failure, such as power failure, loose cables, incorrect switch configuration, and so on, and correct any issues that you find. After the connection is restored, CCD will try to register/sync with the SAF Forwarder automatically.
MultipleSIPTrunksToSamePeerAndLocalPort
Multiple trunks have been configured to the same destination and local port, which resulted in a conflict. Only one trunk is allowed for one destination/local port combination. The latest trunk invalidated earlier.
Peer IP Address. [String] Local IP Port [UInt] Old Device name. [String] Old Device Instance. [String] New Device name. [String] New Device Instance. [String]
Recommended Action
Check the SIP Trunk Configuration in Cisco Unified CallManager Administration and verify that only one SIP trunk has been configured to the same destination address and local port.
NodeNotTrusted
Untrusted Node was contacted. Application could not establish secure connection (SSL handshake failure) with another application. It could be due to certificate for tomcat service where the application is hosted is not trusted (not present in the keystore).
Ensure that "tomcat-trust" keystore on each CCM node contains the tomcat certificates for every other node within a cluster (Logon to OS Administration Page > Security > Certificate Management > Check the certificates in tomcat-trust).
If EMCC is enabled, then ensure that a bundle of all tomcat certificates (PKCS12) has been imported into the local tomcat-trust keystore (Logon to OS Administration Page > Security > Certificate Management > Look for certificates in tomcat-trust).
NumDevRegExceeded
The allowed number of registered devices was exceeded.
If you did not expect to exceed the number of devices and you have auto-registration enabled, go to Device > Phones in Cisco Unified CM Administration and search for phones starting with "auto". If you see any unexpected devices which may not belong in the system (such as intruder devices) locate that device using the IP address and remove it from the system. Or, if your licenses and system resources allow, increase the value in the Cisco CallManager service parameter, Maximum Number of Registered Devices.
PublishFailedOverQuota
Each IME server has a fixed quota on the total number of DIDs it can write into the IME distributed cache. When this alarm is generated, it means that, even though you should be under quota, due to an extremely unlikely statistical anomaly, the IME distributed cache rejected your publication, believing you were over quota. You should only see this alarm if you are near, but below, your quota. This error is likely to be persistent, so that the corresponding E.164 number from the alarm will not be published into the IME distributed cache. This means that you will not receive VoIP calls towards that number - they will remain over the PSTN.
The alarm will include the name of the IME server, and the current and target quota values. The first thing to check is to make sure that you have correctly provisioned the right set of DID prefixes on all of the Unified CM clusters sharing that same IME server on the same IME distributed cache. If that is correct, it means you have exceeded the capacity of your IME server, and you require another. Once you have another, you can now split your DID prefixes across two different IME client instances, each on a different IME server. That will alleviate the quota problem.
Routing List
SDL
SDI
Sys Log
Event Log
Parameter(s)
The DID for which the Publish was attempted(String)
Server name(String)
Current quota(UInt)
Maximum target quota(UInt)
ReadConfigurationUnknownException
An exception is caught while retrieving enterprise parameters value from database at TFTP service startup. This is usually caused by a failure to access the Cisco Unified Communications Manager database.
History
Cisco Unified CommunicationsRelease
Action
7.0(1)
Name changed from kReadConfigurationUnknownException.
In Cisco Unified Serviceability, enable Detailed level traces in the Trace Configuration window for TFTP and Cisco Database Layer Monitor services. Also, use RTMT to look for errors that may have occurred around the time of the alarm.
ReadingFileFailure
CMI failed to read SMDI messages from the serial port.
CMI opened the serial port, however it failed to successfully read data from the serial port because the serial port returned an invalid handle value to CMI. The serial port may have returned an invalid handle because the system did not properly detect the USB cable.
Make sure that the cable connecting the USB0 port and voice messaging system is firmly connected.
RsvpNoMoreResourcesAvailable
RSVP Agent resource allocation failed.
The alarm occurs when allocation of an RSVP Agent fails for all the registered RSVP Agents (RSVP Agents are basically MTPs or transcoder devices which provide RSVP functionalities) belonging to the Media Resource Group List and Default List. Each RSVP Agent may fail for different reasons. Following are some of the reasons that could cause an RSVP Agent allocation to fail: available MTP/transcoders do not support RSVP functionality; a capability mismatch between the device endpoint and MTP/transcoder, codec mismatch between the endpoint and the MTP/transcoder; a lack of available bandwidth between the endpoint and the MTP/transcoder; or because the MTP/transcoder resources are already in use.
A capability mismatch may be due to the MTP/transcoder not supporting one or more of the required capabilities for the call such as Transfer Relay Point (which is needed for QoS or firewall traversal), RFC 2833 DTMF (which is necessary when one side of the call does not support RFC 2833 format for transmitting DTMF digits and the other side must receive the DTMF digits in RFC2833 format, resulting in conversion of the DTMF digits), RFC 2833 DTMF passthrough (in this case, the MTP or transcoder does not need to convert the DTMF digits from one format to another format but it needs to receive DTMF digits from one endpoint and transmit them to the other endpoint without performing any modifications), passthrough (where no codec conversion will occur, meaning the media device will receive media streams in any codec format and transmit them to the other side without performing any codec conversion), IPv4 to IPv6 conversion (when one side of the call supports only IPv4 and the other side of the call supports only IPv6 and so MTP needs to be inserted to perform the necessary conversion between IPv4 and IPv6 packets), or multimedia capability (if a call involving video and/or data in addition to audio requires insertion of an MTP or transcoder then the MTP/transcoder which supports multimedia will be inserted).
History
Cisco Unified CommunicationsRelease
Action
8.0(1)
Media Resource List Name(String) parameter is added.
RSVP Agents are basically Cisco IOS MTPs or transcoder devices which provide RSVP functionalities. Check the user manual of the configured MTPs and transcoders to see whether they support RSVP functionality. If none of them support RSVP functionality either they need to be upgraded (if upgraded version support RSVP functionality) or additional MTP or transcoders need to be installed which support RSVP functionality. If the RSVP Agent (MTP or transcoder) allocation is failing due to a capability mismatch, it's possible that the media device does not support the requested capability (such as IPv4 to IPv6 conversion, passthrough) or the capability might not be configured in the device. Please check the user guide and documentation of the media device to make sure that device supports all the necessary capabilities.
Also, caution should be taken care if all the MTP or transcoders are configured with all the supported capabilities. There are certain capabilities (such as RFC 2833 DTMF or RFC 2833 DTMF passthrough or passthrough) which could be supported by most of the MTPs or transcoders and there may be certain capabilities (such as IPv4 to IPv6 conversion and vice versa or RSVP Agent functionality or Transfer Relay Point or multimedia capability) which can be supported by only by a single MTP or transcoder depending on the devices that you have.
For example, you may have end devices belonging to different locations and may need to reserve the bandwidth only between two locations; calls between other locations may not need to reserve the bandwidth. Now, suppose all the MTPs or transcoders are configured with all the supported capabilities and only one MTP/transcoder supports RSVP functionality; if this MTP/transcoder is configured with all the supported capabilities (which all the other MTPs or transcoders in the same MRGL or default MRGL also support) it may happen that this MTP can get allocated for Transfer Relay Point or RFC 2833 DTMF or RFC 2833 DTMF passthrough or passthrough instead. As a result, when a need arises to reserve the bandwidth (which other MTPs or transcoders in the same MRGL or default MRGL do not support), all the resources of this MTP/transcoder may be in use and the RSVP Agent allocation may fail.
To avoid this situation, set the priority of the media resources appropriately. This can be done only in the Media Resource Group List and not in the Default List of the media resources. In any Media Resource Group List all the Media Resource Groups have different priorities and during allocation the first Media Resource Group is checked for availability of the requested type of the media devices. The first Media Resource Group in the Media Resource Group List will have the highest priority, then the second one and so on. To check all the Media Resource Groups and their priority go the Media Resources and Media Resource Group List of Cisco Unified CM Administration page and click the appropriate Media Resource Group List and check the Selected Media Resource Groups; the priority decreases from top to bottom. Position the MTP or transcoder that you want to be selected for the basic functionalities in the higher priority Media Resource Groups whereas the ones with more rare functionality can be positioned in the Media Resource Groups with lower priority. RSVP Agent allocation may fail due to codec mismatch between the end point and the RSVP Agent or MTP/transcoder.
A solution may be to configure the MTP/transcoder with all the supported codecs (as specified in the user guide of the MTP/transcoder), but be aware that doing so might result in too much bandwidth being allocated for calls. You'll need to weigh different factors such as the total amount of available bandwidth, the average number of calls, approximate bandwidth use per call (not involving MTP/transcoder), and so on, and accordingly calculate the maximum bandwidth that can be allocated per call involving an MTP/transcoder and take that into consideration when configuring the supported codecs in the MTPs and transcoders. A good idea is to configure the media devices with all the supported codecs and set the region bandwidths to restrict too much bandwidth usage (refer to the Unified CM documentation for details on region and location settings).
Also, there may be codec mismatch between the endpoint and the MTP/transcoders after considering the region bandwidth between the MTP/transcoder and the endpoint. Increasing the region bandwidth may be a solution to the problem, but that decision should be made after careful consideration of the amount of bandwidth you're willing to allocate per call between the set of regions.
Another possible cause that an MTP/transcoder did not get allocated is because there was not enough available bandwidth for the call. This can happen if the MTP/transcoder and endpoint belong to different locations and the bandwidth that is set between the locations is already in use by other calls. Examine the bandwidth requirements in your deployment to determine whether bandwidth between the locations can be increased. However, note that increasing the bandwidth between these two locations means that you may need to reduce the bandwidth between other locations.
Refer to the System Guide, SRNDs, and related Unified CM documentation for more details. Be aware that reducing the bandwidth or removing the higher bandwidth codecs from configuration may result in poor voice quality during call. Consider increasing the total amount of network bandwidth. Finally, if RSVP Agent allocation fails due to MTP/transcoder not supporting RSVP functionality or capability mismatch or all the resources being in use, consider installing additional MTP or transcoder devices which support RSVP functionality.
RTMT_ALERT
A Real-Time Monitoring Tool (RTMT) process in the AMC service uses the alarm mechanism to facilitate delivery of RTMT alerts in the RTMT AlertCentral or through email.
Check AlertCentral in RTMT or any alerts that you have received through email to determine what issue has occurred and learn the recommended actions to resolve it. In AlertCentral, right-click the alert to open the alert information.
RTMT-ERROR-ALERT
This alert is generated by RTMT AlertMgr. See Alert Detail for explanation.
SAF_BAD_REQUEST - SAF Forwarder was unable to accept the request due to incorrect syntax (malformed), missing required attributes, and other similar reasons. Investigate the configuration between the SAF Forwarder and Unified CM to be certain that all settings are correct for your deployment. In particular, check the Client Label configured on the router to make certain that it matches the Client Label configured in Cisco Unified CM Administration on the SAF Forwarder Configuration window (SAF > SAF Forwarder).
431
SAF_INTEGRITY_CHECK_FAILURE - A message failed to pass SAF Forwarder security validation. This can occur because of misconfiguration, a potential attack, or more commonly by incorrect provisioning of the password on the Forwarder and SAF client. Reprovision the password and keep a watch on further SAF INTEGRITY CHECK FAILURE alarms. If you receive a persistent number of SAF INTEGRITY CHECK FAILURE alarms, close the interface between SAF Forwarder and Unified CM and investigate the source of the IP packets.
435
**INFO LEVEL** SAF_MISSING_NONCE - A nonce (a random parameter generated when the message is sent) is missing from the message. The system will resend with a new nonce automatically. No action is required.
436
SAF_UNKNOWN_USERNAME - Unified CM sent the SAF Forwarder an Application User name that is not configured on the router or that does not match the router's configuration. Check the Application User Name on the router and in the Application User Configuration window in Cisco Unified CM Administration to be sure they match.
438
**INFO LEVEL** SAF_STALE_NONCE - A nonce (a random parameter generated when the message is sent) has aged out (gone stale). The system will resend with a new nonce automatically. No action is required.
471
**INFO LEVEL** SAF_BAD_CLIENT_HANDLE - SAF_BAD_CLIENT_HANDLE - Unified CM sent the SAF Forwarder a Register message (for KeepAlive purposes) or unregister message with the mandatory CLIENT_HANDLE value, but the SAF Forwarder did not recognize the client handle. Unified CM will attempt to reregister with the SAF Forwarder without a client handle. This alarm is for informational purposes only; no action is required.
472
**INFO LEVEL** SAF_VERSION_NUMBER_TOO_LOW - Unified CM published a service (such as Hosted DN) whose version number is now lower than when it was previously published to the SAF Forwarder. The service is out of sync with the SAF Forwarder. Unified CM will republish the service in an attempt to resynch with the SAF Forwarder. This alarm is for informational purposes only; no action is required.
473
**INFO LEVEL** SAF_UNKNOWN_SERVICE - Unified CM attempted to unpublish a service from the SAF network but the SAF Forwarder does not have a publish record for that service. This alarm is for informational purposes only; no action is required.
474
**INFO LEVEL** SAF_UNREGISTERED - Unified CM attempted to publish or subscribe to the SAF Forwarder, but Unified CM is not registered with SAF Forwarder. Unified CM will automatically reregister with the SAF Forwarder before attempting to publish or subscribe. This alarm is for informational purposes only; no action is required.
475
**INFO LEVEL** SAF_BAD_FILTER - Unified CM attempted to subscribe to the SAF Forwarder with a filter that does not match any of the SAF Forwarder's current filters. Unified CM will resend the subscribe message with the appropriate filter value. This alarm is for informational purposes only; no action is required.
476
SAF_UNKNOWN_SUBSCRIPTION - Unified CM sent a subscribe or unsubscribe message to the SAF Forwarder but the message contained a Service ID that was not familiar to the SAF Forwarder. Without a recognized Service ID, Unified CM cannot subscribe to the SAF Forwarder. Recommended action is to contact the Cisco Technical Assistance Center (TAC).
477
**INFO LEVEL** SAF_ALREADY_REGISTERED - Unified CM attempted to register with the SAF Forwarder but SAF Forwarder indicates that Unified CM is already registered. Unified CM will close and reopen the TCP connection and send a new register request without a client handle to SAF Forwarder. This alarm is for informational purposes only; no action is required.
478
SAF_UNSUPPORTED_PROTOCOL_VERSION - Unified CM attempted to register with the SAF Forwarder using a SAF protocol version number that is greater than the protocol version number supported by the SAF Forwarder. Issue a show version command on the SAF Forwarder CLI to determine the SAF Forwarder protocol version; refer to the information in this alarm for the SAF protocol version number. If the versions do not match, check the Cisco Unified Communications Manager Software Compatibility Matrix (available on Cisco.com) to determine whether the SAF protocol version number that is in use on this Unified CM is compatible with the SAF Forwarder protocol version. If it is not, upgrade the lower-versioned component so that both Unified CM and the SAF Forwarder use the same, compatible version.
479
SAF_UNKNOWN_AS - Unified CM attempted to register to the SAF Forwarder but the registration message contained a Client Label that was not familiar to the Autonomous System (AS) on the SAF Forwarder router. Recommended action is to issue the appropriate CLI commands on the SAF Forwarder to associate the Client Label with the autonomous system on the router (refer to the Configuration Guide for the router) and configure the same Client Label in the Client Label field on the SAF Forwarder Configuration window in Cisco Unified CM Administration and click Save. When the Client Label is saved in Cisco Unified CM Administration, Unified CM automatically sends a new registration request to the SAF Forwarder with the updated Client Label information.
500
**INFO LEVEL** SAF_RESPONDER_ERROR - Unified CM sent a message (such as register/unregister/publish/unpublish/subscribe) to the SAF Forwarder but the SAF Forwarder responded that it is unable to process the message at this time. This might be due to heavy message queuing, internal resource issues, and so on. Unified CM will wait several seconds and then retry the request. This alarm is for informational purposes only; no action is required.
1000
SAF_INVALID_CONNECTION_DETAILS
SAFResponderError
SAF Responder Error 500.
This is raised when SAF forwarder doesn't know the transaction ID within SAF response from this Cisco Unified CM.
Review configuration for scheduled collection job under Job Status window.
SerialPortGetStatusError
When CMI tries to get the status of serial port, the operating system returns an error.
CMI triggers this alarm when it cannot get the status of the serial port. An inability to receive the serial port status information can be caused by a loose or disconnected USB cable.
Make sure that the cable connecting the USB0 port and voice messaging system is firmly connected.
SerialPortSetStatusError
When CMI tries to set the status of serial port, the operating system returns an error.
CMI triggers this alarm when it cannot set the status of the serial port. An inability to receive the serial port status information can be caused by a loose or disconnected USB cable.
The normalization script has exceeded an internal resource threshold.
The normalization script for the indicated SIP device has exceeded an internal threshold for resource consumption. This alarm can occur for memory consumption, or when the script is close to exceeding the configured allowance of Lua instructions. When the amount of memory (as defined in the Memory Threshold field) or the number of Lua instructions utilized by this script (as defined by the Lua Instruction Threshold) exceeds an internal threshold, this alarm is triggered.
Examples
If the memory threshold is set to 100 KB and the internal threshold is 80%, this alarm will occur when this script has consumed 80 KB of memory. The internal threshold is not configurable and may fluctuate from Cisco Unified CM release to release.
If the Lua Instruction Threshold is set to 2000 and the internal threshold is 50%, this alarm will occur when the script has executed 1000 Lua instructions.
This alarm warns that the resources (either memory or Lua instructions) have crossed an internal mark, where investigation into the consumption of those resources may be advisable to ensure the health of the script.
Examine the thresholds (Memory Threshold and Lua Instruction Threshold) configured in the SIP Normalization Script Configuration window.
Evaluate if the thresholds can be increased (take into consideration the CPU resources and memory when deciding to increase these values), or examine the script to determine if the message handlers can be written more efficiently to reduce the number of instructions in the script.
Examine the script for logic errors. If the script is functioning normally but contains extensive logic, consider increasing the value in the Lua Instruction Threshold field. Be aware that more computing resources will be consumed as a result. You can also examine SDI trace files for additional details about this resource condition. For scripts provided by Cisco, contact the Cisco Technical Assistance Center (TAC).
Investigate and correct the resource issue before the script closes. When the values that have been configured in the Memory Threshold field, or Lua Instruction Threshold field or both the fields on the SIP Normalization Script Configuration window are met, the script closes and the SIPNormalizationScriptClosed alarm also occurs. For additional information when troubleshooting, check the SIP Normalization counter, MemoryUsagePercentage to learn the current resource usage.
Reason Code Enum definitions for SIPNormalizationResourceWarning
Value
Definition
1
InternalLuaInstructionsThreshold—The script exceeds the internal threshold for the number of Lua instructions.
2
InternalMemoryThreshold—The script exceeds the internal threshold for script memory usage.
SIPNormalizationScriptError
Description
A script error occurred.
Explanation
Cisco Unified CM encountered an error during loading, initializing, or during execution of the SIP normalization script for the indicated SIP device. If the error was due to a resource issue, the SIPNormalizationResourceWarning alarm will also be issued. The Configured Action shown in this alarm may differ from the Resulting Action shown in this alarm because certain errors, such as those occurring during loading or initialization, cannot be configured. If the script closes three times within a 10 minute window due to errors, Cisco Unified CM will follow the configured action three times; on the fourth occurrence of the error, Unified CM disables the script and issues the SIPNormalizationAutoResetDisabled alarm.
Examine SDI trace files for details regarding the error such as function calls and the call ID. This will help you to troubleshoot the error.
Examine the script for syntax or logic errors; for scripts provided by Cisco, contact the Cisco Technical Assistance Center (TAC). If the error was due to a resource issue, the SIPNormalizationResourceWarning alarm will also be issued. Check the SIPNormalizationResourceWarning alarm for additional information and recommended actions.
LoadError —The script failed to load either due to a syntax error in the script or a resource error; check the Recommended Actions for instructions.
2
InitializationError—The script encountered a failure while initializing either due to a syntax error in the script or a resource error; check the Recommended Actions for instructions.
3
ExecutionError—The script encountered a failure during execution; check the Recommended Actions for instructions.
4
InternalError—The system encountered an unexpected condition during execution; check the Recommended Actions for instructions.
SIPTrunkOOS
All remote peers are out of service and unable to handle calls for this SIP trunk.
This alarm provides the list of unavailable remote peers, where each peer is separated by semicolon. It also provides the reason code received by the SIP trunk, in response to an Options request sent to remote peer. For each peer, the alarm provides the hostname or SRV (if configured on SIP trunk), resolved IP address, port number, and reason code in the following format:
ReasonCodeType=ReasonCode.
The ReasonCodeType depends on a SIP response from the remote peer as defined in SIP RFCs (remote), or depends on a reason code provided by Unified CM (local).
The examples of possible reason codes include:
Remote = 503 ("503 Service Unavailable" a standard SIP RFC error code)
Remote = 408 ("408 Request Timeout" a standard SIP RFC error code)
Local = 1 (request timeout)
Local = 2 (local SIP stack is unable to create a socket connection with remote peer)
Local = 3 (DNS query failed)
For Local=3, IP address in the alarm is represented as zero, and when DNS SRV is configured on SIP trunk then the port is represented as zero.
Route/SIP trunk for originating side does not exist on remote peer. If remote peer is Unified CM, add a new SIP trunk in Unified CM Administration for the remote peer (Device > Trunk) and ensure the Destination Address and Destination Port fields are configured to point to the originating host (the originating host is the same node on which this alarm was generated).
Route/SIP trunk for originating side does exist on remote peer but the port is either used for a SIP phone or a different SIP trunk. If remote peer is Unified CM, in the Unified CM Administration for the remote peer (Device > Trunk), ensure the Destination Port on the originating side is configured to be the same as the incoming port on the terminating side SIP Trunk Security Profile.
Remote peer has limited resources to handle new calls. If remote peer is administered by a different system administrator, communicate the resource issue with the other administrator.
For Remote = 408, the possible reason includes:
Remote peer has limited resources to handle new calls. If remote peer is administered by a different system administrator, communicate the resource issue with the other administrator.
For Local = 1, the possible reason could be that no responses are received for OPTIONS request after all retries, when UDP transport is configured in the SIP trunk Security Profile assigned to the SIP trunk on the originating side.
To fix this issue, perform the following steps:
If remote peer is Unified CM, in the remote peer Serviceability application, choose Tools > Control Center (Feature Services) and ensure the Cisco CallManager service is activated and started.
In the Unified CM Administration for the remote peer, choose Device > Trunk, and ensure the SIP trunk exists with the incoming port in associated SIP Trunk Security Profile configured to be same as originating side SIP Trunk destination port.
Check the network connectivity by using the CLI command utils network ping <remote peer> at the originating side.
For Local = 2, the possible reason could be that Unified CM is unable to create the socket connection with remote peer.
To fix this issue, perform the following steps:
If remote peer is Unified CM, in the remote peer Serviceability application, choose Tools > Control Center (Feature Services) and ensure the Cisco CallManager service is activated and started.
In the Unified CM Administration for the remote peer, choose Device > Trunk and ensure the SIP trunk exists with the incoming port in associated SIP Trunk Security Profile configured to be same as originating side SIP Trunk destination port.
Check the network connectivity by using the CLI command utils network ping <remote peer> at the originating side.
For Local = 3, the possible reason could be that DNS server is not reachable, or DNS is not properly configured to resolve the hostname or SRV which is configured on the local SIP trunk.
To fix this issue, perform the following steps:
In the OS Administration, choose Show > Network and verify whether the DNS details are correct. If it is not correct, configure the correct DNS server information by using the CLI command set network dns primary.
Check the network connectivity with DNS server by using the CLI command utils network ping <remote peer>, and ensure the DNS server is properly configured.
SparePartitionLowWaterMarkExceeded
The percentage of used disk space in the spare partition has exceeded the configured low water mark.
Note
Spare Partition is not used for Intercompany Media Engine server. So this alert will not be triggered for Intercompany Media Engine.
Login into RTMT and check the configured threshold value for LogPartitionLowWaterMarkExceeded alert in Alert Central. If the configured value is set to a lower than the default threshold value unintentionally, change the value to default. Also, examine the trace and log file setting for each of the application in trace configuration page under Cisco Unified CM Serviceability.
If the number of configured traces or logs is set to greater than 1000, adjust the trace settings from trace configuration page to default. Also, clean up the trace files that are less than a week old. You can clean up the traces using cli "file delete" or using Remote Browse from RTMT Trace and Log Central function.
Unknown exception was caught while processing file request. This usually indicates a lack of memory when there is a system issue such as running out of resources.
History
Cisco Unified CommunicationsRelease
Action
7.0(1)
Name changed from kThreadPoolProxyUnknownException.
Use RTMT to monitor the system memory resources and consumption and correct any system issues that might be contributing to a reduced amount of system resources.
UnableToRegisterwithCallManagerService
CTI cannot communicate with Cisco CallManager service to register supplementary service features.
Check the status of the Cisco CallManager service in Cisco Unified Serviceability > Tools > Control Center - Featured Services. At least one Cisco CallManager service should be running in the cluster for CTIManager to register feature managers. Restart the CTIManager service if the problem persists. If CallManager service is active, verify network connectivity between the Unified CM node that hosts CTIManager service and Unified CM node that hosts CallManager service.
UserLoginFailed
User log in failed because of bad user ID or password.
CMI failed to write SMDI messages to the serial port.
CMI opened the serial port, however it failed to successfully write data to the serial port because the serial port returned an invalid handle value to CMI. The serial port may have returned an invalid handle because the system did not properly detect the USB cable.
The warning-level alarm is 4 and action is needed but priority of action is determined by the condition. A warning about some bad condition, which is not necessarily an error. Configuration error or an alarm that by itself does not indicate a warning but several instances of the same alarm do. Examples are:
Configuration error
One alarm of this level may not mean that an error has occurred but multiple of these would be considered an error
Annunciator resource allocation failed for one or more of the following reasons: all Annunciator resources are already in use; there was a codec or capability mismatch (such as the endpoint using one type of IP addressing such as IPv6, while the Annunciator supports only IPv4) between the endpoint and the Annunciator resource; not enough bandwidth existed between the endpoint and the Annunciator.
If all the resources of the Annunciator are already in use, check to be sure that all the Annunciators that belong to the Media Resource Groups of the indicated Media Resource Group List and Default List are configured and registered in all the applicable Unified CM nodes of the cluster. To check the registration status go to the Media Resources > Annunciator menu and click the Find button. It will display all the Annunciators with their status, device pool, and so on.
Check the status field to see whether it is registered with Unified CM. Note that the display on the status field is not a confirmation that the device is registered to Unified CM. It may happen in a Unified CM cluster that the Publisher can only write to the Unified CM database before the Publisher goes down. Because the Subscriber may not be able to write to the database, the devices may still display registered in Unified CM Administration after they are actually unregistered. However, if the Publisher is down that should generate another alarm with higher priority than this alarm.
The Annunciator allocation can fail due to codec mismatch or capability mismatch between the endpoint and the Annunciator. If there is a codec mismatch or capability mismatch (such as the endpoint using IPv6 addressing but Annunciator supporting only IPv4), an MTP or transcoder should be allocated. So, if the MTP or transcoder is not allocated then either MediaResourceListExhausted (with Media Resource Type as Media termination point or transcoder) or MtpNoMoreResourcesAvailable alarm will be generated for the same Media Resource Group List and you should first concentrate on that.
The Annunciator allocation may even fail after checking the region bandwidth between the regions to which the held party belongs and the region to which the Annunciator belongs. Increasing the region bandwidth may be a solution to the problem, but that decision should be made after careful consideration of the amount of bandwidth you're willing to allocate per call between the set of regions. You'll need to weigh different factors such as the total amount of available bandwidth, the average number of calls, the average number of calls using the Annunciator, approximate bandwidth use per call, and so on, and accordingly calculate the region bandwidth.
Another possible cause is that the bandwidth needed for the call may not be available. This can happen if the Annunciator and endpoint belong to different locations and the bandwidth that is set between the locations is already in use by other calls. Examine the bandwidth requirements in your deployment to determine whether bandwidth between the locations can be increased.
However, note that increasing the bandwidth between these two locations means that you may need to reduce the bandwidth between other locations. Refer to the System Guide, SRNDs, and related Unified CM documentation for more details. Be aware that reducing the bandwidth or removing the higher bandwidth codecs from configuration may result in poor voice quality during call. Consider increasing the total amount of network bandwidth.
ApplicationConnectionDropped
Application has dropped the connection to CTIManager.
TCP or TLS connection between CTIManager and Application is disconnected.
Possible causes include Application server power outage, network power outage, network configuration error, network delay, packet drops or packet corruption. It is also possible to get this error if the Unified CM node or application server is experiencing high CPU usage. Verify the application is up and running, verify network connectivity between the application server and Unified CM, and verify the CPU utilization is in the safe range for application server and Unified CM (this can be monitored using RTMT via CPU Pegging Alert).
ApplicationConnectionError
CTIManager is unable to allow connections from Applications.