Cisco TV CDS 2.1 ISA Software Configuration Guide
System Monitoring

Table Of Contents

System Monitoring

System Level Monitoring

System Health

System Snapshot

Monitoring Content Objects

Ingests

List Content

Package Expirations

Publish Failures

Monitoring Stream Objects

Stream Monitor

Stream Failures

Array Level Monitoring

Array Snapshot

Barker Stream Monitor

Server Level Monitoring

Disk Monitor

S.M.A.R.T

NIC Monitor

Cache/Fill Bandwidth

Services Monitor

Recommended Monitoring Schedule

Daily Tasks

Weekly Tasks

Monitoring Tasks for Streamers and Vaults

Monitoring Tasks for Vaults

Monitoring Tasks for Streamers

Monthly Tasks

Other Tasks


System Monitoring


The CDSM provides tools that can be used for system monitoring and system diagnostics. The topics covered in this chapter include:

System Level Monitoring

Monitoring Content Objects

Monitoring Stream Objects

Array Level Monitoring

Server Level Monitoring

Recommended Monitoring Schedule


Note If Virtual Video Infrastructure (VVI) with split-domain management is enabled, the CDSM pages associated with the Vaults and Caching Nodes display only on the VVI Manager (VVIM), and the CDSM pages associated with the Streamers display only on the Stream Manager. For more information, see the "Virtual Video Infrastructure" section on page D-5.


System Level Monitoring

The System Level Monitoring pages provide an overall view of the health and activity of the CDS. The System Level links are:

System Health

System Snapshot

To view the System Level Monitoring pages, click Monitor from any page in the CDSM, and then click System Health or System Snapshot, as appropriate.

System Health

The System Health page provides a top-level view of the overall health of each server in the CDS. Whenever an event raises an alarm, an alarm icon is displayed in the CDSM banner. The Alarms menu is displayed when you roll your mouse over the alarm icon. See Figure 5-1. Clicking the alarmed event in the Alarms menu takes you to the CDSM page that has more information. For example, in Figure 5-1, clicking System health problems reported takes you to the System Health page.

Figure 5-1 CDSM Banner—System Health Alarm

To view the System Health page, choose Monitor > System Health. See Figure 5-2.

Figure 5-2 System Health Page

The colored boxes on the System Health Monitor page have the following meaning:

Green—All components are operating.

Yellow—Some components are not operational.

Red—All components have failed.
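
The color legend above can be expressed as a small decision rule. The following Python sketch is illustrative only and is not CDSM code; the function name health_color and the list-of-booleans input are assumptions made for the example.

```python
def health_color(component_ok):
    """Return the box color for one monitored area of a server.

    component_ok is a hypothetical list of booleans, one per component
    (True means the component is operating). The mapping follows the
    color legend in the text.
    """
    if not component_ok:
        raise ValueError("at least one component is required")
    if all(component_ok):
        return "green"   # all components are operating
    if any(component_ok):
        return "yellow"  # some components are not operational
    return "red"         # all components have failed


print(health_color([True, True, False]))
```

Under these assumptions, a server area with two working components and one failed component would be shown in yellow.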

You can view the details of a monitored area of a server by clicking the box in the appropriate column.

When you click the Network check box you are taken to the NIC Monitor page. See the "NIC Monitor" section for more information.

When you click the Disk check box you are taken to the Disk Monitor page. See the "Disk Monitor" section for more information.

When you click the Services check box you are taken to the Services Monitor page. See the "Services Monitor" section for more information.

The time shown at the bottom of the left-panel menu is not the current time, but rather the CDSM time that is used for the health status and monitoring the system.

System Snapshot

The System Snapshot page provides an overview of the current activity on the CDS. A summary of the state of all streams, content ingests, and disk usage is displayed. See Figure 5-3.

Figure 5-3 System Snapshot Page

Table 5-1 describes the information displayed on the System Snapshot page.

Table 5-1 System Snapshot Page 

Field
Description

Data Refresh Rate

How often the information is refreshed. The default is 10 seconds. The range is 10 to 300 seconds. All field values that are updated, based on the refresh rate, are initially shown in a green font.

Total Streams

Total number of stream objects the CDS is currently streaming.

HD Streams

Total number of high-definition stream objects the CDS is currently streaming.

SD Streams

Total number of standard-definition stream objects the CDS is currently streaming.

Active Stream Bandwidth

Total bandwidth, in megabits, used for active streams.

Active Fill Bandwidth

Total bandwidth, in megabits, used for caching content among Vaults and Streamers.

Total Content

Total number of content objects on the CDS, counting those that are stored, being ingested, provisioned for ingest, or failed during ingest.

Active Ingests

Total number of content objects currently being ingested on the CDS.

Completed Ingests

Total number of content objects currently stored on the CDS.

Prov. (push) Ingests

Total number of content objects that have been requested for ingestion, but have not yet begun active ingestion.

Failed Ingests

Total number of content objects that failed to complete the ingest process.

Temp Out of Service

Total number of content objects that are in a Temp Out of Service state. The backoffice may put a content object into this state for a certain amount of time.

Total Disk

Total disk space, in bytes, on the CDS.

Disk Used

Total used disk space, in bytes, on the CDS.

Disk Available

Total available disk space, in bytes, on the CDS.


Monitoring Content Objects

The content object links on the Monitor System Level page provide information on the status of content ingests. Table 5-2 describes the different ingest states that are monitored.

Table 5-2 Monitored Ingest States

Completed Ingests

Lists content objects that have been fully ingested.

Active Ingests

Lists content objects that are in the process of being ingested.

Provisioned Ingests

Lists content objects that have been requested for ingestion, by way of the BMS administrator creating an entry, but have not yet begun active ingestion.

Unprovisioned Ingests

Lists content objects that were terminated by the BMS administrator or have been created but do not yet contain any information (in other words, they are "blank shells").

Failed Ingests

Lists content objects that failed to complete the ingest process. All failed ingests are reported back to the OpenStream system.

Package Expiration

Lists Package Expiration information, including expiration dates and all associated metadata. Allows for package expiration adjustments.

Publish Failures

Lists the packages that were not able to be published to the backoffice and provides a mechanism to republish the package.


In a VVI with split-domain management, the Stream Manager displays the following completed ingest details: Content Name, File Size, Rate, Create Time, and Last Modified. For the other completed ingest fields, see the same content asset on the VVIM.

Ingests

To view the details of content object ingests, do the following:


Step 1 Choose Monitor > System Level from any page in the CDSM, and then click the link for the type of content object you want to view:

Completed Ingests

Active Ingests

Provisioned Ingests

Unprovisioned Ingests

Failed Ingests

Step 2 Select a content object from the drop-down list and click Display. The details of the content object are displayed. Figure 5-4 shows an example of the ingest details.

By typing the first character of the content object name, you can jump to that section of the list.

In addition, you can perform a text string search by typing the text string you want to search for in the Search Ingests field and clicking Search. A list of content objects that contain the text string is displayed. To see the content object details, click the content object name in the list. To return to the previous page without selecting a content object, click Back.

Figure 5-4 Completed Ingests Page

Table 5-3 describes the content object details that are displayed for each type of ingest.

Table 5-3 Content Object Details 

Field
Description

Content Name

The name of the content object.

Factory ID

The factory responsible for this content object.

Ingest IP

The IP address of the Vault's ingest interface used to download the content.

File Size

The file size, in bytes, of this content object.

Rate

The rate of ingest in bits per second (3750000 = 3.75 Mbps).

Create Time

The time and date this content object was created.

Last Modified

The time and date this content object was last modified.

Op State

The operational state of this content object. The possible operational states are:

Created—Content is loading.

In Service—Content is available for streaming.

Out of Service—Content is not available for streaming.

Admin State

The administrative state of this content object. The possible administrative states are:

Unprovisioned—Content is loading.

In Service—Content is available for streaming.

Out of Service—Content is not available for streaming.

Push Provision

The type of FTP provisioned. The provision types are:

FTP pull

FTP push

Content Copies

These fields display the following information about the copies of the content:

Speed/Direction—The trick-mode speed and direction (fast forward or rewind).

Server ID—The server ID where the copy is stored.

Status—The status of the stored content.

Trick Speeds [1-8]

Trick speeds 1 through 8 show the trick-mode speeds for this content object.

Full Content ID

The full identification of this content object.

Ior [1]

The Interoperable Object Reference (IOR) for this content object.

URL [1]

The Uniform Resource Locator (URL) of the content, which consists of the following:

Protocol used (for example, FTP)

Username and password (for example, videolan:mpeg4ftp)

IP address of the content provider's server (for example, 192.168.100.184)

Directory where the content is stored on the provider's server (for example, videolan)

Name of the file (for example, long_encore_3.75.mpg)

Asset Ior [1]

The Asset IOR associated with this content object.

1 Not displayed in Figure 5-4.
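
The URL components and the Rate value described in Table 5-3 can be illustrated with a short Python sketch. The URL string below is hypothetical: it is assembled from the example values in the table, and the exact URL layout shown by the CDSM is an assumption. The helper names split_content_url and bps_to_mbps are also invented for this example.

```python
from urllib.parse import urlsplit
import posixpath

# Hypothetical URL built from the example values in Table 5-3.
EXAMPLE_URL = "ftp://videolan:mpeg4ftp@192.168.100.184/videolan/long_encore_3.75.mpg"


def split_content_url(url):
    """Split a content URL into the components listed for the URL field."""
    parts = urlsplit(url)
    directory, filename = posixpath.split(parts.path)
    return {
        "protocol": parts.scheme,
        "username": parts.username,
        "password": parts.password,
        "host": parts.hostname,
        "directory": directory.lstrip("/"),
        "filename": filename,
    }


def bps_to_mbps(rate_bps):
    """Convert the Rate field (bits per second) to megabits per second."""
    return rate_bps / 1_000_000


print(split_content_url(EXAMPLE_URL))
print(bps_to_mbps(3750000))
```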



List Content

The List Content option on the Completed Ingests page lists all content objects that meet the criteria (Only Damaged or List All) and that are associated with each content.

To list all associated content objects, do the following:


Step 1 Choose Monitor > System Level > Completed Ingests. The Completed Ingest page is displayed.

Step 2 In the List Content area, choose Only Damaged to list only the damaged content objects, or choose List All to list all content objects, and click List. The Content List is displayed (Figure 5-5).

Figure 5-5 Completed Ingests—Content List

Table 5-4 describes the information displayed in the Content List.

Table 5-4 Content List Fields

Field
Description

Content Name

The name of the content.

Duration

The duration of the content.

GOID

The Global Object ID for the content object associated with the content.

Version

The trick-play speed of the content object. The value IGate refers to an index file, which allows jumps between trick speeds. The value redo refers to an undo file: if any changes are made to the content during the ingest, the changes are recorded in the redo file. If the content is later sent by using FTP Out, the changes are undone and the original file is sent.

Server ID

The server ID of the Vault that is storing the content object.

Status

The status of the storing process of the content object, either complete or partial.



Package Expirations


Note Package Expirations are part of the optional Ingest Manager feature. This option is only listed on the Monitoring System Level left-panel menu if the Ingest Manager is included in your deployment.


To view the details of a package expiration or adjust the license expiration of a package, do the following:


Step 1 Choose Monitor > Package Expiration. The Package Expiration page is displayed.

Step 2 From the Available Packages drop-down list, choose a package and click Display. The Package Expiration details are displayed.

By typing the first character of the package name, you can jump to that section of the list.

Figure 5-6 shows an example of the Package Expiration details.

Figure 5-6 Package Expiration Page

Table 5-5 describes the package expiration details that are displayed.

Table 5-5 Package Expiration Details 

Field
Description

Package Name

The name of the package.

License Expiration

The date the package will expire.

Additional Package Window

The additional time added to the package.

Actual Package Expiration

The Actual Package Expiration is the License Expiration plus the Additional Package Window.


Step 3 To adjust the license expiration, enter the number of days (positive or negative) in the Adjust License Expiration field and click Update.

The license expiration is adjusted by the number of days you entered. The Additional Package Window is not affected and is still applied to create the Actual Package Expiration.

Step 4 In the Asset Details section, to view the metadata associated with the package expiration, click the plus sign (+) next to the metadata you want to view.


To delete a package, choose the package from the Available Packages drop-down list, click Display, and then click Delete in the Package Expiration Details section.
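
The relationship between the expiration fields in Table 5-5 and the adjustment in Step 3 is simple date arithmetic. The following Python sketch is illustrative only. It assumes that the Additional Package Window is expressed in whole days, which the table does not state, and the function name and parameters are hypothetical.

```python
from datetime import date, timedelta


def actual_package_expiration(license_expiration, additional_window_days, adjust_days=0):
    """Return the Actual Package Expiration date.

    license_expiration: current License Expiration date.
    additional_window_days: Additional Package Window (assumed to be in days).
    adjust_days: value entered in Adjust License Expiration; may be negative.
    """
    # Step 3: the license expiration moves by the number of days entered.
    adjusted_license = license_expiration + timedelta(days=adjust_days)
    # The Additional Package Window is unaffected and is still added on top.
    return adjusted_license + timedelta(days=additional_window_days)


print(actual_package_expiration(date(2024, 1, 10), 5))
print(actual_package_expiration(date(2024, 1, 10), 5, adjust_days=-3))
```

In this sketch, a negative adjustment shortens the license while the additional window is still added afterward.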

Publish Failures


Note Publish Failures are part of the optional Ingest Manager feature. This option is only listed on the Monitoring System Level left-panel menu if the Ingest Manager is included in your deployment.


The Publish Failures page lists the packages that were not able to be published to the backoffice and provides a mechanism to republish the package.

To publish an unpublished package, or delete an unpublished package, do the following:


Step 1 Choose Monitor > Publish Failures. The Publish Failures page is displayed.

Step 2 From the Unpublished Packages drop-down list, choose a package and click Display. The Unpublished Package details are displayed.

By typing the first character of the package name, you can jump to that section of the list.

Figure 5-7 shows an example of the Publish Failures details.

Figure 5-7 Publish Failures Page

Table 5-6 describes the publish failures details that are displayed.

Table 5-6 Publish Failures Details 

Field
Description

Package Name

The name of the package.

Source URL

The location of the original package information.

Target URL

The location where the package information is to be placed.

Module Type

An internal identifier for what failed. In the example in Figure 5-7, the publishing of the package failed.

Creation Date

The date the database record was created for this failure.


Step 3 In the Assets section, to view the metadata associated with the unpublished package, click the plus sign (+) next to the metadata you want to view.

Step 4 To restart the publishing process and set the creation date to today, click Publish.

To delete an unpublished package, click Delete.


Monitoring Stream Objects

The monitored stream objects consist of:

Stream Monitor

Stream Failures

Stream Monitor

To view the details of stream objects, do the following:


Step 1 Choose Monitor > Stream Monitor. The Stream Monitor page is displayed.

Step 2 From the Browse By Session ID drop-down list, choose the Session ID of the stream object and click Display. The stream object details are displayed.

If you do not know the session ID, you can perform a text string search by selecting the field you want to search on, entering the text string in the Search String field, and clicking Search. A list of stream objects that contain the text string in the selected field is displayed. Click the linked field (Session Handle in Figure 5-8) to see the stream monitor details for the selected object, or click Back to return to the previous page.

Figure 5-8 Stream Search Results

Figure 5-9 shows an example of the stream object details.

Figure 5-9 Stream Monitor Page

Table 5-7 describes the stream details.

Table 5-7 Stream Details 

Field
Description

Stream ID

The internal unique identifier assigned to the stream session by the Streamer.

Content ID

The content identifier of the stream object.

LSCP IP

The source IP address of the LSCP transaction (set-top box).

Session ID

The session ID of the stream object.

Create Time

The date and time the stream object was created.

Last Modified

The date and time the stream object was last modified.

TSID out

This field is only applicable when Streaming Mode is set to ASI (0).

The output transport stream identification on the associated MQAM device.

Service Group

The service group that the stream object is transmitting on.

QAM IP

The IP address of the QAM device delivering this stream object.

QAM Port

The port the QAM device is using to receive the stream object.

Program Number

This field is only applicable when Streaming Mode is set to ASI.

The numerical MPEG program number for this stream object.

Bandwidth Used

The transport stream bandwidth, in bytes, required for this stream object.

Op State

Operational State indicates the state of the object. The possible states are:

InService—Stream object is functioning.

OutOfService—Occurs when the entity using the object wants to temporarily stop the object.

Created—Stream object not yet provisioned.

Destroyed—Stream object is destroyed.

Admin State

The administrative state of this stream object. The possible administrative states are:

Unprovisioned—Stream is loading.

InService—Stream is available for streaming.

OutOfService—Stream is not available for streaming.

Stream State

Stream state originates from the LSCP server and has the following modes:

Open—The server is not transporting a media stream.

Pause—The server is not transporting a media stream.

Search Transport—The server is searching for start normal play time (NPT). When at start NPT, it enters Transport mode.

Transport—The server is transporting the media stream and will pause at the end of the stream. If scale is positive, indicating a forward direction, end of stream is the end of media. If scale is negative, indicating a reverse direction, end of stream is the beginning of media.

Transport Pause—The server is transporting the media stream and will pause at stop NPT.

Search Transport Pause—The server is searching for start NPT. When at start NPT, it enters transport pause mode.

Pause Search Transport—The server is transporting the media stream. It does so until stop NPT, and then transitions to search transport mode.

End of Stream—The server is not transporting a media stream.

Speed Direction

Speed direction is as follows:

Play

Not playing (Pause)

n, meaning n times fast-forward

-n, meaning n times rewind

Provision Multiple

Provision Multiple is enabled if this field is "yes" and disabled if this field is "no."
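
The Stream State modes in Table 5-7 fall into three groups: modes in which the server is transporting the media stream, modes in which it is searching for start NPT, and modes in which it is not transporting. The following Python sketch captures that grouping. It is a reading aid based only on the descriptions in the table; the enum, the set names, and the labels returned by activity() are assumptions, not LSCP or CDSM identifiers.

```python
from enum import Enum


class StreamState(Enum):
    OPEN = "Open"
    PAUSE = "Pause"
    SEARCH_TRANSPORT = "Search Transport"
    TRANSPORT = "Transport"
    TRANSPORT_PAUSE = "Transport Pause"
    SEARCH_TRANSPORT_PAUSE = "Search Transport Pause"
    PAUSE_SEARCH_TRANSPORT = "Pause Search Transport"
    END_OF_STREAM = "End of Stream"


# Modes that the table describes as transporting the media stream.
TRANSPORTING = {
    StreamState.TRANSPORT,
    StreamState.TRANSPORT_PAUSE,
    StreamState.PAUSE_SEARCH_TRANSPORT,
}

# Modes that the table describes as searching for start NPT.
SEARCHING = {
    StreamState.SEARCH_TRANSPORT,
    StreamState.SEARCH_TRANSPORT_PAUSE,
}


def activity(state):
    """Classify a stream state as 'transporting', 'searching', or 'idle'."""
    if state in TRANSPORTING:
        return "transporting"
    if state in SEARCHING:
        return "searching"
    return "idle"  # Open, Pause, End of Stream: not transporting


print(activity(StreamState("Transport Pause")))
```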


The Graph This Stream button displays the trick-mode activity of the stream (Figure 5-10).

Figure 5-10 Stream Activity Report

To delete a stream object, display the object and click Delete.


Stream Failures

To view the details of Failed Streams, do the following:


Step 1 Choose Monitor > Failed Streams. The Stream Failures page is displayed.

Each failed stream is listed by the date and time the stream failed, followed by the session ID of the failed stream.


Note Stream Failure monitoring displays only the failed streams for the current day. To view past stream failures, see the "Stream Failures" section on page 6-18.


Step 2 From the Stream Failures drop-down list, choose the timestamp and session ID of the stream object and click Display. The stream failure details are displayed. See Figure 5-11.

To delete a failed stream, display the object and click Delete.

Figure 5-11 Stream Failures Page

Table 5-8 describes the stream failure details.

Table 5-8 Stream Failure Details 

Field
Description

Session ID

The session ID of the failed stream.

Failure Date

The date and time the failure occurred.

QAM IP

The IP address of the QAM device associated with the failure.

Service Group

The service group associated with the failure.

Server ID

The server responsible for streaming this stream object. To view the IP address associated with the Server ID, see the "Configuring the Servers" section on page 4-78.

Group ID

All servers that are part of the same CDS (managed by one CDSM) have the same Group ID. This Group ID corresponds to the CDSM GUI array ID and should be unique across an enterprise. Table 5-9 describes the ID mapping between the CDSM GUI and the CServer.

Failed Operation

The operation that was taking place when the stream failed, for example, createStream, LSCP Command(), createServant, or destroy. These are the measurement points or transactional states of the system at the time of the failure.

Failed Task

A failed task is the event category that identifies the type of execution sequence the call stack was in at the time of the failure. The high-level categories are:

Tune In

Load Application

Load Catalog

Eligibility Check

Select Subscription

Purchase Subscription

Select Movie

Purchase Check

Play Movie

Movie Setup

Movie Control

Movie Confirm

Purchase Confirm

Purchase Log

Stop Movie

Movie Release

Error Code

The error code provides a description of the event that caused an error. See Table 5-10, Table 5-11, and Table 5-12 for descriptions of the error codes.


Table 5-9 lists the CDSM GUI ID names and maps them to the CServer names in the setupfile and .arroyorc files.

Table 5-9 ID Names in the CDSM GUI and CServer Files 

CDSM GUI ID Name
CServer Files ID Name

Array ID on the Array Name page

groupid

Group ID on the Server-Level pages

groupid

Stream Group ID on the Server Setup page

arrayid

Cache Group ID on the Server Setup page

arrayid

Vault Group ID on the Server Setup page

arrayid

Stream Group ID on the Configuration Generator page

arrayid
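
Because the naming in Table 5-9 is easy to misread, the mapping can also be written as a lookup table. The following Python sketch restates the table; the dictionary and function names are hypothetical and are not part of the CDSM or CServer software.

```python
# (CDSM GUI ID name, CDSM page) -> ID name used in the CServer files
# (setupfile and .arroyorc), as listed in Table 5-9.
GUI_TO_CSERVER_ID = {
    ("Array ID", "Array Name page"): "groupid",
    ("Group ID", "Server-Level pages"): "groupid",
    ("Stream Group ID", "Server Setup page"): "arrayid",
    ("Cache Group ID", "Server Setup page"): "arrayid",
    ("Vault Group ID", "Server Setup page"): "arrayid",
    ("Stream Group ID", "Configuration Generator page"): "arrayid",
}


def cserver_id_name(gui_name, page):
    """Return the CServer file ID name for a CDSM GUI ID name on a page."""
    return GUI_TO_CSERVER_ID[(gui_name, page)]


print(cserver_id_name("Array ID", "Array Name page"))
```

Note the crossover that the table records: the GUI Array ID maps to groupid in the CServer files, while the Stream, Cache, and Vault Group IDs map to arrayid.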


Table 5-10 lists the Managed Services Architecture (MSA) error codes. Some MSA monitored events are monitored for the CDS as well, and are prefaced by "AVS_" instead of "MSA_." They are denoted with a footnote in the table. Some MSA monitored events are not errors, but rather information about an event.

Table 5-10 MSA Error Codes 

Error Code
Description

MSA_INT_ERR [1]

There is an internal error. INT_ERR has a subset of error codes that specifically describe where the error occurred. See Table 5-11.

MSA_FLOW [1]

An entry or exit of a measured or tracked flow, or some other important checkpoint, that is recorded as non-real-time.

MSA_CMPT_NOT_EXIST [1]

A component does not exist.

MSA_REQ_TIMEOUT [1]

A client timed out waiting for a response to a request.

MSA_CMPT_OUT_OF_SVC [1]

A component is unavailable.

MSA_REQ_NOT_IMPL [1]

The requested item is not implemented.

MSA_RES_INVALID [1]

A resource is invalid.

MSA_RES_DUP [1]

A duplicate resource is being added to the session.

MSA_CMPT_DUP

It was determined that a component that was being added has the same name as a previously created component.

MSA_REQ_CREATE

An attempt to create a request failed.

MSA_UNKNOWN

An unclassified or undetermined error occurred.

MSA_REAP

A component is destroyed outside of the normal expected flows.

MSA_VS_LSC_TIMEOUT

The video server timed out waiting for the client to issue a resume or play command after the initial creation of the stream.

MSA_SVC_GROUP_MISSING

A request contains a missing service group.

MSA_RES_NO_CAPACITY

A resource is currently out of capacity and cannot satisfy the request.

MSA_RES_NO_BANDWIDTH

A resource does not have the bandwidth to deliver the stream.

MSA_REQ_FAIL

A request failed.

MSA_RES_UNAVAIL

A response is not available.

MSA_FLOW_RT

An entry or exit of a measured or tracked flow that is recorded in real time.

MSA_LSC_SERVER_FAILURE

LSC response; server failed.

MSA_LSC_NO_MEMORY

LSC response; dynamic memory allocation failure.

MSA_LSC_IMPL_LIMIT

LSC response; implementation limit exceeded.

MSA_LSC_NO_RESOURCES

LSC response; no resources.

MSA_LSC_SERVER_ERROR

LSC response; server error.

MSA_LSC_MPEG_DELIVERY

LSC response; unable to deliver MPEG stream.

MSA_LSC_ERR

A generic DSM-CC error event.

MSA_LSC_BAD_REQUEST

LSC response; invalid request.

MSA_LSC_BAD_STREAM

LSC response; invalid stream handle.

MSA_LSC_WRONG_STATE

LSC response; wrong state.

MSA_LSC_UNKNOWN

LSC response; unknown error.

MSA_LSC_NO_PERMISSION

LSC response; client does not have permission for the request.

MSA_LSC_BAD_PARAM

LSC response; invalid parameter.

MSA_LSC_NO_IMPL

LSC response; not implemented.

MSA_LSC_TRANSIENT

LSC response; transient error.

MSA_LSC_BAD_SCALE

LSC response; incorrect scale value.

MSA_LSC_BAD_START

LSC response; stream start time does not exist.

MSA_LSC_BAD_STOP

LSC response; stream stop time does not exist.

1 This event is monitored by the CDS as well as MSA, and is displayed with the prefix "AVS_" instead of "MSA_."
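
The prefix rule described by footnote 1 can be sketched as a small helper. The following Python example is illustrative only; the set CDS_MONITORED lists the events that Table 5-10 marks with footnote 1, and the function name cds_event_name is hypothetical.

```python
# Events marked with footnote 1 in Table 5-10 (also monitored by the CDS),
# written without the "MSA_" prefix.
CDS_MONITORED = {
    "INT_ERR",
    "FLOW",
    "CMPT_NOT_EXIST",
    "REQ_TIMEOUT",
    "CMPT_OUT_OF_SVC",
    "REQ_NOT_IMPL",
    "RES_INVALID",
    "RES_DUP",
}


def cds_event_name(msa_code):
    """Return the "AVS_" name for an MSA code, or None if the table
    does not mark the event as monitored by the CDS."""
    prefix = "MSA_"
    if not msa_code.startswith(prefix):
        raise ValueError("expected a code that starts with MSA_")
    suffix = msa_code[len(prefix):]
    return "AVS_" + suffix if suffix in CDS_MONITORED else None


print(cds_event_name("MSA_INT_ERR"))
```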


Table 5-11 lists the error codes for internal errors and external errors. Internal errors are errors that occurred in the CDS and specifically describe where the error occurred. External errors are errors that occurred in the network or in network components, such as the ContentStore and StreamService. The error codes listed in Table 5-11 provide more detail for the MSA_INT_ERR or AVS_INT_ERR error code.

Table 5-11 INT_ERR Error Codes 

Error Code
Description

INGEST_THREADS_NOT_RUNNING

Cache server threads are not running.

INGEST_NIC_DOWN

Ingest interface is disabled.

INGEST_DATA_BLOCKAGE

Ingest data read is backlogged, causing data socket blockage.

BAD_CONTENT

Content data is not recoverable.

NOT_ENOUGH_NIC_BANDWIDTH

Not enough bandwidth left over on NICs to perform the operation.

NOT_ENOUGH_SYSTEM_RESOURCES

Not enough system resources left to perform the operation.

NOT_ENOUGH_DISK_SPACE_AVAILABLE

Not enough disk space available.

STREAMER_MAX_SLOTS_LIMIT_EXCEEDED

No stream slot available to allocate the stream.

REMOTE_VAULT_DOWN

The remote Vault is not responding.

REMOTE_STREAMER_DOWN

The remote Streamer is down.

VAULT_DISK_BAD

The disk is bad on a Vault.

STREAMER_DISK_BAD

The disk is bad on a Streamer.

CONTENT_LOCATE_FAILED

Cannot locate the content on any Vault.

CONTENT_FILL_FAILED

Cannot push content to the Streamer from Vault.

NOT_ENOUGH_FILL_BANDWIDTH

Not enough fill bandwidth available.

FILL_LINKS_DOWN

The fill links are down.

STREAMING_LINKS_DOWN

The stream links are down.

VAULT_MIRRORING_SITE_DOWN

The mirroring site of the Vault array is down.

SET_CONTENT_BUNDLE_FAILED

Set content bundle descriptor array failed.

SET_DESTINATION_FAILED

Set destination of stream failed.

DESTROY_STREAM_FAILED

Destroy stream failed.

PLAY_STREAM_FAILED

Play stream failed.

FILLCB_FAILED

Fill CB failed.

WAIT_FOR_FTP_DATA_DONE_FAILED

Wait for FTP data done failed.

GET_CURRENT_NPT_FAILED

Get current NPT for LSCP status failed.

CAN_NOT_CONNECT_TO_NAME_SERVICE

Cisco ISA cannot connect to the BMS Naming Server.

CAN_NOT_CONNECT_TO_NOTIFY_SERVICE

Cisco ISA cannot connect to the Notify Server.

CAN_NOT_CREATE_EVENT_CHANNELS

Cisco ISA cannot create event channels.

NO_CONTENT_EVENT_CHANNEL_FOUND

Naming server does not have content event channel.

NO_STREAM_EVENT_CHANNEL_FOUND

Naming server does not have stream event channel.

EVENT_CHANNEL_OBJECT_NOT_EXISTS

Event channel object does not exist in Notify Server.

CORBA_CONNECTION_FAILED

CORBA system exception while connecting to another entity.

CORBA_BROKEN_PIPE

CORBA system exception caused by a broken pipe with another entity.

CORBA_CONTENT_STORE_BIND_FAILED

CORBA bind exception while starting Cisco ContentStoreFactory.

CORBA_STREAM_SERVICE_BIND_FAILED

CORBA bind exception while starting Cisco StreamService.

CORBA_SYSTEM_ERROR

CORBA system exception while connecting to the servant.

CORBA_TRANSIENT_ERROR

CORBA system exception with object being transient.

CORBA_TIMEOUT_ERROR

CORBA timeout exception.

CORBA_IOR_NIL

The orb object is nil.

CORBA_IOR_NIL_AFTER_NARROW

The orb object is nil after narrow.

SERVICE_GROUP_NOT_SUPPORTED

Service group is not supported.

REMOTE_CONTENT_STORE_FACTORY_DOWN

Remote Cisco ContentStoreFactory is down.

VAULT_HAS_FULL_LOAD

Vault is running with full load.

FTP_CONNECTION_FAILED

Connection to FTP server failed.

FTP_SERVER_BIND_FAILED

FTP server cannot bind to the port.

FTP_PUSH_TIMEOUT

FTP push timeout (PASV is not served fast enough).

FTP_QUIT_RECEIVED_DURING_INGEST

FTP server received QUIT request.

NO_LSCP_SET_TOP_CONNECTION

LSCP server to set-top box connection is down.

LSCP_SERVER_BIND_FAILD

LSCP server cannot bind to the running port.

LSCP_PROXY_BIND_FAILED

LSCP proxy cannot bind to the running port.

STREAMER_GROUP_MAX_LIMIT_EXCEEDED

The stream count is exceeding the limit for the Streamer group.

STREAMER_MAX_LIMIT_EXCEEDED

The stream count is exceeding the limit for a Streamer.

REMOTE_STREAMER_NOT_RESPONDING

The remote Streamer is down.

NOT_ENOUGH_MQAM_BANDWIDTH

Not enough MQAM bandwidth.

NO_QAM_FOR_SERVER_ID

Server is not connected to any QAM.

NOT_ENOUGH_QAM_BANDWIDTH

Not enough QAM bandwidth.

STREAMER_IS_NOT_IN_THE_SERVICE_GROUP

Streamer is not in the service group.

STREAMER_HAS_FULL_LOAD

Streamer is running with full load.

STREAMER_IS_NOT_CONNECTED_THAT_QAM

Server is not connected to the QAM.

INVALID_SERVICE_GROUP

The service group is not returned by session gateway.

CONTENT_CAN_NOT_BE_LOCATED

Content is not found in the related content stores.

CONTENT_OBJECT_NOT_YET_PROVISIONED

The content object is not yet provisioned.

STREAM_OBJECT_NOT_YET_PROVISIONED

The stream object is not yet provisioned.

STREAM_OBJECT_IS_OUT_OF_SERVICE

The stream object is out of service.

STREAM_OBJECT_IS_ALREADY_PROVISIONED

The stream object is already in service.

CONTENT_OBJECT_IS_ALREADY_PROVISIONED

The content object is already in service.

STREAM_SERVANT_OBJECT_NOT_EXIST

The remote streamer does not have a servant for stream object.

NO_DESTINATION_QAM_IP_FOUND

No QAM IP Address is received for the stream destination.

NO_DESTINATION_QAM_PORT_FOUND

No QAM port is received for the stream destination.

FAILED_TO_SET_STREAM_DESTINATION

Some error occurred while setting the stream destination.

UNABLE_TO_ACCEPT_CONNECTION

Cannot accept more TCP connections.

UNABLE_TO_REGISTER_EVENT_HANDLER

Cannot register event handle to serve the TCP connection.

CAN_NOT_LOCATE_QAM_IP_FOR_TSID_IN

Cannot locate the QAM IP addresses associated with TSID IN.

NS_LOG_MONITOR_ERROR

The ns_log file is not updating.

AVS_ISA_GENERIC_ERROR

Some unknown error occurred during execution of the operation.

CACHE2APP_INITIALIZE_ERROR

Failed to initialize Cache2App library.

FILLCB_FAILED

FillCB failed during content ingest.

DESTROYCB_FAILED

Failed while destroying the content bundle descriptor.

WAIT_FOR_FTP_DATA_DONE_FAILED

API wait for FTP data done has returned an error.

SET_CONTENT_BUNDLE_DESCRIPTOR_FAILED

Failed while setting content bundle descriptor for a stream.

SET_DESTINATION_FAILED

Failed while setting destination of the stream.

SET_ENCRYPTION_KEY_FAILED

Failed while setting ECM keys for the stream.

CREATE_STREAM_FAILED

AVS cache server cannot allocate the stream handle.

DESTROY_STREAM_FAILED

AVS cache server cannot tear down the stream.

DESTROY_REMOTE_STREAM_FAILED

AVS cache server cannot tear down stream on remote server.

PLAY_STREAM_FAILED

AVS cache server cannot play the stream.

DATABASE_DOWN

Database is down.

DATABASE_SYNCHRONIZING_REPLICATION_Q

Database is synchronizing with replication queue.

DATA_IS_NOT_IN_SYNC

Database is not in sync with master.

DATABASE_RETURNED_ERROR

The database has returned an error, possibly because no record was found.

DATABASE_RECORD_NOT_FOUND

The record is not found in the database.

DATABASE_CAN_NOT_INSERT_RECORD

The record cannot be inserted into the database.

DATABASE_CAN_NOT_DELETE_RECORD

The record cannot be deleted from the database.

DATABASE_CAN_NOT_UPDATE_RECORD

The record cannot be updated.

DATABASE_QUERY_SEND_ERROR

Failed to make a query to the database.


Table 5-12 lists the Managed Services Architecture (MSA) error codes for the optional Ingest Manager feature.

Table 5-12 MSA Error Codes for the Optional Ingest Manager Feature

Error Code
Description

MSA_BAD_XML

There was an XML parsing error. Check the ADI XML for errors.

MSA_BAD_REQUEST

The request for content was bad. Check the target backoffice URL.

MSA_UNKNOWN_HOST

The host is unknown. Check the target backoffice URL.

MSA_CONNECTION_DROP

The connection was dropped. Check the URL. The Ingest Manager possibly misformatted the ADI XML.

MSA_BACKOFFICE_TIMEOUT

The backoffice did not respond within the allowed time interval.

MSA_UNKNOWN

An unknown error occurred. Check the /home/isa/bss/log/aim.log.

MSA_FAILED_POST

The Ingest Manager failed to post the ADI to the backoffice.

MSA_PKG_EXPIRED

The package has expired and the retry record will be removed.
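
When triaging Ingest Manager failures, the first check suggested by Table 5-12 can be looked up by error code. The following is a minimal sketch rather than part of the CDS software; msa_hint is a hypothetical helper name, and the hints are taken only from the descriptions in Table 5-12.

```shell
# msa_hint CODE
# Print the first check that Table 5-12 suggests for an MSA error code.
# msa_hint is a hypothetical helper, not a CDS command.
msa_hint() {
    case "$1" in
        MSA_BAD_XML)
            echo "Check the ADI XML for errors." ;;
        MSA_BAD_REQUEST|MSA_UNKNOWN_HOST)
            echo "Check the target backoffice URL." ;;
        MSA_CONNECTION_DROP)
            echo "Check the URL; the ADI XML may be misformatted." ;;
        MSA_UNKNOWN)
            echo "Check /home/isa/bss/log/aim.log." ;;
        MSA_BACKOFFICE_TIMEOUT|MSA_FAILED_POST|MSA_PKG_EXPIRED)
            echo "No specific check is listed in Table 5-12." ;;
        *)
            # Anything else is not an MSA code from the table.
            echo "Not an MSA code from Table 5-12."
            return 1 ;;
    esac
}
```

For example, msa_hint MSA_UNKNOWN_HOST prints the suggestion to check the target backoffice URL.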



Array Level Monitoring

The Array Level Monitoring pages provide an overall view of the health and activity of a specified array, and let you monitor and delete barker streams. The Array Level links are:

Array Snapshot

Barker Stream Monitor

Array Snapshot

The Array Snapshot page provides an overview of the current activity for the specified array of servers on the CDS. A summary of the state of all streams, content ingests, and disk usage is displayed.

The fields displayed on the Array Snapshot page are the same fields that are displayed on the System Snapshot page, with the active stream bandwidth and active fill bandwidth shown for each Stream Group. For descriptions of the fields, see Table 5-1.

Barker Stream Monitor


Note The Barker Stream feature is optional and is not listed on the Array Level left-panel menu if it is not included in your deployment. The Barker Stream feature is also not available if the Stream Destination is set to IPTV. For more information, see the "Stream Destination" section on page D-4.


The Barker Stream Monitor page lists the barker streams currently configured. Figure 5-12 shows an example of barker streams for ASI streaming.

To delete a barker stream configuration, click the Delete button next to the associated barker stream.

Figure 5-12 Barker Stream Monitor Page

Server Level Monitoring

The Server Level Monitoring pages provide detailed information on the health and activity of a Vault or Streamer server.

To view the Server Level Monitoring pages, do the following:


Step 1 Choose Monitor > Server Level, and then click one of the following as applicable:

Disk Monitor

NIC Monitor

Cache/Fill Bandwidth

Services Monitor

Step 2 Select a server's IP address from the drop-down list and click Display.


Disk Monitor

The Disk Monitor page provides real-time information on the status of a disk.

To view the current status of a disk, select a server's IP address from the drop-down list on the Disk Monitor page, click Display, and roll your mouse over one of the disks displayed in the graphic. If the server is a Lindenhurst CDE, click one of the disks displayed in the graphic. Figure 5-13 shows an example of a Streamer server.

Figure 5-13 Disk Monitor—Streamer Page

For Vault servers, the Disk Availability line graph shows the percentage of disk space available. The gigabytes displayed for "Total Space," "Available Space," and "% Used" are the sum of all the disks installed on the Vault server.

The Linux File System Stats table shows the combined total storage space for the partitions of the disk drives, the combined available storage space for the partitions of the disk drives, and the percentage of used storage for each combined partition. In Figure 5-13, the hda2 partition has an alarm indicator because the usage has exceeded the user-defined threshold of 40 percent. For information on setting thresholds, see the "Setting System Thresholds" section on page 7-10.

Table 5-13 describes the information displayed when a disk is selected.

Table 5-13 Disk Status Fields 

Field
Description

Current Temp

The current temperature of the hard disk.

Smart Status

The Self-Monitoring, Analysis and Reporting Technology (S.M.A.R.T.) status of a disk as determined by the manufacturer in accordance with the relevant ATA/SCSI standards. S.M.A.R.T. is logic embedded in the firmware that determines when a disk is going bad.

Number of Reads

Number of bytes read from the disk since it was powered on.

Number of Writes

Number of bytes written to the disk since it was powered on.


S.M.A.R.T

The CDS incorporates S.M.A.R.T. to monitor the reliability of a hard drive, predict drive failures, and carry out different types of drive self-tests. S.M.A.R.T. is firmware, native to most disk drives, that monitors disk attributes over time, making predictive failure analysis possible. Advance warning of predicted failures allows the operator to perform preventive maintenance.

To view the current read/write activity that has occurred in the last five seconds on the selected disk, click Graph Disks. Figure 5-14 shows an example of the Disk Activity graph.

Figure 5-14 Disk Monitor—Vault Disk Activity Graph

The Disk Activity graph displays the average rate of data reads and data writes (in megabytes per second) over a five-second period.
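
As a rough illustration of what a five-second average means, the rate can be derived from two samples of a cumulative byte counter, such as the Number of Reads value in Table 5-13. This is a sketch under stated assumptions, not necessarily how the CDSM computes the graph: avg_mb_per_sec is a hypothetical helper name, and one megabyte is assumed to be 1,048,576 bytes.

```shell
# avg_mb_per_sec START_BYTES END_BYTES SECONDS
# Average throughput between two samples of a cumulative byte counter.
# Assumption: 1 MB = 1048576 bytes. Hypothetical helper, not a CDS command.
avg_mb_per_sec() {
    awk -v start="$1" -v end="$2" -v secs="$3" 'BEGIN {
        if (secs <= 0) exit 1                      # avoid dividing by zero
        printf "%.2f\n", (end - start) / secs / 1048576
    }'
}
```

For example, a counter that grows by 52,428,800 bytes between two samples taken five seconds apart corresponds to an average of 10.00 MB per second.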

NIC Monitor

The NIC Monitor page displays the status of each network interface card (NIC) on the server.

To view the current status of a NIC on a server, select a server's IP address from the drop-down list on the NIC Monitor page, click Display, and click one of the NIC ports displayed in the graphic. Figure 5-15 shows an example of the eth4 interface on a Vault server.

Figure 5-15 NIC Monitor—Vault Page

Table 5-14 describes the information displayed for a NIC port.

Table 5-14 NIC Port Status Fields 

Field
Description

Port Speed

The speed of the interface in megabits per second (Mbps).

Admin State

The administrative state of the port interface. The administrative state is determined at the time the server is booted. The possible administrative states are up or down.

Op State

The operational state of the port interface. The operational state is either up or down. If the port is not connected to the network or is malfunctioning, the operational state displayed is down.

Media Type

The physical conduit of the interface. The physical type is either copper or fiber optic.

Poll Interval

The number of seconds between each poll of the interface.

Transmit

The total number of bytes transmitted since this port has been operational and configured as administratively up.

Received

The total number of bytes received since this port has been operational and configured as administratively up.


To view the average transmit and receive activity that has occurred in the last two seconds for each port on this server, click Graph Ports. Figure 5-16 shows an example of the Port Activity graph.

Figure 5-16 NIC Monitor—Vault Port Activity Graph

Cache/Fill Bandwidth

The Cache/Fill Bandwidth page displays details on the content caching activity on a Streamer.

To view the caching activity on a server, select a server's IP address from the drop-down list on the Cache/Fill Bandwidth page and click Display. Figure 5-17 shows an example.

Figure 5-17 Cache/Fill Bandwidth Page

Table 5-15 describes the fields displayed on the Cache/Fill Bandwidth page.

Table 5-15 Cache/Fill Bandwidth Fields 

Field
Description

Data Refresh Rate

How often the information is refreshed. The default is 10 seconds. The range is 10 to 300 seconds. All field values that are updated, based on the refresh rate, are initially shown in a green-colored font.

Active Stream Count

The number of active streams on this Streamer.

Active Stream Bandwidth

The bandwidth (in Mbps) used for streaming on this Streamer.

Unique Stream Count

The number of unique streams on this Streamer.

Unique Stream Bandwidth

The bandwidth (in Mbps) used for serving unique streams on this Streamer.

Fill Receive Stream Count

The number of streams on this Streamer that are retrieving content from the Vault in order to fulfill requests for content.

Actual Fill Stream Bandwidth

The bandwidth (in Mbps) used on this Streamer for retrieving content from the Vault.

Disk Read Stream Count

The number of streams on this Streamer sending content that was retrieved from the Streamer's disks.

Disk Read Bandwidth

The bandwidth (in Mbps) used on this Streamer for retrieving locally stored content (content on the Streamer's disks).


Services Monitor

The Services Monitor page displays whether specific processes are running on a server.

To view the current status of the services running on a server, select a server's IP address from the drop-down list on the Services Monitor page and click Display. Figure 5-18 shows an example of a Streamer server.

Figure 5-18 Services Monitor Page—Streamer

Table 5-16 describes the services listed on the Services Monitor page.

Table 5-16 CDS Services 

Service
Server
Description

Cisco Cache Server

Both

The Cache server runs on all servers. The Cache server is responsible for the core functions of the CDS.

Cisco Content Store Master

Vault

The Content Store Master process will be running if you are looking at a master Vault server. The Content Store Master serves as the master Vault process for accepting inbound OpenStream connections.

Cisco Content Store Slave

Vault

The Content Store Slave process will be running if you are looking at either a master or a slave Vault server. The Content Store Slave handles requests proxied by the Content Store Master.

Cisco Stream Service Master

Streamer

The Stream Service Master process will be running if you are looking at a master Streamer server. The Stream Service Master serves as the master Streamer process for accepting inbound OpenStream connections.

Cisco Primary Setup Service

Streamer

The Primary Setup Service will be running if you are looking at the Streamer server designated as the primary Setup server. The Setup server handles setting up stream sessions.

Cisco Stream Control Service

Streamer

The Stream Control Service accepts set-top box play stream commands, for example, LSCP.

Cisco Resource Manager

Streamer

The Resource Manager runs on a Streamer server. The Resource Manager handles orphaned streams.

Cisco DB Server

Both

The DB (database) server runs on all servers and is responsible for keeping track of all data objects in the CDS.

DB Synchronization Status

Both

Displays the status of the database synchronization among all servers. The states are "OK" and "not OK."

Cisco SNMP Server

Both

The SNMP server shows as running when the SNMP agent is running.

Cisco System Manager

Both

The System Manager runs on each server and facilitates communication with the CDSM.

Cisco Ingest Manager

Vault

The Ingest Manager process will be running if you are looking at a master Vault server and the optional Ingest Manager feature is part of your deployment.


Recommended Monitoring Schedule

This monitoring schedule is recommended to ensure that the CDS is functioning as expected and to identify potential issues that may cause downtime.


Caution Do not attempt to access the Linux command line unless you are familiar with the CDS, the Linux operating system, and the Linux command line.


Note Some error warnings in the logs are only informational and no action is necessary.


Daily Tasks

The following tasks should be performed daily:

Choose Monitor > System Level > System Health and check the System Health Monitor page for red or yellow states on any of the servers. Click any red or yellow boxes to see detailed information on disk, NIC, or services. See the "System Health" section for more information.

Choose Monitor > System Level > Failed Ingests to check for any failed ingests. See the "Ingests" section for more information.

Choose Monitor > System Level > Failed Streams to check for any failed streams. See the "Stream Failures" section for more information. It is also possible to run a report for the previous day. See the "Stream Failures" section on page 6-18 for more information.

Weekly Tasks

The following tasks should be performed weekly:

Monitoring Tasks for Streamers and Vaults

Monitoring Tasks for Vaults

Monitoring Tasks for Streamers


Note All commands require that you log in to each server's Linux operating system as root. Some tasks have a CDSM option.


Monitoring Tasks for Streamers and Vaults

To monitor the Streamers and Vaults weekly, do the following:


Step 1 Recover used disk space. Log in to each server using the root logon and run the following command:

df -h

Filesystem            Size  Used Avail Use% Mounted on
/dev/hda1              13G  5.2G  7.0G  43% /
/dev/hda6              20G   16G  4.3G  78% /arroyo/log

If the disk usage is greater than 75 percent, recover the disk space using the following methods:

a. Search for and remove any core files.

find /arroyo -name 'core*'
find /home/isa -name 'core*'

b. Copy any archived logs to an external device and delete them from the /arroyo/archive directory.

c. Check for the presence of old install or upgrade ISO files in the /root and /arroyo directories and delete them.

find /root -name '*.iso'
find /arroyo -name '*.iso'

Step 2 Verify the services are running. Choose Monitor > Server Level > Services Monitor to check the services for each server, or log in to each server and run the following commands:

su - isa
show_calypso_services 

Step 3 Check the CServer interfaces to verify the status of the Ethernet adapters. Choose Monitor > Server Level > NIC Monitor, or log in to each server and use the following commands:

a. Use the grep -i Link command to verify that all adapters have a status of "link up," except those adapters that are not being used.

grep -i Link /proc/net/PRO_LAN_Adapters/*.info

b. Use the grep -i Speed command to verify that each adapter that has a "link up" status has a speed of 1000.

grep -i Speed /proc/net/PRO_LAN_Adapters/*.info

c. Use the grep -i State command to verify that all adapters have an "up" state, except those adapters that are not being used.

grep -i State /proc/net/PRO_LAN_Adapters/*.info

Step 4 Check the CServer streaming and cache-fill interfaces using the following command:

/home/stats/ifstats

Step 5 Check the database thread count using the following command:

netstat -an | grep 9999

Two connections for each Vault and Streamer should be listed with a status of "ESTABLISHED."

Step 6 Check the protocol timing logs for errors or problems.

a. Look at the protocol timing logs for packet retransmissions.

tail -f /arroyo/log/protocoltiming.log.<date> | grep retransmissions

Step 7 Look for warning messages.

grep -i warning /arroyo/log/protocoltiming.log.<date> | more 


Note The "WARNING" messages can sometimes be misleading; for example, "datawait" and "slow disk" messages occur normally and do not indicate a problem.


Step 8 Verify that the number of GOIDs for each content object is the same on all servers (Vaults and Streamers) that are supposed to have the content. The number of Vaults that must have the same number of GOIDs for a particular content object is determined by the mirrored copy configuration (see the "Configuring the Servers" section on page 4-78). The number of GOIDs also depends on the trick speeds you configured (see the "Configuring Ingest Tuning" section on page 4-27). If the GOID differs between a Vault and a Streamer, the session is not set up properly and a "no content available" issue results, because no content on the Vault matches the GOID that the Streamer has.
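
The checks in Step 1 and Step 5 both compare command output against a fixed expectation, so they lend themselves to scripting. The following is a minimal sketch, not part of the CDS software: flag_full_filesystems and count_db_connections are hypothetical helper names, and the sketch assumes the column layouts of df -hP and Linux netstat -an output.

```shell
# flag_full_filesystems [PERCENT]
# Read df -hP style output on standard input and print the mount point and
# Use% of every filesystem whose usage is greater than PERCENT (default 75).
# Hypothetical helper, not a CDS command.
flag_full_filesystems() {
    threshold="${1:-75}"
    awk -v limit="$threshold" '
        NR == 1 { next }                    # skip the header row
        {
            use = $5
            sub(/%$/, "", use)              # "78%" becomes "78"
            if (use + 0 > limit + 0) print $6, $5
        }'
}

# count_db_connections
# Read netstat -an style output on standard input and print how many
# ESTABLISHED connections use port 9999 on either end.
# Assumes the local address is field 4 and the foreign address is field 5.
count_db_connections() {
    awk '
        $NF == "ESTABLISHED" && ($4 ~ /[:.]9999$/ || $5 ~ /[:.]9999$/) { n++ }
        END { print n + 0 }'
}
```

For example, df -hP | flag_full_filesystems 75 lists the mount points that need cleanup, and netstat -an | count_db_connections prints a count to compare against two connections for each Vault and Streamer. Unlike a plain grep 9999, the second helper matches only the port field, so an unrelated port such as 19999 is not counted.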


Monitoring Tasks for Vaults

1. Check the available space on the Vault hard drives. Choose Monitor > Server Level > Disk Monitor. The disk availability is shown as a percentage and as a number of gigabytes. Alternatively, view the protocol timing logs by running the following command:

tail -f /arroyo/log/protocoltiming.log.<date> | grep "Capacity Disk:"

The number returned indicates the percentage of the disk space available on this server. If the number is 5 or lower, increase storage space by adding more Vaults, replacing drives with higher-capacity drives, or removing unused content.

2. Check the /home/isa/ContentStore/server/ContentStore.log for ingest errors on each Vault. The master Vault has an additional log.
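
The capacity check in task 1 can be turned into a simple pass or fail test. The following is a sketch under a loud assumption: it assumes that each matching log line contains the text "Capacity Disk:" followed directly by the percentage as an integer, which this guide does not spell out. check_vault_capacity is a hypothetical helper name, not a CDS command.

```shell
# check_vault_capacity [PERCENT]
# Read protocol timing log lines on standard input and report on the most
# recent "Capacity Disk:" value. Prints "LOW n" when n is PERCENT or lower
# (default 5) and "OK n" otherwise.
# Assumption: the percentage follows "Capacity Disk:" as an integer.
check_vault_capacity() {
    limit="${1:-5}"
    awk -v limit="$limit" '
        match($0, /Capacity Disk:[ \t]*[0-9]+/) {
            value = substr($0, RSTART, RLENGTH)
            sub(/^Capacity Disk:[ \t]*/, "", value)   # keep only the number
            last = value + 0
            seen = 1
        }
        END {
            if (!seen)                  print "no Capacity Disk line found"
            else if (last <= limit + 0) print "LOW " last
            else                        print "OK " last
        }'
}
```

Because the verdict is printed only at end of input, feed the helper a finite input, such as the output of tail -n 100 on the same log file, rather than tail -f.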

Monitoring Tasks for Streamers

1. In an ISA environment, look for any errors in the /Streaming/lscp_server/LSCPService.log on the primary Control server, and /Streaming/master/StreamService.log on the primary Setup server.

2. Look at the streaming log.

tail -f /arroyo/log/streamevent.log.<date>

Monthly Tasks

1. Choose Monitor > System Level > System Snapshot and check that the "Disk Available" amount meets the requirements for the expected movie storage in the next three to six months.

2. Run the reports for the last month that are suitable for your requirements and save them as comma-separated value (CSV) files.

3. Using the bandwidth and streaming reports, check that the CDS is not exceeding required usage per service area.

4. Run a quick security check.

a. Ensure that the CDSM changes can be attributed to individual users and not to a generic admin account.

b. Reset CDSM passwords if necessary.

c. Reset Linux passwords if necessary.

d. Check that access policies and firewalls are still enforced.

Other Tasks

If you have access to an anything on demand (XOD) application, do the following:

1. Check the inspect-live log for excessive errors.

2. Check the inspect-live log for excessive communication times with the BMS or CDS.