System health checks
The Cisco NCS 1014 health check service is a system monitoring service that
- monitors physical characteristics, current processing status, and the currently utilized resources to assess the condition of the device at any time,
- analyzes the system health by tracking metrics that are critical for the functioning of Cisco NCS 1014, and
- is installed with the Cisco NCS 1014 RPM.
The system health metrics have thresholds set on the device to monitor the usage of CPU and other system resources.
System resource metrics states
You can evaluate the system's health by examining the metric values. Values that approach or cross the set thresholds suggest potential problems. By default, metrics for system resources are configured with preset threshold values. You can customize which metrics are monitored by enabling or disabling individual metrics based on your requirements.
Each metric is tracked and compared with its configured thresholds, and the state of the resource is classified accordingly.
The system resource metrics can be in one of these states:
- Normal: The resource usage is less than the minor threshold value.
- Minor: The resource usage is more than the minor threshold, but less than the severe threshold value.
- Severe: The resource usage is more than the severe threshold, but less than the critical threshold value.
- Critical: The resource usage is more than the critical threshold value.
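The state classification described above can be sketched in a few lines of Python. This is an illustration only, not the device's implementation: the `Thresholds` class, the `classify` function, and the threshold values in the usage note are hypothetical. The sketch also assumes that a usage value exactly equal to a threshold stays in the lower state, because the text says "more than" and does not specify the boundary behavior.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Thresholds:
    """Hypothetical container for the three thresholds of one metric."""
    minor: float
    severe: float
    critical: float

    def __post_init__(self):
        # The four states only make sense if the thresholds are ordered.
        if not (self.minor < self.severe < self.critical):
            raise ValueError("thresholds must satisfy minor < severe < critical")


def classify(usage: float, t: Thresholds) -> str:
    """Return the state name for a resource usage value.

    Assumption: a value equal to a threshold is treated as not having
    crossed it, so it falls into the lower state.
    """
    if usage > t.critical:
        return "Critical"
    if usage > t.severe:
        return "Severe"
    if usage > t.minor:
        return "Minor"
    return "Normal"
```

For example, with illustrative thresholds `Thresholds(minor=10, severe=20, critical=30)` (not the device defaults), `classify(25, ...)` falls between the severe and critical thresholds and is classified as Severe.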
Infrastructure service metrics states
The infrastructure service metrics can be in one of these states:
- Normal: The resource operation is as expected.
- Warning: The resource needs attention. For example, a warning is displayed when the Field-Programmable Device (FPD) needs an upgrade.
Supported system health check metrics
Cisco NCS 1014 supports the following system health check metrics:
- communication-timeout
- cpu
- filesystem
- fpd
- free-mem
- hw-monitoring
- lc-monitoring
- pci-monitoring
- platform
- process-resource
- process-status
- shared-mem
- wd-monitoring