Application alarm configurations are required to configure in cee-ops-center within cnAAA deployment, following are the examples
of application Alarm:
-
Access Reject:
Description: This alarm is triggered when the number of Access Reject messages exceeds a threshold limit.
Expression:sum(rate(radius_responses_total{message_type="AccessReject"}[1m])) > N
-
Service Stop Request:
Description:This alarm is triggered when the number of Service Stop Requests exceeds a threshold limit.
Expression:avg(rate(radius_accounting_request_total{accountingType="SessionAccounting", statusType="Stop"}[1m])) > N
-
Service Stop Response:
Description: This alarm is triggered when the number of Service Stop Responses exceeds a threshold limit.
Expression:sum(rate(radius_accounting_response_total{accountingType="SessionAccounting", statusType="Stop"}[1m])) > N
-
PER EP TPS Radius:
Description: This alarm is triggered when the Transactions Per Second (TPS) exceeds a threshold limit.
Expression:sum(irate(radius_requests_total[1m]) or vector(0)) + sum(irate(radius_accounting_request_total[1m]) or vector(0)) by
(pod)
-
Memory Used in Radius POD:
Description: This alarm is triggered when the memory usage in a Radius POD exceeds a threshold limit.
Expression:sum(rate(jvm_memory_bytes_used{namespace="pcf", component="cps-radius-ep"}[1m]))
-
GC Time Period:
Description: This alarm is triggered when the garbage collection (GC) time exceeds a threshold limit.
Expression:
increase(jvm_gc_collection_seconds_sum{component="cps-radius-ep",namespace="pcf"}[$__interval])
-
Total Radius Auth Messages Overload Rejected:
Description: This alarm is triggered when the number of Radius auth messages rejected due to overload exceeds a threshold limit.
Expression:
sum(rate(total_radius_auth_messages_overload_rejected{message_type="AccessReject"}[1m])) > N
-
Total Radius Messages Overload Dropped on Session Accounting
Description: This alarm is triggered when the number of Radius messages dropped on Session Accounting due to overload exceeds a threshold
limit.
Expression:
sum(rate(total_radius_messages_overload_dropped{message_type="SessionAccounting",status_type="Start"}[1m])) > N
-
Total Radius Messages Overload Dropped on Service Accounting
Description: This alarm is triggered when the number of Radius messages dropped on Service Accounting due to overload exceeds a threshold
limit.
Expression:
sum(rate(total_radius_messages_overload_dropped{message_type="ServiceAccounting",status_type="Start"}[1m])) > N
-
GRPC Message Send Total on Accounting Request
Description: This alarm is triggered when the GRPC message send total for Accounting Request exceeds a threshold limit.
Expression:
sum(rate(grpc_message_send_total{message_type="AccountingRequest"}[1m])) > N
-
GRPC Message Send Total on Access Request
Description: This alarm is triggered when the GRPC message send total for Access Request exceeds a threshold limit.
Expression: sum(rate(grpc_message_send_total{message_type="AccessRequest"}[1m])) > N
-
Radius Proxy Accounting Response Total on Error
Description: This alarm is triggered when the Radius proxy accounting responses with errors exceed a threshold limit.
Expression: sum(rate(radius_proxy_accounting_response_total{accounting_type="ServiceAccounting",status_type="Start",result="ERROR"}[1m]))
> 1
-
CoA Timeout
Description: This alarm is triggered when Change of Authorization (CoA) timeouts exceed a threshold limit.
Expression: sum(rate(radius_request_timeout_total{message_type="CoaRequest"}[1m])) > N
Policy Engine Timeout Message on Accounting Request
Description: This alarm is triggered when Policy Engine timeout messages for Accounting Request exceed a threshold limit.
Expression: sum(rate(POLICY_ENGINE_TIMEOUT_MESSAGE{message_type="AccountingRequest"}[1m])) > N
-
Policy Engine Timeout Message on Access Request
Description: This alarm is triggered when Policy Engine timeout messages for Access Request exceed a threshold limit.
Expression: sum(rate(POLICY_ENGINE_TIMEOUT_MESSAGE{message_type="AccessRequest"}[1m])) > N
Policy Engine Message Total on Access Request
Description: This alarm is triggered when the total Policy Engine messages for Access Request exceed a threshold limit.
Expression: sum(rate(policy_engine_message_total{message_type="AccessRequest"}[1m])) > N
Policy Engine Message Total on Accounting Request
Description: This alarm is triggered when the total Policy Engine messages for Accounting Request exceed a threshold limit.
Expression: sum(rate(policy_engine_message_total{message_type="AccountingRequest"}[1m])) > N
-
Dispatch Error Total on Bundled CoA Request
Description: This alarm is triggered when dispatch errors for Bundled Change of Authorization (CoA) Requests exceed a threshold limit.
Expression: sum(rate(dispatch_error_total{message_type="AsyncCoARequest"}[1m])) > N
-
Dispatch Error Total on AsyncCoA Request
Description: This alarm is triggered when dispatch errors for Async Change of Authorization (CoA) Requests exceed a threshold limit.
Expression: sum(rate(dispatch_error_total{message_type="BundledCoARequest"}[1m])) > N
-
Process Message Total on Accounting Response
Description: This alarm is triggered when the total processed messages for Accounting Response exceed a threshold limit.
Expression: sum(rate(process_message_total{message_type="AccountingResponse"}[1m])) > N
-
Process Message Total on Access Accept
Description: This alarm is triggered when the total processed messages for Access Accept exceed a threshold limit.
Expression: sum(rate(process_message_total{message_type="AccessAccept"}[1m])) > N
-
Outbound Request Total on Proxy Accounting
Description: This alarm is triggered when the total outbound requests for Proxy Accounting exceed a threshold limit.
Expression: sum(rate(outbound_request_total{message_type="ProxyAccounting"}[1m])) > N
-
Outbound Request Total on CoA Request
Description: This alarm is triggered when the total outbound requests for Change of Authorization (CoA) Requests exceed a threshold limit.
Expression: sum(rate(outbound_request_total{message_type="CoARequest"}[1m])) > N
-
Inbound Request Total on Proxy Accounting
Description: This alarm is triggered when the total inbound requests for Proxy Accounting exceed a threshold limit.
Expression: sum(rate(inbound_request_total{message_type="ProxyAccounting"}[1m])) > N
-
Inbound Request Total on Access Request
Description: This alarm is triggered when the total inbound requests for Access Request exceed a threshold limit.
Expression:sum(rate(inbound_request_total{message_type="AccessRequest"}[1m])) > N
-
Record Conflict Merge Total
Description: This alarm is triggered when the total record conflict merges exceed a threshold limit.
Expression:sum(rate(record_conflict_merge_total[1m])) > N
-
Radius Access Request Message on Error
Description: This alarm is triggered when Radius Access Request messages on errors exceed a threshold limit.
Expression: sum(rate(message_total{type="radius-access-request-message", status="error"}[1m])) > N
-
I Send Access Accept on Error
Description: This alarm is triggered when "I Send Access Accept" messages on errors exceed a threshold limit.
Expression:sum(rate(action_total{type="i-send-access-accept", status="error"}[1m]))
-
Radius Accounting Message on Error
Description: This alarm is triggered when Radius Accounting messages on errors exceed a threshold limit.
Expression:sum(rate(message_total{type="radius-accounting-message", status="error"}[1m])) > N