What is System Health?

Description

The NSP System Health dashboard displays a number of system KPIs. The default view includes a graphical representation of the number of pods in each state, such as Running or Pending, for quick identification of problems. The view also lists relevant information for each pod, such as the pod uptime, host NSP cluster node, and number of pod restarts.

The view displays additional information in the following dashlets:

  • News Feed—list of alarms with pod and alarm information

  • Kubernetes Cluster Status—graphical representation of the clusters and the state of each cluster. The view also lists relevant information for each cluster and for nodes in a cluster.

  • Database Backup Status—a graphical representation of databases and the state of each backup for database backups. The view displays important information such as the backup pod status, current backup status, last run time, and last successful backup.

  • Auxiliary Database Clusters—a graphical representation of clusters and the state of each cluster. The view lists relevant information for each cluster and for nodes in a cluster. You can also run auxiliary database backups directly from this dashlet.

You can also invoke the following from the System Health dashboard:

  • Log Viewer—local OpenSearch instance with dashboards for viewing and analyzing NSP application log data

  • Grafana—local Grafana instance that draws on various data sources to provide visualizations and alerts