domino logo
Latest (5.5)
  • About Domino
  • Architecture
  • Security and Compliance
  • Domino Infrastructure
  • Kubernetes Infrastructure
  • Installation
  • Data Planes
  • Execution Resources
  • Users
  • Keycloak Authentication Service
  • Data
  • Projects
  • Environments
  • Model Monitoring
  • Control Center
  • Logs and Monitoring
  • Backup and Restore
  • Disaster Recovery
  • Configuration Reference
  • Run the Admin Toolkit
  • MongoDB Console
  • Get Help
domino logo
About Domino
Domino Data LabKnowledge BaseData Science BlogTraining
Admin Guide
>
Logs and Monitoring
>
Monitoring
>
Use Grafana Dashboards

Use Grafana Dashboards

Domino comes with a set of default Grafana dashboards. Many of the Domino platform services have corresponding Grafana dashboards. There are also several dashboards for Kubernetes observability.

Retrieve the Grafana password and log in to explore the dashboards.

Get the Grafana password

The Grafana password is the secret from the domino-platform namespace. Retrieve the password with kubectl:

echo -e "\nyour password is: $(kubectl get secret grafana -n domino-platform -ojsonpath='{.data.admin-password}' | base64 -d)\n"

Access Grafana

  1. Go to https://\<your-domino-domain\>/grafana.

  2. Enter the Email or username as grafana.

  3. Enter the password you retrieved above.

  4. Click Log in.

Recommended dashboards

You’ll find many dashboards available in Grafana. The ones described here are especially useful.

Kuberhealthy

Use the Kuberhealthy dashboard as your single source of truth for all defined synthetic checks for basic Kubernetes cluster functionality deployed as part of Domino.

You can see panels such as the following:

Kuberhealthy Check Status

Determines if a synthetic test defined has failed. You can see each test with a result of 1 (succeeded) or 0 (failed) with the time it last ran the check.

Kubernetes Check History

Shows a historical view of when the checks failed.

General / Keycloak JVM Dashboard

Keycloak pods are responsible for user authentication and session management. A Java Virtual Machine (JVM) executes the program that performs these responsibilities. The "Keycloak JVM Dashboard" shows the overall status of Keycloak pods, metrics about JVM resource usage, including heap and non-heap memory usage, garbage collection stats, and thread counts. You can use this dashboard to help understand performance or scaling issues related to Domino’s authentication engine.

General / Nucleus Dispatcher JVM Metrics: Current Pod

The nucleus-dispatcher is a Kubernetes pod that has several important responsibilities within the Domino system, including "dispatching" runs to begin execution. A Java Virtual Machine (JVM) executes the program that performs these responsibilities. The "Nucleus Dispatcher JVM Metrics: Current Pod" dashboard shows metrics about JVM resource usage, including heap and non-heap memory usage, garbage collection stats, and thread counts. You can use this dashboard to help understand performance or scaling issues related to Domino’s execution engine.

This dashboard shows the JVM stats for the currently-running nucleus-dispatcher Kubernetes pod.

There are also similar JVM metrics dashboards for other "Nucleus" Kubernetes pods that perform less critical roles, such as nucleus-train and nucleus-workspace-volume-snapshot-cleaner.

General / Nucleus Dispatcher JVM Metrics: All Pods Including Terminated

This dashboard shows the same metrics as the "Nucleus Dispatcher JVM Metrics: Current Pod" dashboard, but for all historical pods that still have metrics in the system. This dashboard is useful for viewing resource usage across pod restarts.

There are also similar historical metrics dashboards for other "Nucleus" Kubernetes pods.

Kubernetes / Views / *

The dashboards under Kubernetes / Views are for inspecting the overall health of the Kubernetes cluster and resources within it.

Domino Data LabKnowledge BaseData Science BlogTraining
Copyright © 2022 Domino Data Lab. All rights reserved.