You are monitoring Google Kubernetes Engine (GKE) clusters in a Cloud Monitoring workspace. As a Site Reliability Engineer (SRE), you need to triage incidents quickly. What should you do?
A.
Navigate the predefined dashboards in the Cloud Monitoring workspace, and then add metrics and create alert policies.
Most Voted
B.
Navigate the predefined dashboards in the Cloud Monitoring workspace, create custom metrics, and install alerting software on a Compute Engine instance.
C.
Write a shell script that gathers metrics from GKE nodes, publish these metrics to a Pub/Sub topic, export the data to BigQuery, and make a Data Studio dashboard.
D.
Create a custom dashboard in the Cloud Monitoring workspace for each incident, and then add metrics and create alert policies.
A is correct. Option D is highly inefficient and time-consuming. Creating individual dashboards for every incident is impractical and slows down the triage process.
Explanation: Cloud Monitoring provides predefined dashboards for monitoring GKE clusters, which facilitate an immediate and comprehensive view of cluster performance and health. As an SRE, utilizing these dashboards helps triage incidents quickly. You can also add additional metrics that are pertinent to the incident and create alert policies that will notify you when specific conditions indicative of an incident are met. This strategy allows for the proactive monitoring of incidents and rapid response when necessary.
Ans: D. Although creating dashboard per incident sounds confusing and inefficient, it is still better than the impossible option A as we can't edit or add metrics to a predefined dashboard. Inefficient option vs Impossible Option - Inefficient one is ok!
Optipn A is possible. You can't add widgets to predefined dashboard but you can create alert policies based on metrics. Although the structure of the question is actually misleading
"You can't delete or modify the automatically-created dashboards; however, when support for copying the dashboard exists, you can modify the copy. In general, you can also copy charts on a predefined dashboard to a dashboard that you create. Dashboards that you create are custom dashboards. Custom dashboards let you display information that is of interest to you, organized in a way that's useful to you. "
https://cloud.google.com/monitoring/charts/predefined-dashboards
"To view the chart associated with an alerting policy and information about incidents in the same context as your metric data, add alert charts and incident widgets to your CUSTOM dashboard." https://cloud.google.com/monitoring/dashboards/alerts-and-incidents
A voting comment increases the vote count for the chosen answer by one.
Upvoting a comment with a selected answer will also increase the vote count towards that answer by one.
So if you see a comment that you already agree with, you can upvote it instead of posting a new comment.
kopper2019
Highly Voted 3 years, 3 months agoDiegoMDZ
Highly Voted 3 years, 2 months agobandegg
9 months agoMikeliz
Most Recent 1 month agoplumbig11
3 months ago6a8c7ad
4 months agohitmax87
4 months, 2 weeks agoryaryarya
2 months, 3 weeks agocoolie1234
5 months, 1 week agoGino17m
5 months, 2 weeks agomesodan
7 months agoOrangeTiger
8 months, 2 weeks agohzaoui
8 months, 2 weeks agoSSS987
8 months, 2 weeks agoGino17m
5 months, 2 weeks agoade7cae
9 months, 3 weeks agospuyol
9 months, 3 weeks agobrentc
10 months, 3 weeks agoTopTalk
1 year agoArtistS
10 months, 3 weeks agoTopTalk
1 year ago