Oncall 101 quiz

Create an image depicting a tech professional engaged in incident response, surrounded by digital tools and alert notifications, in a modern office setting, emphasizing teamwork and problem-solving.

Oncall 101 Quiz

Welcome to the Oncall 101 Quiz! Designed to test your knowledge and understanding of incident response processes and best practices, this quiz is perfect for both new and experienced oncall engineers. Whether you're looking to sharpen your skills or refresh your knowledge, this engaging quiz will challenge you.

  • Assess your incident management skills
  • Learn about best practices in oncall responsibilities
  • Improve your readiness for real-world scenarios
14 Questions4 MinutesCreated by ResoluteEngineer147
What are the main responsibilities for primary oncallers?
Incidents management
Alerts handling
Respond to deployment failure
Write code and fix the bugs
Investigate ,facilitate and triage
Ack the page then ping others
What's the first thing to do after I get paged?
Evaluate severity
Investigation
Create an incident
Acknowledge the page
Communicate in #incident-triage
When should I create an incident?
Customer facing issue to all customers
Customer facing issue to a subset of customers
Internal user facing issue: such as broken admin tool
Transient deployment failure due to broken tests etc
Right after I got paged
What are the most common causes of systemic failure
Regression after deployment
Config changes in prod: experiment, feature flags, zsettings etc
External outage
Broken tests
Missing metrics and logging
What are the best practices to mitigate a regression issue that happens after a deployment
Freeze the deploy
Rollback the deployed binary
Revert the culprit PR
Force push the SHA after it passes smoke test
Unfreeze the deploy right after rollback
Monitor the system health after rollback
Work on a hot fix
During the incident, what to do if it's a domain you are not familiar with?
Look up go/ownership
Page other team's oncallers if you need help or its relevant to their team's area
Deepdive into that domain and figure it out
How to page other oncallers?
Use slack command: /page-team
Call them
Ping them on slack
What should oncaller do after an incident is declared
Involve the right folks
Provide a tldr and context in incident channel
Silently investigate the root cause by yourself
Involve CH team and #incident-comms-on-call if it's end user facing issue
Provide incremental update of steps to mitigation/resolution
Go grocery shopping
What should oncaller do after an incident is resolved
Archive the incident channel
Facilitate authoring postmortem, scheduling
Go grocery shopping
Summarize the lessons learned and share with the team
Turn off the laptop and cellphone
What are the common useful links during incident investigation
Go/logs
Go/911
Go/deploy
Go/beagle
Go/sentry
Go/reviews
Go/prf
What are recommended steps to do after you get an alert message
Ignore it
Focus on my current work and come back to it later today
Annotate it with :eyes emoji when you start looking into it
Evaluate the severity
Create an incident
Perform initial investigation
Page other team's oncaller if it's relevant
Search the alert message in the alert channel
Mute the alert
Tag team's oncaller if it's related to their team
What to do if you think the playbook in alert message isn't helpful?
Update the playbook
Remove the playbook
Reply to the alert message with your findings
Communicate with the team and reach consensus on alert adjustment
Mute the alert
What to do if you think the alert is too noisy?
Delete the alert
Increase the alert threshold
Ignore the alert and let it fire
Work on short-term/long-term plan and adjust the alert rule to reduce the noise because alert noise leads to ignoring alerts
What to do when your oncall rotation ends
Document things in go/oncall-handoff
Run the weekly oncall handoff meeting
Unwind, reflect on how the shift impacted your anxiety levels and improve it
Call out things that next oncaller need to follow up
{"name":"Oncall 101 quiz", "url":"https://www.quiz-maker.com/QPREVIEW","txt":"Welcome to the Oncall 101 Quiz! Designed to test your knowledge and understanding of incident response processes and best practices, this quiz is perfect for both new and experienced oncall engineers. Whether you're looking to sharpen your skills or refresh your knowledge, this engaging quiz will challenge you.Assess your incident management skillsLearn about best practices in oncall responsibilitiesImprove your readiness for real-world scenarios","img":"https:/images/course4.png"}
Powered by: Quiz Maker