Google Cloud
Site Reliability Engineering: Measuring and Managing Reliability
Google Cloud

Site Reliability Engineering: Measuring and Managing Reliability

Access provided by Duke University

58,948 already enrolled

Gain insight into a topic and learn the fundamentals.
4.5

(947 reviews)

Intermediate level
Some related experience required
Flexible schedule
1 week at 10 hours a week
Learn at your own pace
89%
Most learners liked this course
Gain insight into a topic and learn the fundamentals.
4.5

(947 reviews)

Intermediate level
Some related experience required
Flexible schedule
1 week at 10 hours a week
Learn at your own pace
89%
Most learners liked this course

What you'll learn

  • How to make systems reliable

  • Quantifying risks to and consequences of SLOs

  • Understanding SLIs, SLOs and SLAs

Details to know

Shareable certificate

Add to your LinkedIn profile

Assessments

16 assignments¹

AI Graded see disclaimer
Taught in English

See how employees at top companies are mastering in-demand skills

 logos of Petrobras, TATA, Danone, Capgemini, P&G and L'Oreal

There are 7 modules in this course

This module is intended to bring you up to speed on the concepts underpinning SRE, CRE, and SLOs. If you're already familiar with these concepts, you may still find new information and perspectives in this module, but it is not necessary to complete it.

What's included

11 videos1 assignment

In this module we’re going to talk about how you measure the desired reliability of a service. We will address what to consider when setting SLOs for your application within your organization. We'll look at the three principles we use to measure the desired reliability of a service: figuring out what you want to promise and to whom, figuring out the metrics you care about that make your service reliability ā€œgood", and finally, deciding how much reliability is good enough.

What's included

7 videos4 assignments

In this module, we’ll start by introducing a mechanism for quantifying unreliability using something called an error budget. We'll show how error budgets help you decide when to focus on making a service more reliable. And then we'll learn about some of the engineering and operational improvements that can help you do that.

What's included

7 videos3 assignments

In this module we will start off by taking a look at some characteristics of monitoring metrics that can make them useful as SLIs and contrast these against other metrics that are less useful. Because the choice of where to measure an SLI is a key variable, we'll cover the five main ways you can measure an SLI and compare their pros and cons.

What's included

14 videos3 assignments5 discussion prompts

In this module, we'll start off with an overview of our four step process for developing SLOs and SLIs for a user journey. We'll introduce the fictional company that created our example mobile game, the infrastructure that we'll be working with, and the simple user journey we'll be applying the four step process to.

What's included

7 videos2 assignments2 peer reviews

In this module we'll be taking a critical look at the availability risks for our example service. We want to answer the question: "are our SLO targets and error budgets realistic?"

What's included

4 videos2 peer reviews

In this module, we'll cover best practices for documenting your SLOs, the rationale behind a formal error budget policy and how best to create one and finally, we'll look at an example error budget policy in order to understand the trade-offs and incentives that play out during negotiations when trying to write an error budget policy.

What's included

9 videos3 assignments3 discussion prompts

Instructor

Instructor ratings
4.5 (301 ratings)
Google Cloud Training
Google Cloud
1,995 Courses3,643,645 learners

Offered by

Google Cloud

Why people choose Coursera for their career

Felipe M.
Learner since 2018
"To be able to take courses at my own pace and rhythm has been an amazing experience. I can learn whenever it fits my schedule and mood."
Jennifer J.
Learner since 2020
"I directly applied the concepts and skills I learned from my courses to an exciting new project at work."
Larry W.
Learner since 2021
"When I need courses on topics that my university doesn't offer, Coursera is one of the best places to go."
Chaitanya A.
"Learning isn't just about being better at your job: it's so much more than that. Coursera allows me to learn without limits."

Learner reviews

4.5

947 reviews

  • 5 stars

    69.90%

  • 4 stars

    20.80%

  • 3 stars

    5.17%

  • 2 stars

    2.21%

  • 1 star

    1.90%

Showing 3 of 947

GM
5

Reviewed on May 27, 2020

SP
4

Reviewed on Jun 7, 2020

EE
4

Reviewed on Feb 15, 2019

Explore more from Information Technology

¹ Some assignments in this course are AI-graded. For these assignments, your data will be used in accordance with Coursera's Privacy Notice.