Saheed Oladosu

66 Followers

May 28

Five Standard Models to Work on Incidents Effectively

Every incident is different, so the best way to make sure you’re working effectively is to follow a standard model.

In my last article, I talked about How to Manage Incidents and why not every alert should trigger an incident. Some issues are simple: the alert comes in with a link to the playbook, you follow the steps in the playbook, and you close the issue. These types of issues are…

SRE

5 min read

Mar 26

Site Reliability Engineering: How to Manage Incidents

Incident management is a formal process, and not every alert will trigger it.

Incident management is one of the important responsibilities of the Site Reliability Engineering team. It’s the on-call person who gets alerted to an incident and starts the investigation. …

SRE

3 min read

Jan 30

How to Setup Multi-burn rate Windows Alert on Service Level Objectives

The burn rate is a calculation of how fast an issue is burning through the error budget.

The concept of alerting is pretty simple. When an SLI tells you you’re consuming an error budget, you need to get a human involved to protect your SLO. The mechanics are fairly straightforward too. In my previous articles, I have discussed How to track Service Level Objectives, Site Reliability Engineering…

Software Development

5 min read

Nov 27, 2022

Site Reliability Engineering: Which metrics help to measure SLI?

SRE recommends a baseline set of metrics to monitor called the four golden signals.

Monitoring is the automated collection of data from your systems, which feeds your SLIs (Service Level Indicators) and tells you if your SLOs (Service Level Objectives) are on track. SRE recommends a baseline set of metrics to monitor, called the four golden signals: latency, traffic, errors, and saturation.

SRE

5 min read

Oct 23, 2022

Site Reliability Engineering: SLI Implementation Example

The Service Level Indicator is the ongoing measurement of your system that tells you whether you’re meeting your objective.

In my article “How do you keep track of the actual service level objectives?”, I discussed ways to track your SLO, which is the target level of reliability. The Service Level Indicator (SLI), by contrast, is the ongoing measurement of your system that tells you whether you’re meeting your objective…

DevOps

4 min read

Sep 27, 2022

How do you keep track of the actual service level objectives?

Service health is defined in terms of multiple service level objectives (SLOs), which are user-focused rather than operations-focused.

There are crucial questions for making sure your services run well enough without excessive maintenance effort. For example: What service level are your apps currently running at? What service level does the business expect? How do you monitor the actual service level?

SRE

5 min read

Aug 24, 2022

How to effectively Identify and Measure Toil as Site Reliability Engineer

The outcome must be worth the investment.

Identifying and measuring toil is the first step to addressing it effectively. Data is your friend here because you need to put together a case for removing the toil. “We should automate this because it’s boring” isn’t really a good justification, especially if the boring task takes 5 minutes once…

SRE

4 min read

Jul 18, 2022

Site Reliability Engineering: What is a Toil?

Toil has a negative impact on people and systems.

You know that feeling when you have a constant stream of work coming in and you keep powering through it, but it feels like you never get anything done? That type of work is toil, and SRE has explicit practices and guidelines to help keep it to a minimum. Toil…

SRE

4 min read

Jun 22, 2022

Site Reliability Engineering: Setting up the right Monitoring System

You need to know as soon as possible if something is going on with your application that affects the end‑user experience.

Monitoring is a foundational capability of a Site Reliability Engineering (SRE) team. You need to know as soon as possible if something is going on with your application that affects the end‑user experience. Your monitoring should also help you identify the root cause as soon as…

SRE

4 min read

Published in Geek Culture · Mar 31, 2021

How to Choose the Right Continuous Integration (CI) System

Choosing the right CI system is important to the success of every product.

There are three main categories of Continuous Integration (CI) systems, and choosing the right one is important to the success of every product:

1. Open-source CI systems
2. Commercial vendor offerings
3. SaaS offerings

Let me start with the open-source category. There are various open-source CI systems, such as Buildbot, …

Programming

4 min read
