The Case for Continuous Reliability Management
Over the past decade, the processes for effective application delivery have evolved significantly. We have moved from waterfall to agile, ..
In a previous article on Site Reliability Engineering (SRE), we discussed the SRE role and why it’s in high demand. At the end of the article, ..
The beauty of a generic mitigation is that it solves a wide variety of problems using a single solution. When executed smoothly, generic ..
In recent years, many engineering organizations have embraced DevOps as a means to improve the software development lifecycle and increase ..
Unreliable services can affect businesses in myriad ways, from slowed development velocity, to unhappy users, to impacted revenue streams. ..
Site Reliability Engineering (SRE) can mean different things to different companies, and operators that are responsible for reliability ..
You’ve heard of playbooks. But what about playbooks-as-code? How can playbooks be managed as code, and what does that mean for SREs and incident ..
Traditionally, the first key question that coders had to answer as they charted their career paths was whether they wanted to be a software ..
A risk-averse business or engineering culture can slow down delivery of products and cost business opportunities. Using risk management ..
In incident response, every problem is unique. But that doesn’t mean that every problem requires a unique response. On the contrary, ..
One of the most important aspects of a software system is its reliability — and for good reason. With so many digital options available ..
When organizations adopt Agile development practices, they also adopt shorter release cycles. These shorter release cycles mean more frequent ..
In the current IT market, one of the hottest job roles is the Site Reliability Engineer (SRE). In January 2019, according to LinkedIn, ..
DevOps has transformed the way we think about the roles of the IT team. Now, IT engineers don’t just maintain software post-deployment. ..
Being a developer once meant writing code, possibly testing and building it, and then calling it a day. In the era of DevOps, however, the work ..
Modern IT has changed enormously over the last few decades and created a plethora of new opportunities for individuals and businesses. In the process, ..
Poorly implemented postmortems can be painful for everyone involved; they cost money, and worse yet, they can fail to address the root cause ..
If you work in IT Ops, SRE, or DevOps, you don’t need to be told that every second counts in incident response. You already know that. ..
DevOps and SRE are sometimes referred to as competing or separate disciplines. This post looks at DevOps vs. SRE, showing that they are not really ..
This post was originally published on The New Stack. When something goes wrong in your production environment, you want your best and brightest ..