Posts on the topic 'postmortems' | DevsDay.ru

Development The GitHub Blog December 15, 2021, 18:35

This blog post tells the story of why we built a new search engine optimized for code... read more

Engineering Product code search

DevOps DZone DevOps December 10, 2021, 22:15

Although SRE toolsets vary from one team to another, there is one type of tool, Infrastructure-as-Code (IaC), that virtually every SRE needs to manage reliability at scale. If you’re not leveraging IaC, you’re not being all you can be as an SRE. Keep... read more

infrastructure sre infrastructure as code iac iac tools

DevOps DZone DevOps December 2, 2021, 12:08

Incident management is one of the most critical processes a software development team has to get right. Service outages can be costly to the business, and teams need an efficient way to respond to and resolve these issues quickly. For example, many or... read more

incident management

Development dev.to November 26, 2021, 16:58

No matter how stable your software product is, occasionally things go wrong in production, and Jobber is committed to doing a post-mortem investigation to follow up and learn from each incident. At a high level, an incident post-mortem answers these... read more

productivity devops postmortems

DevOps DZone DevOps November 21, 2021, 9:49

As the tech world changes, language changes with it. New technologies will always introduce new terms and descriptions to provide a clear understanding. For example, the emergence of the cloud introduced language to describe the changing relationship... read more

devops sre

Development dev.to November 18, 2021, 20:31

As the tech world changes, language changes with it. New technologies will always introduce new terms and descriptions to provide a clear understanding. For example, the emergence of the cloud introduced language to describe the changing relationship b... read more

sre devops

Development dev.to November 12, 2021, 10:20

Debugging distributed systems is not a simple task. One of the first tools we need is a mental model of how the system behaves (or how we want it to behave). Try to draw, in a simple diagram, the flow of interactions,... read more

programming productivity architecture

Development Honeybadger Developer Blog November 1, 2021, 1:28

Errors happen in every application. Devs have to decide: do you write code to handle the error? Suppress it? Notify the user? Report it to the team? In this article, Ayo Isaiah walks us through every aspect of the JavaScript error system. He'll show... read more

DevOps DZone DevOps October 23, 2021, 15:50

In the world of reliability engineering, folks talk frequently about “incident response teams.” But they rarely explain what, exactly, an incident response team looks like, how it’s structured, or which roles organizations should define for incident... read more

devops sre incident management incident response

DevOps DZone DevOps October 1, 2021, 20:59

SRE and DevOps deliver the best value when used together. Culture is key to avoiding burnout. You need the cloud more than ever. These are among the main takeaways from Google Cloud’s latest Accelerate State of DevOps report, which examines how compa... read more

devops google sre incident management site reliability engineer site reliability engineering tools

Development dev.to September 13, 2021, 7:49

September 13th, 2021 - Instalment #81. Newsletter #81. Welcome new and existing readers of this newsletter to another edition with plenty to excite you. This week's brand new open source projects include some great new AWS CDK constructs to help you... read more

opensource aws

Development dev.to September 1, 2021, 13:31

Jonan Scheffler interviews DevOps engineer Arshad Zackeriya about how FinOps is involved with observability and how observability can help FinOps. Should you find a burning need to share your thoughts or rants about the show, please spray them at d... read more

devops kubernetes podcast observability

DevOps DZone DevOps August 28, 2021, 20:13

What Is Site Reliability Engineering (SRE)? The site reliability engineering (SRE) concept originated at Google and is closely related to the principles of DevOps. It is an approach to IT operations. SRE teams use software to manage systems, solv... read more

devops automation best practices system administration sre slis slos slas

DevOps DZone DevOps July 23, 2021, 15:57

Introduction: It's common today to talk about the "gap between security and development" or the "DevOps security disconnect." That makes good sense; there is indeed a need to de-silo security from the development and DevOps processes. What receives su... read more

security devops software saas devsecops reliability incident management sre incident response outage

DevOps DZone DevOps July 23, 2021, 15:53

Introduction: Mastering the concepts at the core of reliability is the first step in becoming an SRE. But you also need tools to put those concepts into practice. Which types of tools do SREs need to do their jobs? And what are the best tools in each... read more

devops software incident management sre gremlin incident response pagerduty outages datadog tools for sre

Development dev.to July 14, 2021, 20:53

This incident retro was tougher than most to share because, despite the seriousness of the issue, it affected only a very small percentage of our user base. However, we learned some incredibly valuable lessons and I think it's only right that I give... read more

incident retro postmortem

DevOps DZone DevOps June 16, 2021, 23:11

In most organizations, developers are not allowed to access the production environment for stability, security, or regulatory reasons. It is quite a good practice (enforced by many frameworks like COBIT or ITIL) to restrict access to production, but... read more

devops development production experience issues best-practices

Development dou.ua May 31, 2021, 10:00

Does a CTO need to program? It would seem that the main things in business are managing people well and being able to sell your product. Yet many developers who have grown into the role of technical director keep programming. Why do they do it, and what does it give them personal... read more

Design UX Planet May 2, 2021, 21:36

In this article, I will talk about the steps I took to win the challenge within a 4-day timeline. This article is about my journey, experience, and the framework I followed to perform a UX audit on a social e-commerce app, “MEESHO”. For those wh... read more

ux-research uxaudit ux-writing ux-strategy ux-challenge

DevOps DZone DevOps March 19, 2021, 23:35

With the onset of remote work due to COVID-19, remote incident management has become the norm for businesses worldwide. Organizations that were earlier used to having war rooms now find themselves having to coordinate teams through Slack, MS Teams, o... read more

best practices incident management sre experiences remote work management