<img height="1" width="1" style="display:none" src="https://www.facebook.com/tr?id=1919858758278392&amp;ev=PageView&amp;noscript=1">

Session Name: Effective Observability in Microservice Architectures

This talk will focus on how to create an effective monitoring system across an organization’s entire microservice architecture. This talk combines principles from effective monitoring with microservice architecture patterns to define criteria that will set microservice organizations up for success. One example of effective techniques include having consistency across all services since this consistency can centralize the definition of observability & what it means to have an “observable” system, make it easier to set up monitoring for services, and enable engineers to troubleshoot production issues across their own services and service dependencies. Another example is having SLOs that align well with microservice ownership. Much like any given team should not own more than a few services, any given team also should not own more than a few different SLOs. I will dive into the organizational benefits that microservices provide and SRE reinforces by implementing SLOs. Thirdly, how the main units of an observable system (events) align well with event-driven microservices. I will go into specifics on how events, logs, metrics, and traces relate to one another.
 

Session Name: Psychologically Safe Reliability Management

Psychological safety is particularly important for teams that manage service reliability. The vulnerability that comes with mitigating failures in production requires principles of trust, transparency, and inclusion that can only come from cultures that minimize harm and enable empowerment. Cultivating this kind of culture requires leaders to think proactively about how to build processes and systems that enable teams to be healthy, productive, and effective while being adequately prepared for situations when failure inevitably happens. We’ll review the cultural consequences of chronic issues and the strategies we can use as leaders to align with our shared goal of building excellent teams. We’ll touch upon themes of privilege, power, and accountability.

 

Speaker Bio:

Lesley Cordero is currently a Senior Software Engineer at Teachers Pay Teachers. She has spent the majority of her career on edtech teams as an engineer, including Google for Education and other edtech startups. In her current role, she has focused on building excellent teams and reliability management by setting an org-wide vision & strategy for observability, improving on-call processes, introducing chaos engineering, and cultivating a culture that builds with the most vulnerable employees in mind first. She shows care for others by holding them accountable to the best versions of themselves – and by buying them the occasional bubble tea.