We launched the Bonsai Asset Index pretty quietly in February, and since that time we've been doing continual improvements on the asset story with feedback from our early adopters. I’m going to a few minutes to provide an overview of the asset feature journey, how assets work, the role Bonsai plays, and how the community is already contributing!
In theory, the idea of infrastructure that can be deployed between different cloud providers is a wonderful concept. Who wouldn't love to migrate workloads seamlessly between providers for a variety of reasons? In theory, a tiger with an anger management problem is just a scaled up house-cat.
This talk explores the practical reality of a "multi-cloud strategy," with all of its warts. The financial, technical, and operational complexities introduced by multiple providers can take companies by surprise. Come explore the basic truth of "however much you hate your cloud provider, you will hate arbitrating between them far more."
Monitoring Graceful Failure of a Distributed System
How can you be sure that your team is alerted of a failure before it causes an outage for your users?
The move from monolith to microservice has allowed pieces of functionality to be deployed individually and on demand. Having functionality isolated allows the opportunity for one microservice to fail without bringing down the whole system. However, the complexity of releasing and monitoring API calls being made across services has increased.
Whether you’re launching a new product or iterating on a feature, delivering a delightful experience is crucial to your success. If something is to fail, you’d prefer your users didn’t know. Be thoughtful about how your system will degrade, how to inject failure to verify your design, and how this is monitored.
In this talk, Lorne Kligerman, Director of Product at Gremlin, will cover failing gracefully as an engineering goal which can be confidently tested and monitored with Chaos Engineering. By purposely causing failure of one service at a time in a controlled environment, you can safely observe and react in a timely manner to limit the effect on the end user.
The move from monolithic architectures to microservices has resulted in a monumental increase in the number of distinct pieces of software that engineering teams own. It’s getting harder -- some would say impossible -- for engineers to keep the architecture of the entire system in their heads. And this is to say nothing of understanding service interdependencies and the resultant risk profile associated with either code or architectural changes. If we don’t find solutions to these problems, we not only risk large-scale service disruption, but lengthening the time to diagnose and resolve incidents due to a lack of system-level understanding.
In this talk, I’ll share insights from how the most innovative companies on the Internet today combat these issues with service maturity modelling: how they define maturity, how they measure it both before and after a service change is introduced to a system, and how they map out the potential impact of component changes on the whole environment. This talk is also a clarion call for a new way of keeping track of all the “stuff” that we’re building, because our existing approaches like CMDBs and Wikis are inadequate for keeping up with the scale of what’s being built today.
In 2016 Paddy Power and Betfair, two gambling giants, merged to form PPB. Each company had its own monitoring baggage, but the SRE team were tasked with cleaning up and consolidating our toolsets. This talk looks at our selection process, scoring and ultimately the decisions which led us to Sensu - which now monitors over 10,000 clients across the PPB estate.
Building a security Team without becoming “the bad guy”
I have had various developer roles in my career which security teams got in our way and got bad reputations as the NO team. I am in the process of building my own security team. Discuss some tips on avoiding this and engineering seeing security as an asset rather than something to work around.
Share where we are with the Puppet module for managing Sensu. Discuss the changes to the module and how users can migrate from Sensu Classic to Sensu Go. Show off all the testing surrounding the Puppet module and how we were able to uncover issues and contribute back to the Sensu-go project during the GA release and how that continues.
There’s an old wives tale (referred to as the “Evolution of QA to GA: The Sensu Go Crucible”) that tells the story of how we redefined release engineering and quality assurance at Sensu. Failure after failure, we would gut through a far from perfect release strategy which involved hours of painstaking manual testing, in order to stamp a green checkmark of approval to ship our product. To optimize this process, we implemented a full automated test infrastructure for staging and end to end testing, which later became known as the QA Crucible 🔥. This automation pattern was (and still is) great, but I couldn’t help but wonder if there was a well known and loved product which could decipher JSON test results and instrument them in an event pipeline. I believe Sensu can encompass monitoring _and_ testing in a CI/CD pipeline, and I’d like to show everyone how.
Order from Chaos: Automating Monitoring Configuration
In a high performance computing shop with over 3,000 nodes, we can’t afford chaos around our monitoring checks! Come hear about how we're using CI/CD pipelines and the Sensu Go API to ensure that all changes to our monitoring system are validated, reproducible, and version controlled.
Seniority By Chaos: The Changing Role of the Senior Engineer
"Myth: Senior Engineering is a journey from many mistakes to no mistakes.Fact: Years do not make the senior engineer, experience does. Practicing Chaos accelerates learning, providing ""senior level"" skills in months. So what is the new role of the senior engineer? Come to this talk to find out."
Introducing GoAlert – A Brand New On-call Scheduling and Notification Open Source Product
The day to day life of a DevOps & IT Ops engineer should be spent on developing the beautiful products and services you offer your customers and less time operating them. We'll learn how Moogsoft & Sensu, along with other monitoring and observability tools, will drive your digital transformation, allowing you to understand the significance of your alerts and the alerts that are correlated, to give you better and faster context of an incident.
In this talk, Sensu CEO Caleb Hailey will explore how Sensu provides a "monitoring control plane" that is analogous to the Kubernetes control plane, and how Sensu can empower organizations by delivering consistent monitoring workflows in multi-cloud environments.
For over eight years, our community has been using Sensu to monitor their applications and infrastructure at scale. Sensu Go became generally available at the beginning of this year, and was designed to be more portable, easier and faster to deploy, and most importantly: more scalable than ever before! In this talk, Sensu CTO Sean Porter will share Sensu Go scaling patterns, best practices, and case studies. He’ll also explain our design and architectural choices and talk about our plan to take things even further.
We’ve moved from waking someone up if a disk passed some arbitrary threshold to only paging off-hours when the business is impacted. Our lives have improved immensely because we learned how to measure the right things. Let’s take some of the lessons we’ve learned from monitoring and alerting and see if we can apply them to how we measure the humans in our systems. From who we see and don’t see as leaders to which candidates we think have the potential to be excellent contributors, let’s look at how we’ve been measuring humans and see if we are evaluating the right things.
Migrating to Sensu Go: Who, What, When, Where, Why, and How?
Caleb Hailey (Co-Founder & CEO) and Jef Spaleta (Principal Developer Advocate) will answer six pertinent questions about migrating to Sensu Go. During the presentation, Jef will do a live demonstration to show exactly how simple the migration process is, while Caleb takes questions from the audience. Long time Sensu users who have been putting off their migration to Sensu Go will benefit greatly.