We launched the Bonsai Asset Index pretty quietly in February, and since then we've been continually improving the asset story with feedback from our early adopters. I’m going to take a few minutes to provide an overview of the asset feature journey, how assets work, the role Bonsai plays, and how the community is already contributing!
In theory, the idea of infrastructure that can be deployed between different cloud providers is a wonderful concept. Who wouldn't love to migrate workloads seamlessly between providers for a variety of reasons? In theory, a tiger with an anger management problem is just a scaled up house-cat.
This talk explores the practical reality of a "multi-cloud strategy," with all of its warts. The financial, technical, and operational complexities introduced by multiple providers can take companies by surprise. Come explore the basic truth of "however much you hate your cloud provider, you will hate arbitrating between them far more."
How can you be sure that your team is alerted to a failure before it causes an outage for your users?
The move from monolith to microservice has allowed pieces of functionality to be deployed individually and on demand. Having functionality isolated allows the opportunity for one microservice to fail without bringing down the whole system. However, the complexity of releasing and monitoring API calls being made across services has increased.
Whether you’re launching a new product or iterating on a feature, delivering a delightful experience is crucial to your success. If something is to fail, you’d prefer your users didn’t know. Be thoughtful about how your system will degrade, how to inject failure to verify your design, and how this is monitored.
In this talk, Lorne Kligerman, Director of Product at Gremlin, will cover failing gracefully as an engineering goal which can be confidently tested and monitored with Chaos Engineering. By purposely causing failure of one service at a time in a controlled environment, you can safely observe and react in a timely manner to limit the effect on the end user.
The move from monolithic architectures to microservices has resulted in a monumental increase in the number of distinct pieces of software that engineering teams own. It’s getting harder -- some would say impossible -- for engineers to keep the architecture of the entire system in their heads. And this is to say nothing of understanding service interdependencies and the resultant risk profile associated with either code or architectural changes. If we don’t find solutions to these problems, we risk not only large-scale service disruption but also longer times to diagnose and resolve incidents due to a lack of system-level understanding.
In this talk, I’ll share insights from how the most innovative companies on the Internet today combat these issues with service maturity modelling: how they define maturity, how they measure it both before and after a service change is introduced to a system, and how they map out the potential impact of component changes on the whole environment. This talk is also a clarion call for a new way of keeping track of all the “stuff” that we’re building, because our existing approaches like CMDBs and Wikis are inadequate for keeping up with the scale of what’s being built today.
In 2016 Paddy Power and Betfair, two gambling giants, merged to form PPB. Each company had its own monitoring baggage, but the SRE team were tasked with cleaning up and consolidating our toolsets. This talk looks at our selection process, scoring and ultimately the decisions which led us to Sensu - which now monitors over 10,000 clients across the PPB estate.
This tutorial will cover using Grafana to display events and device state from multiple Sensu servers. The tutorial will also demonstrate how to use Loki for Sensu Server and Agent log aggregation. A preview of a Sensu Go datasource for Grafana will also be included as part of the updated Sensu App.
I have had various developer roles in my career in which security teams got in our way and earned a bad reputation as the "NO" team. I am now in the process of building my own security team. I'll discuss some tips on avoiding this and on getting engineering to see security as an asset rather than something to work around.
This talk shares where we are with the Puppet module for managing Sensu, discusses the changes to the module, and explains how users can migrate from Sensu Classic to Sensu Go. It also shows off the testing surrounding the Puppet module, how that testing let us uncover issues and contribute back to the Sensu Go project during the GA release, and how that work continues.
There’s an old wives' tale (referred to as the “Evolution of QA to GA: The Sensu Go Crucible”) that tells the story of how we redefined release engineering and quality assurance at Sensu. Failure after failure, we would gut through a far-from-perfect release strategy that involved hours of painstaking manual testing in order to stamp a green checkmark of approval to ship our product. To optimize this process, we implemented a fully automated test infrastructure for staging and end-to-end testing, which later became known as the QA Crucible 🔥. This automation pattern was (and still is) great, but I couldn’t help but wonder if there was a well-known and well-loved product that could decipher JSON test results and instrument them in an event pipeline. I believe Sensu can encompass monitoring _and_ testing in a CI/CD pipeline, and I’d like to show everyone how.
In this hands-on workshop, you'll learn how to set up InfluxDB to store data from both service checks and perf metrics, create visualizations and dashboards from that data using the InfluxDB UI, and perform complex checks and alerting based on historical data. Noah is a Developer Evangelist at InfluxData, focused on DevOps with a dash of IoT. He has worked as a systems administrator, audio engineer, production coordinator for an animation studio, and as a software developer building DevOps tools. Noah holds a master’s degree from the Interactive Telecommunications Program at NYU, where he focused on augmented reality, physical computing, and interaction design.
In a high-performance computing shop with over 3,000 nodes, we can’t afford chaos around our monitoring checks! Come hear about how we're using CI/CD pipelines and the Sensu Go API to ensure that all changes to our monitoring system are validated, reproducible, and version controlled.
Myth: Senior Engineering is a journey from many mistakes to no mistakes. Fact: Years do not make the senior engineer, experience does. Practicing Chaos accelerates learning, providing "senior level" skills in months. So what is the new role of the senior engineer? Come to this talk to find out.
The day-to-day life of a DevOps & IT Ops engineer should be spent more on developing the beautiful products and services you offer your customers and less on operating them. We'll learn how Moogsoft & Sensu, along with other monitoring and observability tools, will drive your digital transformation by helping you understand the significance of your alerts and which alerts are correlated, giving you better and faster context for an incident.
In this talk, Sensu CEO Caleb Hailey will explore how Sensu provides a "monitoring control plane" that is analogous to the Kubernetes control plane, and how Sensu can empower organizations by delivering consistent monitoring workflows in multi-cloud environments.
In this talk, VP of Product Anna Plotkin will cover the product evolution of Sensu Go. Starting with the original goals of “Sensu 2.x,” insight on the journey to GA and Sensu’s initial stable release will set the stage for the innovative growth in 2019. Anna will also cover what our most exciting developments have been this year and look to how they're building the foundation for the future into 2020 and beyond.
For over eight years, our community has been using Sensu to monitor their applications and infrastructure at scale. Sensu Go became generally available at the beginning of this year, and was designed to be more portable, easier and faster to deploy, and most importantly: more scalable than ever before! In this talk, Sensu CTO Sean Porter will share Sensu Go scaling patterns, best practices, and case studies. He’ll also explain our design and architectural choices and talk about our plan to take things even further.
We’ve moved from waking someone up if a disk passed some arbitrary threshold to only paging off-hours when the business is impacted. Our lives have improved immensely because we learned how to measure the right things. Let’s take some of the lessons we’ve learned from monitoring and alerting and see if we can apply them to how we measure the humans in our systems. From who we see and don’t see as leaders to which candidates we think have the potential to be excellent contributors, let’s look at how we’ve been measuring humans and see if we are evaluating the right things.
In this Sensu Summit 2018 talk, Lee Briggs, Senior Infrastructure Engineer at Apptio, discusses how to monitor Kubernetes components and applications using the classic Sensu components.
He covers some of the tricks you can use when monitoring Kubernetes resources and cluster components, the kinds of things you should and shouldn't monitor with Sensu at this stage, and some of the lessons learned along the way.
In this talk from Sensu Summit 2018, Garrett Honeycutt showcases the Puppet module: its current state, its support for Sensu 2.0, highlights from community contributions, and how you can contribute. You’ll see the Vagrant setup and how, even if you don’t use Puppet, you can easily get Sensu up and running on a bunch of different platforms.
In this talk, David Schroeder, Software Engineer at Viasat, describes how Ansible is used to configure and deploy Sensu for multiple teams, how much autonomy is granted to each one, and where the bottlenecks are.
In this talk from Doximity's Ben Abrams, you'll learn why alert fatigue is dangerous and how we can solve for it (using Sensu features like check hooks, round robin subscriptions, auto remediation, and more).
Chef's Julian Dunn & Fletcher Nichol give you a primer on promise theory and the autonomous actor model that underlies the design of products like Sensu and Habitat, and explain why it leads not only to higher overall system reliability but also to better human comprehension and easier operations.
In this talk from Sensu Summit 2018, Greg Poirier walks you through assets in Sensu 2.0, our ideas about how assets may impact the community plugins, and what we have planned for Sensu Enterprise integrations as assets.
Chris Chandler, Principal Engineer at T-Mobile, discusses times where deploying the client is either sub-optimal (e.g., a box you don’t control) or impossible (e.g., appliances, 3rd party services), and how he tackled these challenges serverless-style.
As a home brewer, Sensu Customer Success Engineer Aaron Sachs has often found himself unfortunately surprised when he reaches the end of a keg. In this talk, he covers how he changed that using a Raspberry Pi and Sensu Core.
Christopher J. Caillouet shares how Sensu has become a critical component to keeping the modern visual effects studio of Industrial Light & Magic in the business of creating the beautiful movies of our world and realizing the dreams we all enjoy on the big screen.
In this talk from Sensu Summit 2018, David Beaurpere, Principal Software Engineer for the Observability Group at Workday Ltd, discusses how Sensu 1.x evolved from a Nagios replacement to the backbone of monitoring data collection and transport at Workday.
In this Sensu Summit 2018 talk, CTO Sean Porter shares Sensu's beliefs as a company and community, how far we’ve come in the past year, the problems we face together, and what we’re doing next to solve for them.
A story of how Sensu will evolve in its next iteration. It explores a long history of loving Sensu, how it can and will evolve with the industry, why the Go programming language is a natural fit for our next version, and the human side of software design.
The Sensu origin story was born out of an operational view of a world where infrastructure and software delivery were shifting towards automation and public cloud. In this talk Sean will explain how a very similar trend is happening now, and what this means for the future of the Sensu project (and the monitoring industry).
After years of being a Sensu user, Ben Abrams started to contribute back. Since then he's become one of the maintainers of Sensu Plugins and Extensions. This talk will demystify the role of maintainers and also talk to all Sensu users about when and where they can contribute.
SendGrid's Sean Kilgore shares insights into a challenging situation the SendGrid operations team found themselves in, and how they leveraged Sensu's metrics collection and routing capability to collect ad-hoc telemetry data from their daemons.
This talk explores how the versatility of Sensu helped the ICF team overcome challenges that are unique to early growth in a large-scale infrastructure (one that's heavily invested in automation tooling workflows), and what's next.
GoDaddy has Sensu monitoring over 35,000 clients (!), including physical servers, public and private cloud compute instances, and containers. In this talk Michael and Thomas will share lessons learned and the challenges that remain.
This is the story of a journey, a long and challenging journey in which a small, bootstrapped consulting firm has taken Sensu from humble beginnings as an open source software project, to an open core platform for a burgeoning commercial product (Sensu Enterprise).
Sometimes the whole is more important than any individual part. This talk will describe how Tubular uses Sensu and the Elastic stack to create aggregated monitors that alert based on a group of Sensu checks.
In this talk, T-Mobile's Chris Chandler covers how he quickly stood up a working proof of concept that used Sensu to register services in Consul via Consul's REST APIs, update the state of those services, dynamically drive subscriptions, and alert for any hosts that did not have services registered.
One of the most useful features of Sensu is the client socket, which allows you to pipe events into Sensu on an ad-hoc basis. This talk will introduce you to sensu-wrapper, a Golang tool which makes this process much easier, plus use cases and examples.
Jeff will tell the story about how GE has scaled Sensu Enterprise using systems automation and configuration management tools to enable automated monitoring across the PaaS and SaaS products that make up the GE Predix Industrial Cloud platform.