
What’s Better Than Rehydrated Microwave Pizza?

Time-based alert grouping, that’s what. And today, we’re proud to announce that time-based alert grouping is now available on all PagerDuty standard accounts.  

Futuristic Technology vs Practical Technology

I remember being amazed and inspired by the futuristic technology in Back to the Future Part II. I’d argue the title should have been: Back to the Future: But this time, let’s actually go to the Future. While the first movie nostalgically travels to a town 30 years in the past, the second movie explores what society and technology would look like 30 years into the future.

Most people remember flying cars, rehydrating microwave pizza, and 3-D movie sharks that jump out and eat you in the street. I am partial to more practical technology: self-lacing sneakers. Self-lacing sneakers solve a real problem with a simple and elegant solution. Your shoes will never be too tight or too loose. You’ll never trip on a loose shoelace with self-lacing sneakers.

Time-based alert grouping is like self-lacing sneakers. If you’ve tried to solve a problem during an “alert storm,” you know that triage can be impeded by superfluous alerts generated by redundant monitoring tools. What is triage? Triage is when you know something’s wrong, and you have to gather information to help you make a decision and take action. Triage is painful because every minute of downtime is expensive and stressful.

How can time-based alert grouping help you and your team triage during incident response?

  • Improve the signal-to-noise ratio: All alerts within a specified time period on a service automatically group into an incident that represents the full span of the issue instead of letting each alert create a new incident with no other context.
  • Reduce alert fatigue: Instead of 100 incidents with one alert each, your team can manage 10 incidents with 10 alerts each, for example. Imagine receiving 10 notifications (phone calls, text messages, emails) instead of 100.
  • Collect relevant information in one place: Grouping alerts on a service over a period of time captures the changing state of the incident itself in one rich incident object instead of distributed over many separate incidents.
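
To make the idea concrete, here is a minimal sketch of time-window grouping in Python. It is an illustration only, not PagerDuty's implementation: the `Alert` and `Incident` classes and the `group_alerts` function are hypothetical names, and the rule that the window is measured from the first alert of an incident is an assumption made for this example.

```python
from dataclasses import dataclass, field
from datetime import datetime, timedelta


@dataclass
class Alert:
    service: str
    summary: str
    received_at: datetime


@dataclass
class Incident:
    service: str
    opened_at: datetime
    alerts: list = field(default_factory=list)


def group_alerts(alerts, window):
    """Group alerts per service by time window (hypothetical rule).

    An alert joins the latest incident on its service if it arrives less
    than `window` after that incident opened; otherwise it opens a new one.
    """
    incidents = []
    latest = {}  # service name -> most recently opened incident
    for alert in sorted(alerts, key=lambda a: a.received_at):
        current = latest.get(alert.service)
        if current is None or alert.received_at - current.opened_at >= window:
            current = Incident(alert.service, alert.received_at)
            incidents.append(current)
            latest[alert.service] = current
        current.alerts.append(alert)
    return incidents


# An alert storm: 100 alerts on one service, arriving 12 seconds apart.
start = datetime(2018, 1, 1)
storm = [
    Alert("checkout", "High error rate", start + timedelta(seconds=12 * i))
    for i in range(100)
]

incidents = group_alerts(storm, window=timedelta(minutes=2))
print(len(incidents))                         # 10 incidents instead of 100
print({len(i.alerts) for i in incidents})     # {10}: ten alerts in each
```

With a two-minute window, the 100 alerts in this made-up storm collapse into 10 incidents of 10 alerts each, so a responder notified once per incident would receive 10 notifications rather than 100.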

One of our customers suggested the following:

“We should be able to roll up to a single alert rather than getting paged (literally) hundreds of times…”

If you’ve ever struggled with redundant incidents notifying you 10 times an hour during an outage, you know what I mean. If you’ve force-resolved dozens of incidents cluttering up your incident list because they all have the exact same title and came in within seconds of each other, you know what I mean. Time-based alert grouping may not have the pizzazz of a 3-dimensional shark, but it solves a real problem in a simple way. Like sneakers that automatically adapt to your needs.

At PagerDuty Summit, Dominic Marion from NBCNews Digital and I hosted a breakout session called AI and Machine Learning Get Real. In it, Dominic talked about how his team of NOC responders streamlined their response process during “alert floods.” Over 40 of their critical business services are configured to use time-based alert grouping to help his team focus on triaging and responding to incidents. During the customer beta, his team prevented thousands of redundant incidents from ever being created.

Now you, too, can enjoy the benefits of time-based alert grouping for your team. Automatic alert grouping is designed to help you triage a problem effectively and calmly, reducing noise when you need to focus most.  

“Often the things that we monitor go down in groups, which creates multiple incidents, which causes the same team to get multiple text messages, emails, phone calls, etc.”

Now you can configure a service to automatically group alerts with a predictable cadence — options range from grouping by 2 minutes to 24 hours. When alerts group into incidents, your team will be notified at the start of an incident and can continue to monitor how the incident changes over time. Time-based alert grouping allows responders to resolve issues efficiently instead of spending time reacting to redundant alarms. And when a routine issue turns into a major issue, it’s now easy to centralize response around a single, rich incident — looping in responders, notifying stakeholders, and driving every stage of the response (assess, resolve, learn) from a single source of truth.

Your feedback, like the feedback mentioned throughout this post, continues to shape our roadmap. We’re excited to develop features like time-based alert grouping that we hope will solve some of your biggest pains.

Time-based alert grouping is simple, powerful, and available today (read more on the support site here). We’re continuing to develop intelligent alert grouping in a limited customer preview — if you’re interested, please fill out this form.

If you have feedback on time-based alert grouping or are interested in the customer preview for intelligent alert grouping, please let us know in the Community!

The post What’s Better Than Rehydrated Microwave Pizza? appeared first on PagerDuty.


