The Diverse Alerting Needs for Application Performance Monitoring

In today’s digital economy, most business services rely on IT applications. The increasing dependency on applications has resulted in the growing adoption of application performance monitoring (APM) solutions. The goals of an APM solution are:

  • To ensure high application uptime, service reliability and a great end-user experience.
  • To proactively diagnose performance problems so the respective stakeholder (application owner, IT Ops, DevOps, developer, etc.) can fix them before users notice them.

Modern APM solutions must not only have deep monitoring functionality, but they must also be able to provide actionable intelligence to simplify an administrator’s job in finding and fixing an application problem. Alerts on performance deviations, errors, warnings, bottlenecks, etc. are essential requirements for an APM tool. But the requirements of enterprise IT teams have expanded beyond this to also include context-aware alerting for fast and smart resolution of problems.

  • Business-aware alerting: As IT infrastructures have evolved to be multi-tiered with inter-dependencies between tiers, monitoring in silos is no longer sufficient. Service owners need to know when the service is impacted, and so an APM tool should embed the intelligence to discover application topologies, and these topologies should, in turn, provide an admin with business-aware, service-level alerts. To support this, when any application component failures are detected, the states of all services that depend on the affected components should reflect the performance problem. This way, when users contact the helpdesk to report problems with the service, the helpdesk staff can quickly determine whether the complaint relates to a known problem with the service or not.
  • Root cause alerting: Determining the root cause of an application slowdown is one of the most difficult tasks for IT operations teams. Again, application inter-dependencies and infrastructure inter-dependencies make root cause alerting very difficult. A problem in one tier can ripple and affect several others. For root cause alerting, APM tools must consider inter-application and application-to-infrastructure dependencies. For example, a web application may be slow because of slow query processing in the backend database. In turn, the database server may be running on a storage device where one of the RAID array disks has failed and is limiting the throughput the device can support. Therefore, the database queries issued by the application are taking extra time. In this scenario, an APM tool should highlight the root cause (i.e., the storage device issue), and indicate all effects (i.e., database server slowness and application slowness). Accurate root cause alerting results in improved user satisfaction and higher service uptime. It also enables IT operations staff to spend less time fire-fighting problems, and enhances operations productivity.

Root Cause Alert
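
To make the web application, database and storage example concrete, here is a minimal Python sketch of dependency-aware root cause classification. It is an illustration only, not how any particular APM product works: the `DEPENDS_ON` map, the component names and the `classify_alerts` function are all hypothetical, and the rule used (an unhealthy component is a root cause only if nothing it depends on is also unhealthy) is a simplifying assumption.

```python
# Hypothetical dependency map: each component lists what it depends on.
DEPENDS_ON = {
    "web_app": ["database"],
    "database": ["storage"],
    "storage": [],
}


def classify_alerts(depends_on, unhealthy):
    """Split unhealthy components into root causes and effects.

    Assumption: a component is a root cause when none of the components it
    depends on (directly or transitively) is itself unhealthy.
    """
    def has_unhealthy_dependency(component, seen):
        for dep in depends_on.get(component, []):
            if dep in seen:  # guard against dependency cycles
                continue
            seen.add(dep)
            if dep in unhealthy or has_unhealthy_dependency(dep, seen):
                return True
        return False

    roots, effects = [], []
    for component in sorted(unhealthy):
        if has_unhealthy_dependency(component, set()):
            effects.append(component)
        else:
            roots.append(component)
    return roots, effects


roots, effects = classify_alerts(DEPENDS_ON, {"web_app", "database", "storage"})
print("root cause:", roots)
print("effects:", effects)
```

With all three tiers reporting problems, the sketch flags only the storage device as the root cause and lists the database and web application as effects. The same dependency map could also drive business-aware alerting, by marking every service that depends on a failed component as impacted.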

  • Aggregated alerting on farm-wide metrics: Large infrastructures have many servers in a farm/cluster. An administrator may only need to be alerted when, for example, four out of six web servers are facing connection spikes. This provides the actionable intelligence that is needed to determine when additional servers should be added to support the growing connection load. More complex conditions across multiple servers should also be supported. For example, an administrator may want to be alerted when 25% of servers are reporting CPU utilization above 80%. Such farm-wide alerts help administrators understand the health and capacity requirements of the entire farm (rather than just individual servers/nodes).
  • Composite alerting: Management-level reports must present simplified views of performance, instead of only detailed metrics. For example, consider a CIO who is interested in knowing if the user experience of a core virtual desktop service is good or not. There are many factors that affect the user experience, including a user’s logon time, application launch time, screen refresh latency, bandwidth availability, etc. While an IT operations person is interested in the details, the CIO is not. The CIO is only looking for the overall user experience. APM tools must offer composite alerting functionality to simplify executive-level reporting. A composite alert is the collective representation of the state of multiple metrics. By assigning weights to the different metrics (e.g., logons happen less frequently and so may carry a lower weight than screen refresh latency) and using a weighted average method, a composite rating is obtained: a simplified percentage value indicating user experience. Examples of composite alerts include user experience, Apdex score, stress level for servers, etc.
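
The farm-wide condition mentioned above (alert when 25% of servers report CPU utilization above 80%) can be sketched in a few lines of Python. The function name `farm_alert`, its parameters and the server names are hypothetical, and reading "25% of servers" as "at least 25%" is an assumption made for this sketch.

```python
def farm_alert(cpu_by_server, cpu_threshold=80.0, min_fraction=0.25):
    """Return True when at least min_fraction of servers exceed cpu_threshold.

    cpu_by_server maps a server name to its current CPU utilization (percent).
    """
    if not cpu_by_server:
        return False  # an empty farm cannot breach the condition
    over = [name for name, cpu in cpu_by_server.items() if cpu > cpu_threshold]
    return len(over) / len(cpu_by_server) >= min_fraction


# Hypothetical farm of eight web servers, two of them above 80% CPU.
farm = {
    "web1": 92.0, "web2": 85.5, "web3": 40.0, "web4": 35.0,
    "web5": 61.0, "web6": 20.0, "web7": 55.0, "web8": 48.0,
}
print("farm-wide CPU alert:", farm_alert(farm))
```

A single hot server does not trigger this alert; only a breach across a meaningful share of the farm does, which is what makes it a capacity signal rather than a per-node one.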

Composite Alert
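
The weighted average behind a composite rating can be illustrated with a short Python sketch. Everything specific here is an assumption made for illustration: the metric names, the per-metric scores (each already normalized to a 0-100 scale) and the weights are invented, and real tools may normalize and weight metrics differently.

```python
def composite_rating(scores, weights):
    """Weighted average of per-metric scores (each 0-100), as a percentage."""
    total_weight = sum(weights[name] for name in scores)
    if total_weight <= 0:
        raise ValueError("weights must sum to a positive value")
    return sum(scores[name] * weights[name] for name in scores) / total_weight


# Hypothetical per-metric health scores for a virtual desktop service.
scores = {
    "logon_time": 60.0,
    "app_launch_time": 80.0,
    "screen_refresh_latency": 90.0,
    "bandwidth": 100.0,
}
# Hypothetical weights: logons are infrequent, so they count for less.
weights = {
    "logon_time": 1,
    "app_launch_time": 2,
    "screen_refresh_latency": 4,
    "bandwidth": 3,
}

rating = composite_rating(scores, weights)
print(f"user experience rating: {rating:.1f}%")
```

Here a poor logon score pulls the overall rating down only slightly, because the more frequently exercised metrics carry more weight.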

  • Situation-aware dynamic baseline alerting: Manually adjusting alert thresholds for every performance metric is challenging. Because usage follows trends, different alert thresholds are needed at different times of the day and on different days of the week. A threshold that makes sense during the day, when an application server carries a high workload, should not be applied unchanged to low-workload periods at night or over the weekend. The best practice for determining these alert thresholds is to baseline application and infrastructure performance. Some APM tools use artificial intelligence and machine learning to auto-baseline the infrastructure and dynamically determine alert thresholds. This is critical: unless situational awareness is built into the APM solution, there would certainly be false positives for administrators, making their job more difficult.

Baseline Alert
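
As a rough illustration of time-aware baselining, the Python sketch below derives a separate threshold for each (weekday, hour) slot from historical samples. It uses a plain mean plus k standard deviations as a simple statistical stand-in, not the AI/ML techniques some APM tools use. The function names, the value of `k` and the sample data are all hypothetical.

```python
from collections import defaultdict
from statistics import mean, pstdev


def build_baseline(samples, k=3.0):
    """Build per-slot thresholds from (weekday, hour, value) samples.

    Returns {(weekday, hour): mean + k * standard deviation}.
    """
    buckets = defaultdict(list)
    for weekday, hour, value in samples:
        buckets[(weekday, hour)].append(value)
    return {slot: mean(vals) + k * pstdev(vals) for slot, vals in buckets.items()}


def is_anomalous(baseline, weekday, hour, value):
    """True when value exceeds the learned threshold for that time slot."""
    threshold = baseline.get((weekday, hour))
    return threshold is not None and value > threshold


# Hypothetical CPU history: busy Monday afternoons, quiet Saturday nights.
history = [
    (0, 14, 70.0), (0, 14, 75.0), (0, 14, 80.0),  # Monday 14:00
    (5, 2, 10.0), (5, 2, 12.0), (5, 2, 14.0),     # Saturday 02:00
]
baseline = build_baseline(history)

print("60% on Monday 14:00 anomalous?", is_anomalous(baseline, 0, 14, 60.0))
print("60% on Saturday 02:00 anomalous?", is_anomalous(baseline, 5, 2, 60.0))
```

The same 60% reading is unremarkable during the Monday peak but stands out on a quiet Saturday night, which is the behavior a single static threshold cannot express.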

eG Enterprise is an end-to-end application performance monitoring solution that includes all these comprehensive, intelligent alerting capabilities that help IT and business stakeholders get actionable insights for effective troubleshooting and decision-making. With out-of-the-box monitoring support for over 180 applications (Java, SAP, SharePoint, Citrix, PeopleSoft, etc.), eG Enterprise tracks health, availability and performance of all aspects of your business-critical applications and helps with proactive problem diagnosis and root cause analysis.

The post The Diverse Alerting Needs for Application Performance Monitoring appeared first on eG Innovations.

More Stories By Vinod Mohan

Vinod Mohan is a Senior Product Marketing Manager for eG Innovations, a global provider of unified performance monitoring and root-cause diagnosis solutions for virtual, physical and cloud IT infrastructures. He has 10 years of experience in product, technology and solution marketing of IT software and services spanning application performance management, network, systems, virtualization, storage, IT security and IT service management (ITSM).

Previously, he was a Senior Product Marketing Manager at SolarWinds for server and application monitoring software. Now a key team member for eG Innovations, he is a contributing author for the eG Innovations blog, "Application & Virtualization Performance Insights", along with other trade publications including APMdigest, DABCC, Cyber Defense Magazine, IT Briefcase, Infosec Island, The Hacker News, IT Pro Portal, and SolarWinds THWACK community.
