Welcome!

Blog Feed Post

Cloud infrastructure monitoring checklist: Are you covered?

Recently I got a glimpse into one of our latest customer’s cloud migration story, and how they got their cloud infrastructure monitoring needs covered. They happen to be one of the biggest industrial companies worldwide, actually.

The company recently implemented a cloud-first initiative. Accordingly, they migrated their previously outsourced enterprise business critical applications into an internally managed AWS environment with a “lift-and-shift” approach. Their new environment, made up of hundreds of hosts and very diverse technologies in the AWS cloud, triggered the need for:

What we experience here at Dynatrace is that the need for an elastic and fully scalable cloud infrastructure increases, as cloud-native apps increasingly become the standard for companies of all sizes wanting to create better customer experiences.

No wonder that early this year IDC  forecasted that:

spending on off-premises cloud IT infrastructure will experience a five-year compound annual growth rate (CAGR) of 14.2%, reaching $48.1 billion in 2020.

But what we also experience is that once in the cloud, these companies quickly realize that in this new environment their traditional infrastructure monitoring approach no longer works.

A few questions to consider

Are you implementing a cloud-based infrastructure for your business-critical apps, similarly to the company in my introduction? Whether it’s on AWS, Azure, Google CloudOpenStack or CloudFoundry, you might want to consider the following questions before starting to monitor it with a bunch of different tools.

1. How easy is the solution to implement, configure, and maintain?

With the increasingly complex environments of today’s applications, ease of implementation and ease of use become more than just nice-to-haves — they are essential.

Traditional monitoring solutions require too much manual instrumentation and configuration – a reason why most companies today are only monitoring 5 to 10 percent of their applications.

My recommendation: look out for a monitoring tool that has already embraced the power of automation. This means auto-discovery of your cloud environment, auto-baselining, or even automatic root cause analysis.

2. Does it provide real-time insights into the health of your cloud resources?

Whether you choose to run a public, private, or hybrid cloud, virtualize your datacenter, or simply deploy your applications to CloudFoundry, it should be a basic expectation from every monitoring solution to give you the complete, real-time picture of health of your entire cloud-based architecture.

Do you have your containers under control? What about your load balancers? And about your hypervisor dynamics? There are just so many moving parts in a cloud infrastructure that makes it difficult to identify the underlying cause of aberrant system behavior.

Choose a cloud monitoring solution that has been built from the ground up with dynamic environments in mind. They can eliminate all blind spots and can keep up with any changes of the dynamic environments.

3. Does it provide full stack application performance monitoring, or only firefighting capabilities at infrastructure level?

Even though a solid cloud infrastructure is the backbone of any successful business, at the end of the day it’s all about the applications. And if they fail, users can be cruel.

Your applications may span many technology tiers, and components from the cloud through the back-end data center and mainframe. To get a full stack view of all your applications, you will need the ability to monitor from different perspectives:

  • Digital Experience Analytics
  • Application Performance Management
  • Cloud and Infrastructure Monitoring

If you care about your apps, I recommend that you choose a unified monitoring tool that provides a holistic view of not only your cloud infrastructure, but also of your applications running on it.

4. How fast it lets you find the root cause of an issue?

What’s one of the biggest obstacles plaguing your IT teams? If it’s alert overload, you are not alone.

Companies still often use different monitoring tools to look at datacenters, hosts, processes and services. When any of these components fail or slow down, it can trigger a chain reaction of hundreds of other failures, leaving IT teams drowning in a sea of alerts. Tools with traditional alerting approach leave you with countless metrics and charts, but then it’s up to you to correlate those metrics to determine what is really happening.

The solution? Using a tool that gives you causation instead of correlation.

If a monitoring tool can capture every transaction all the time and uses a tagging approach across every remoting call, it gives the performance engineer causation based data, which gives them confidence and hard facts on what is causing system problems. Being able to point the Dev team directly to the root cause is priceless when time, money and your business reputation is on the line.

5. How does the solution handle performance baselining for ultra-dynamic environments?

Setting up performance baselines is another tricky part in cloud infrastructure monitoring. It can involve a lot of time-consuming and potentially error-prone manual effort with traditional APM—especially because most of them rely on averages and transaction samples to determine normal performance.

Averages are ineffective because they mask underlying issues by “flattening” performance spikes and dips. Sampling lets performance issues slip through the cracks—creating false negatives.

If you want to effectively baseline your cloud infrastructure’s performance, look for a tool that uses percentiles based on 100% gap-free data. Looking at percentiles (median and slowest 10%) tells you what’s really going on: how most users are actually experiencing your application and site.

6. Does it offer built-in log monitoring, or needs additional tool?

Remember the company I presented in my introduction? One of their key requirements was that log management and log analytics are built-in features. Quite understandably: being able to monitor application performance and analyze related process log files using the same tool helps their DevOps, Development, and QA teams to perform their jobs quickly and efficiently.

If log analytics is also an important part of your monitoring process, choose a solution that has this feature built in. Having a direct access to all log content related to your mission-critical processes extends your monitoring reach well beyond traditional APM data sources.

7. Will the monitoring solution scale with your business needs?

The last, but not least important feature you should look for in a monitoring tool is its ability to scale with your business.

Modern cloud environments run thousands of nodes with hundreds of technologies, distributed across datacenters around the globe. You can keep deploying more and more monitoring tools for each silo to ensure the system limits are not reached, but soon questions like these will come up:

  • How far will this scale?
  • How long until I‘ll need a newer, faster, or bigger one?

Picking a monitoring solution that gives you real-time insights into your cloud components is important, but ensuring that it will not crash and burn as you expand your environment is crucial. Therefore, look for a tool that was built with large application environments in mind and therefore scales to any size.

“The value that you get from Dynatrace is almost instant”

Watch the video below to see how Dynatrace helps Citrix reduce cloud resources, as well as time spent on troubleshooting issues.

Wrapping it up

Today’s digital businesses are under more pressure than ever to do things faster, smarter, and more effectively. This is doubly true for companies who run customer facing applications. It basically depends on their technology if they win or lose the war on the battlefield of customer experience. And the trend shows that the winners already implemented a digital transformation strategy – which might as well include a cloud-first initiative and migrating to a cloud-based infrastructure.

However, being there is not enough. The complex architecture and the countless moving parts of a cloud ecosystem require modern monitoring capabilities. Why monitor a modern cloud architecture with a bunch of different, outdated tools? That would detract you from the benefits for which you migrated to the cloud in the first place.

This is what the company described in the intro realized – and, as of today, they are developing new cloud-native applications on their own, deploying these into their cloud environments, and monitoring them happily with Dynatrace.

The post Cloud infrastructure monitoring checklist: Are you covered? appeared first on Dynatrace blog – monitoring redefined.

Read the original blog entry...

More Stories By Dynatrace Blog

Building a revolutionary approach to software performance monitoring takes an extraordinary team. With decades of combined experience and an impressive history of disruptive innovation, that’s exactly what we ruxit has.

Get to know ruxit, and get to know the future of data analytics.

Latest Stories
Mobile device usage has increased exponentially during the past several years, as consumers rely on handhelds for everything from news and weather to banking and purchases. What can we expect in the next few years? The way in which we interact with our devices will fundamentally change, as businesses leverage Artificial Intelligence. We already see this taking shape as businesses leverage AI for cost savings and customer responsiveness. This trend will continue, as AI is used for more sophistica...
"When we talk about cloud without compromise what we're talking about is that when people think about 'I need the flexibility of the cloud' - it's the ability to create applications and run them in a cloud environment that's far more flexible,” explained Matthew Finnie, CTO of Interoute, in this SYS-CON.tv interview at 20th Cloud Expo, held June 6-8, 2017, at the Javits Center in New York City, NY.
"NetApp's vision is how we help organizations manage data - delivering the right data in the right place, in the right time, to the people who need it, and doing it agnostic to what the platform is," explained Josh Atwell, Developer Advocate for NetApp, in this SYS-CON.tv interview at 20th Cloud Expo, held June 6-8, 2017, at the Javits Center in New York City, NY.
SYS-CON Events announced today that SourceForge has been named “Media Sponsor” of SYS-CON's 21st International Cloud Expo, which will take place on Oct 31 – Nov 2, 2017, at the Santa Clara Convention Center in Santa Clara, CA. SourceForge is the largest, most trusted destination for Open Source Software development, collaboration, discovery and download on the web serving over 32 million viewers, 150 million downloads and over 460,000 active development projects each and every month.
What You Need to Know You know you need the cloud, but you’re hesitant to simply dump everything at Amazon since you know that not all workloads are suitable for cloud. You know that you want the kind of ease of use and scalability that you get with public cloud, but your applications are architected in a way that makes the public cloud a non-starter. You’re looking at private cloud solutions based on hyperconverged infrastructure, but you’re concerned with the limits inherent in those technolog...
SYS-CON Events announced today that DXWorldExpo has been named “Global Sponsor” of SYS-CON's 21st International Cloud Expo, which will take place on Oct 31 – Nov 2, 2017, at the Santa Clara Convention Center in Santa Clara, CA. Digital Transformation is the key issue driving the global enterprise IT business. Digital Transformation is most prominent among Global 2000 enterprises and government institutions.
One of the biggest challenges with adopting a DevOps mentality is: new applications are easily adapted to cloud-native, microservice-based, or containerized architectures - they can be built for them - but old applications need complex refactoring. On the other hand, these new technologies can require relearning or adapting new, oftentimes more complex, methodologies and tools to be ready for production. In his general session at @DevOpsSummit at 20th Cloud Expo, Chris Brown, Solutions Marketi...
SYS-CON Events announced today that Nihon Micron will exhibit at the Japan External Trade Organization (JETRO) Pavilion at SYS-CON's 21st International Cloud Expo®, which will take place on Oct 31 – Nov 2, 2017, at the Santa Clara Convention Center in Santa Clara, CA. Nihon Micron Co., Ltd. strives for technological innovation to establish high-density, high-precision processing technology for providing printed circuit board and metal mount RFID tags used for communication devices. For more inf...
SYS-CON Events announced today that Ryobi Systems will exhibit at the Japan External Trade Organization (JETRO) Pavilion at SYS-CON's 21st International Cloud Expo®, which will take place on Oct 31 – Nov 2, 2017, at the Santa Clara Convention Center in Santa Clara, CA. Ryobi Systems Co., Ltd., as an information service company, specialized in business support for local governments and medical industry. We are challenging to achive the precision farming with AI. For more information, visit http:...
SYS-CON Events announced today that mruby Forum will exhibit at the Japan External Trade Organization (JETRO) Pavilion at SYS-CON's 21st International Cloud Expo®, which will take place on Oct 31 – Nov 2, 2017, at the Santa Clara Convention Center in Santa Clara, CA. mruby is the lightweight implementation of the Ruby language. We introduce mruby and the mruby IoT framework that enhances development productivity. For more information, visit http://forum.mruby.org/.
SYS-CON Events announced today that Mobile Create USA will exhibit at the Japan External Trade Organization (JETRO) Pavilion at SYS-CON's 21st International Cloud Expo®, which will take place on Oct 31 – Nov 2, 2017, at the Santa Clara Convention Center in Santa Clara, CA. Mobile Create USA Inc. is an MVNO-based business model that uses portable communication devices and cellular-based infrastructure in the development, sales, operation and mobile communications systems incorporating GPS capabi...
SYS-CON Events announced today that Keisoku Research Consultant Co. will exhibit at the Japan External Trade Organization (JETRO) Pavilion at SYS-CON's 21st International Cloud Expo®, which will take place on Oct 31 – Nov 2, 2017, at the Santa Clara Convention Center in Santa Clara, CA. Keisoku Research Consultant, Co. offers research and consulting in a wide range of civil engineering-related fields from information construction to preservation of cultural properties. For more information, vi...
SYS-CON Events announced today that MIRAI Inc. will exhibit at the Japan External Trade Organization (JETRO) Pavilion at SYS-CON's 21st International Cloud Expo®, which will take place on Oct 31 – Nov 2, 2017, at the Santa Clara Convention Center in Santa Clara, CA. MIRAI Inc. are IT consultants from the public sector whose mission is to solve social issues by technology and innovation and to create a meaningful future for people.
SYS-CON Events announced today that Daiya Industry will exhibit at the Japan External Trade Organization (JETRO) Pavilion at SYS-CON's 21st International Cloud Expo®, which will take place on Oct 31 – Nov 2, 2017, at the Santa Clara Convention Center in Santa Clara, CA. Daiya Industry specializes in orthotic support systems and assistive devices with pneumatic artificial muscles in order to contribute to an extended healthy life expectancy. For more information, please visit https://www.daiyak...
SYS-CON Events announced today that Fusic will exhibit at the Japan External Trade Organization (JETRO) Pavilion at SYS-CON's 21st International Cloud Expo®, which will take place on Oct 31 – Nov 2, 2017, at the Santa Clara Convention Center in Santa Clara, CA. Fusic Co. provides mocks as virtual IoT devices. You can customize mocks, and get any amount of data at any time in your test. For more information, visit https://fusic.co.jp/english/.