Blog Feed Post

Cloud infrastructure monitoring checklist: Are you covered?

Recently I got a glimpse into one of our latest customer’s cloud migration story, and how they got their cloud infrastructure monitoring needs covered. They happen to be one of the biggest industrial companies worldwide, actually.

The company recently implemented a cloud-first initiative. Accordingly, they migrated their previously outsourced enterprise business critical applications into an internally managed AWS environment with a “lift-and-shift” approach. Their new environment, made up of hundreds of hosts and very diverse technologies in the AWS cloud, triggered the need for:

What we experience here at Dynatrace is that the need for an elastic and fully scalable cloud infrastructure increases, as cloud-native apps increasingly become the standard for companies of all sizes wanting to create better customer experiences.

No wonder that early this year IDC  forecasted that:

spending on off-premises cloud IT infrastructure will experience a five-year compound annual growth rate (CAGR) of 14.2%, reaching $48.1 billion in 2020.

But what we also experience is that once in the cloud, these companies quickly realize that in this new environment their traditional infrastructure monitoring approach no longer works.

A few questions to consider

Are you implementing a cloud-based infrastructure for your business-critical apps, similarly to the company in my introduction? Whether it’s on AWS, Azure, Google CloudOpenStack or CloudFoundry, you might want to consider the following questions before starting to monitor it with a bunch of different tools.

1. How easy is the solution to implement, configure, and maintain?

With the increasingly complex environments of today’s applications, ease of implementation and ease of use become more than just nice-to-haves — they are essential.

Traditional monitoring solutions require too much manual instrumentation and configuration – a reason why most companies today are only monitoring 5 to 10 percent of their applications.

My recommendation: look out for a monitoring tool that has already embraced the power of automation. This means auto-discovery of your cloud environment, auto-baselining, or even automatic root cause analysis.

2. Does it provide real-time insights into the health of your cloud resources?

Whether you choose to run a public, private, or hybrid cloud, virtualize your datacenter, or simply deploy your applications to CloudFoundry, it should be a basic expectation from every monitoring solution to give you the complete, real-time picture of health of your entire cloud-based architecture.

Do you have your containers under control? What about your load balancers? And about your hypervisor dynamics? There are just so many moving parts in a cloud infrastructure that makes it difficult to identify the underlying cause of aberrant system behavior.

Choose a cloud monitoring solution that has been built from the ground up with dynamic environments in mind. They can eliminate all blind spots and can keep up with any changes of the dynamic environments.

3. Does it provide full stack application performance monitoring, or only firefighting capabilities at infrastructure level?

Even though a solid cloud infrastructure is the backbone of any successful business, at the end of the day it’s all about the applications. And if they fail, users can be cruel.

Your applications may span many technology tiers, and components from the cloud through the back-end data center and mainframe. To get a full stack view of all your applications, you will need the ability to monitor from different perspectives:

  • Digital Experience Analytics
  • Application Performance Management
  • Cloud and Infrastructure Monitoring

If you care about your apps, I recommend that you choose a unified monitoring tool that provides a holistic view of not only your cloud infrastructure, but also of your applications running on it.

4. How fast it lets you find the root cause of an issue?

What’s one of the biggest obstacles plaguing your IT teams? If it’s alert overload, you are not alone.

Companies still often use different monitoring tools to look at datacenters, hosts, processes and services. When any of these components fail or slow down, it can trigger a chain reaction of hundreds of other failures, leaving IT teams drowning in a sea of alerts. Tools with traditional alerting approach leave you with countless metrics and charts, but then it’s up to you to correlate those metrics to determine what is really happening.

The solution? Using a tool that gives you causation instead of correlation.

If a monitoring tool can capture every transaction all the time and uses a tagging approach across every remoting call, it gives the performance engineer causation based data, which gives them confidence and hard facts on what is causing system problems. Being able to point the Dev team directly to the root cause is priceless when time, money and your business reputation is on the line.

5. How does the solution handle performance baselining for ultra-dynamic environments?

Setting up performance baselines is another tricky part in cloud infrastructure monitoring. It can involve a lot of time-consuming and potentially error-prone manual effort with traditional APM—especially because most of them rely on averages and transaction samples to determine normal performance.

Averages are ineffective because they mask underlying issues by “flattening” performance spikes and dips. Sampling lets performance issues slip through the cracks—creating false negatives.

If you want to effectively baseline your cloud infrastructure’s performance, look for a tool that uses percentiles based on 100% gap-free data. Looking at percentiles (median and slowest 10%) tells you what’s really going on: how most users are actually experiencing your application and site.

6. Does it offer built-in log monitoring, or needs additional tool?

Remember the company I presented in my introduction? One of their key requirements was that log management and log analytics are built-in features. Quite understandably: being able to monitor application performance and analyze related process log files using the same tool helps their DevOps, Development, and QA teams to perform their jobs quickly and efficiently.

If log analytics is also an important part of your monitoring process, choose a solution that has this feature built in. Having a direct access to all log content related to your mission-critical processes extends your monitoring reach well beyond traditional APM data sources.

7. Will the monitoring solution scale with your business needs?

The last, but not least important feature you should look for in a monitoring tool is its ability to scale with your business.

Modern cloud environments run thousands of nodes with hundreds of technologies, distributed across datacenters around the globe. You can keep deploying more and more monitoring tools for each silo to ensure the system limits are not reached, but soon questions like these will come up:

  • How far will this scale?
  • How long until I‘ll need a newer, faster, or bigger one?

Picking a monitoring solution that gives you real-time insights into your cloud components is important, but ensuring that it will not crash and burn as you expand your environment is crucial. Therefore, look for a tool that was built with large application environments in mind and therefore scales to any size.

“The value that you get from Dynatrace is almost instant”

Watch the video below to see how Dynatrace helps Citrix reduce cloud resources, as well as time spent on troubleshooting issues.

Wrapping it up

Today’s digital businesses are under more pressure than ever to do things faster, smarter, and more effectively. This is doubly true for companies who run customer facing applications. It basically depends on their technology if they win or lose the war on the battlefield of customer experience. And the trend shows that the winners already implemented a digital transformation strategy – which might as well include a cloud-first initiative and migrating to a cloud-based infrastructure.

However, being there is not enough. The complex architecture and the countless moving parts of a cloud ecosystem require modern monitoring capabilities. Why monitor a modern cloud architecture with a bunch of different, outdated tools? That would detract you from the benefits for which you migrated to the cloud in the first place.

This is what the company described in the intro realized – and, as of today, they are developing new cloud-native applications on their own, deploying these into their cloud environments, and monitoring them happily with Dynatrace.

The post Cloud infrastructure monitoring checklist: Are you covered? appeared first on Dynatrace blog – monitoring redefined.

Read the original blog entry...

More Stories By Dynatrace Blog

Building a revolutionary approach to software performance monitoring takes an extraordinary team. With decades of combined experience and an impressive history of disruptive innovation, that’s exactly what we ruxit has.

Get to know ruxit, and get to know the future of data analytics.

Latest Stories
Andrew Keys is Co-Founder of ConsenSys Enterprise. He comes to ConsenSys Enterprise with capital markets, technology and entrepreneurial experience. Previously, he worked for UBS investment bank in equities analysis. Later, he was responsible for the creation and distribution of life settlement products to hedge funds and investment banks. After, he co-founded a revenue cycle management company where he learned about Bitcoin and eventually Ethereal. Andrew's role at ConsenSys Enterprise is a mul...
DXWorldEXPO LLC announced today that "Miami Blockchain Event by FinTechEXPO" has announced that its Call for Papers is now open. The two-day event will present 20 top Blockchain experts. All speaking inquiries which covers the following information can be submitted by email to [email protected] Financial enterprises in New York City, London, Singapore, and other world financial capitals are embracing a new generation of smart, automated FinTech that eliminates many cumbersome, slow, and expe...
Evan Kirstel is an internationally recognized thought leader and social media influencer in IoT (#1 in 2017), Cloud, Data Security (2016), Health Tech (#9 in 2017), Digital Health (#6 in 2016), B2B Marketing (#5 in 2015), AI, Smart Home, Digital (2017), IIoT (#1 in 2017) and Telecom/Wireless/5G. His connections are a "Who's Who" in these technologies, He is in the top 10 most mentioned/re-tweeted by CMOs and CIOs (2016) and have been recently named 5th most influential B2B marketeer in the US. H...
DXWorldEXPO | CloudEXPO are the world's most influential, independent events where Cloud Computing was coined and where technology buyers and vendors meet to experience and discuss the big picture of Digital Transformation and all of the strategies, tactics, and tools they need to realize their goals. Sponsors of DXWorldEXPO | CloudEXPO benefit from unmatched branding, profile building and lead generation opportunities.
The best way to leverage your Cloud Expo presence as a sponsor and exhibitor is to plan your news announcements around our events. The press covering Cloud Expo and @ThingsExpo will have access to these releases and will amplify your news announcements. More than two dozen Cloud companies either set deals at our shows or have announced their mergers and acquisitions at Cloud Expo. Product announcements during our show provide your company with the most reach through our targeted audiences.
DevOpsSummit New York 2018, colocated with CloudEXPO | DXWorldEXPO New York 2018 will be held November 11-13, 2018, in New York City. Digital Transformation (DX) is a major focus with the introduction of DXWorldEXPO within the program. Successful transformation requires a laser focus on being data-driven and on using all the tools available that enable transformation if they plan to survive over the long term. A total of 88% of Fortune 500 companies from a generation ago are now out of bus...
With 10 simultaneous tracks, keynotes, general sessions and targeted breakout classes, @CloudEXPO and DXWorldEXPO are two of the most important technology events of the year. Since its launch over eight years ago, @CloudEXPO and DXWorldEXPO have presented a rock star faculty as well as showcased hundreds of sponsors and exhibitors! In this blog post, we provide 7 tips on how, as part of our world-class faculty, you can deliver one of the most popular sessions at our events. But before reading...
Cloud Expo | DXWorld Expo have announced the conference tracks for Cloud Expo 2018. Cloud Expo will be held June 5-7, 2018, at the Javits Center in New York City, and November 6-8, 2018, at the Santa Clara Convention Center, Santa Clara, CA. Digital Transformation (DX) is a major focus with the introduction of DX Expo within the program. Successful transformation requires a laser focus on being data-driven and on using all the tools available that enable transformation if they plan to survive ov...
DXWordEXPO New York 2018, colocated with CloudEXPO New York 2018 will be held November 11-13, 2018, in New York City and will bring together Cloud Computing, FinTech and Blockchain, Digital Transformation, Big Data, Internet of Things, DevOps, AI, Machine Learning and WebRTC to one location.
DXWorldEXPO LLC announced today that ICOHOLDER named "Media Sponsor" of Miami Blockchain Event by FinTechEXPO. ICOHOLDER give you detailed information and help the community to invest in the trusty projects. Miami Blockchain Event by FinTechEXPO has opened its Call for Papers. The two-day event will present 20 top Blockchain experts. All speaking inquiries which covers the following information can be submitted by email to [email protected] Miami Blockchain Event by FinTechEXPO also offers s...
@DevOpsSummit New York 2018, colocated with CloudEXPO | DXWorldEXPO New York 2018 will be held November 11-13, 2018, in New York City. From showcase success stories from early adopters and web-scale businesses, DevOps is expanding to organizations of all sizes, including the world's largest enterprises - and delivering real results.
The dynamic nature of the cloud means that change is a constant when it comes to modern cloud-based infrastructure. Delivering modern applications to end users, therefore, is a constantly shifting challenge. Delivery automation helps IT Ops teams ensure that apps are providing an optimal end user experience over hybrid-cloud and multi-cloud environments, no matter what the current state of the infrastructure is. To employ a delivery automation strategy that reflects your business rules, making r...
Dion Hinchcliffe is an internationally recognized digital expert, bestselling book author, frequent keynote speaker, analyst, futurist, and transformation expert based in Washington, DC. He is currently Chief Strategy Officer at the industry-leading digital strategy and online community solutions firm, 7Summits.
"We started a Master of Science in business analytics - that's the hot topic. We serve the business community around San Francisco so we educate the working professionals and this is where they all want to be," explained Judy Lee, Associate Professor and Department Chair at Golden Gate University, in this SYS-CON.tv interview at 21st Cloud Expo, held Oct 31 – Nov 2, 2017, at the Santa Clara Convention Center in Santa Clara, CA.
Digital Transformation and Disruption, Amazon Style - What You Can Learn. Chris Kocher is a co-founder of Grey Heron, a management and strategic marketing consulting firm. He has 25+ years in both strategic and hands-on operating experience helping executives and investors build revenues and shareholder value. He has consulted with over 130 companies on innovating with new business models, product strategies and monetization. Chris has held management positions at HP and Symantec in addition to ...