
AWS Lambda monitoring done right

“Serverless”, also known as Function as a Service (FaaS), is the hot new technology everybody is talking about these days, and AWS Lambda is currently the most popular offering in this space.

Evolution from traditional IT to Function as a Service

Of course, FaaS is not a silver bullet and it doesn’t supersede everything that’s been here before.
I’d rather say that the various ways to slice and dice IT infrastructure are orthogonal and FaaS is a viable addition that solves a specific set of problems.

Tradeoff between operational simplicity and versatility in modern IT operations

We saw growing demand among our customers starting in spring 2017, so we began research on extending Dynatrace to make AWS Lambda monitoring seamless, with no code changes required.

This post covers the basics of “Serverless” computing and how to monitor this new addition to your tool belt. For an in-depth treatment of serverless computing, I recommend the article by Mike Roberts on martinfowler.com.

What is Serverless and Function as a Service (FaaS)?

‘Serverless’ is more of a marketing term and a bit misleading. Of course, there are still applications running on servers. But while a PaaS environment gives you control over these applications (e.g. a Node.js application running on Heroku), FaaS abstracts away infrastructure, application, and request routing, and lets you focus on just writing code that is executed when a predefined event (like an incoming web request) occurs.

AWS Lambda Function triggered via API Gateway
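
To make the "just writing code" part concrete, here is a minimal sketch of a Python function behind API Gateway. It assumes the Lambda proxy integration event shape (a `queryStringParameters` field that may be null) and the `statusCode`/`headers`/`body` response shape; the `name` parameter and the greeting are purely illustrative.

```python
import json


def lambda_handler(event, context):
    """Entry point that Lambda invokes once per incoming API Gateway request."""
    # With the proxy integration, the field is null when no query string is sent.
    params = event.get("queryStringParameters") or {}
    name = params.get("name", "world")  # "name" is a made-up parameter

    # API Gateway turns this dictionary into the HTTP response.
    return {
        "statusCode": 200,
        "headers": {"Content-Type": "application/json"},
        "body": json.dumps({"message": f"Hello, {name}!"}),
    }
```

Note that the function contains no server, routing, or process management code; all of that is handled by the platform.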

Why enterprises are using FaaS

Technically, serverless functions are microservices, and the reasoning behind breaking down monoliths is well known. Interestingly, we see customers using FaaS even before a real cloud migration takes place. Because of the low entry barrier, factoring out some logic into functions (think fetching data from an external API through a Lambda function) is easy and low-risk.
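
As a rough illustration of that "fetch data from an external API" case, the sketch below uses only the Python standard library. The URL `https://api.example.com/rates` and the `fetch` parameter are hypothetical; the latter exists only so the handler can be exercised without network access.

```python
import json
import urllib.request

API_URL = "https://api.example.com/rates"  # hypothetical endpoint


def fetch_json(url, timeout=3.0):
    """Fetch a URL and decode the response body as JSON."""
    with urllib.request.urlopen(url, timeout=timeout) as resp:
        return json.loads(resp.read().decode("utf-8"))


def lambda_handler(event, context, fetch=fetch_json):
    """Proxy the external API; report a 502 if the upstream call fails."""
    try:
        data = fetch(API_URL)
    except OSError as exc:
        # urllib's URLError and socket timeouts are both OSError subclasses.
        return {"statusCode": 502, "body": json.dumps({"error": str(exc)})}
    return {"statusCode": 200, "body": json.dumps(data)}
```

The short timeout is a deliberate choice: a function that waits on a hanging upstream keeps running, and is billed, until it returns or the platform stops it.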

Enter AWS Lambda

As of today, AWS Lambda is the most popular FaaS platform, and consequently supporting it was our first priority.

In a nutshell, a Lambda function is a piece of user-provided code running in an application on a dedicated container on AWS. Of course, there is a lot more going on behind the scenes. For example, Lambda functions are autoscaled and load balanced, and there are triggers for many use cases besides web requests: a function can be triggered by data inserted into a database or by an event coming from an IoT device.

Simplified schema of AWS lambda
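
Since one function can sit behind different triggers, a handler sometimes has to tell them apart by the shape of the event. The sketch below assumes DynamoDB Streams as the database trigger (records carrying `eventSource` set to `aws:dynamodb` and an `eventName` such as `INSERT`) and API Gateway proxy events carrying `httpMethod`. Anything else, such as a payload forwarded from an IoT rule, falls into a catch-all. The labels and the `classify_event` helper are made up for illustration.

```python
def classify_event(event):
    """Return a coarse label for the kind of trigger that produced `event`."""
    records = event.get("Records") or []
    if records and records[0].get("eventSource") == "aws:dynamodb":
        return "database"
    if "httpMethod" in event:
        return "web_request"
    # IoT rule payloads have no fixed envelope, so they end up here.
    return "device_or_other"


def lambda_handler(event, context):
    """Report which trigger fired and, for database events, count inserts."""
    kind = classify_event(event)
    if kind == "database":
        inserted = [r for r in event["Records"] if r.get("eventName") == "INSERT"]
        return {"trigger": kind, "inserted": len(inserted)}
    return {"trigger": kind}
```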

Why you need end-to-end monitoring for AWS Lambda

As with every larger application, and even more so with microservices, end-to-end monitoring is needed to get full visibility into all tiers of your stack. Think of a Lambda function that calls a slow API. This not only extends the runtime of your function, which adds cost; it also affects the overall performance of your application.

Service Flow of an IoT Application including Lambda
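
The cost effect of a slow downstream API can be estimated with simple arithmetic, because Lambda compute is billed by duration multiplied by configured memory (GB-seconds). The sketch below ignores per-request charges, free tiers, and billing granularity, and the price is a placeholder rather than a current AWS rate; the invocation count and durations are invented inputs.

```python
def compute_cost(invocations, duration_s, memory_mb, price_per_gb_second):
    """Estimate the compute charge as GB-seconds times a per-GB-second price."""
    gb_seconds = invocations * duration_s * (memory_mb / 1024)
    return gb_seconds * price_per_gb_second


PRICE = 0.00002  # placeholder price per GB-second, not an official figure

# Same function and traffic; only the time spent waiting on the API differs.
fast = compute_cost(1_000_000, 0.2, 512, PRICE)
slow = compute_cost(1_000_000, 1.2, 512, PRICE)
print(f"fast API: {fast:.2f}, slow API: {slow:.2f}")
```

With these inputs, one extra second of waiting per invocation makes the slow case six times as expensive as the fast one, even though the function's own code did not change.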

In conclusion, Lambda monitoring helps you to

  • optimize response time hotspots
  • optimize timing hotspots
  • simplify error analytics
  • understand and optimize your architecture

How to monitor AWS Lambda functions

Out of the box, Dynatrace comes with the full set of CloudWatch metrics.

AWS CloudWatch Metrics in Dynatrace
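
For readers who want to see what these raw metrics look like, the following sketch queries CloudWatch directly with boto3. It is not how Dynatrace collects them. It assumes the `AWS/Lambda` namespace with a `FunctionName` dimension and metric names such as `Duration` or `Errors`; the function name `my-function` and both helpers are hypothetical, and `fetch_metric` needs boto3 installed plus valid AWS credentials.

```python
from datetime import datetime, timedelta, timezone


def lambda_metric_query(function_name, metric_name, minutes=60, period=300):
    """Build keyword arguments for CloudWatch's get_metric_statistics call."""
    end = datetime.now(timezone.utc)
    return {
        "Namespace": "AWS/Lambda",
        "MetricName": metric_name,
        "Dimensions": [{"Name": "FunctionName", "Value": function_name}],
        "StartTime": end - timedelta(minutes=minutes),
        "EndTime": end,
        "Period": period,  # seconds per datapoint
        "Statistics": ["Average", "Maximum"],
    }


def fetch_metric(function_name, metric_name):
    """Return the datapoints for one metric, oldest first."""
    import boto3  # imported lazily so the query builder works without the SDK

    cloudwatch = boto3.client("cloudwatch")
    query = lambda_metric_query(function_name, metric_name)
    response = cloudwatch.get_metric_statistics(**query)
    return sorted(response["Datapoints"], key=lambda point: point["Timestamp"])
```

Calling `fetch_metric("my-function", "Duration")` would return per-period averages and maxima, but nothing about which downstream call caused a slow invocation.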

By deploying the Dynatrace agent to Lambda, we additionally provide

  • End-to-end tracing to and from Lambda functions, without changing your code
  • AI-powered problem detection
  • Response times
  • Failure rates
  • Process metrics
  • Event loop metrics (for Node.js)

This functionality is configurable through your AWS Lambda dashboard.

PurePath of an AWS Lambda function
Process Details of a Lambda function


We are excited to roll out this new key feature to our customers soon. We will extend our closed EAP over the coming weeks, aiming for a GA release in Q1/2018.

Please reach out to me if you have any questions or want to join the EAP.
While waiting for the EAP, I suggest installing our free trial to familiarize yourself with Dynatrace.

The post AWS Lambda monitoring done right appeared first on Dynatrace blog – monitoring redefined.


More Stories By APM Blog

APM: It’s all about application performance, scalability, and architecture: best practices, lifecycle and DevOps, mobile and web, enterprise, user experience
