Welcome!

Related Topics: Open Source Cloud, Linux Containers, Agile Computing, Release Management , Ruby-On-Rails, Python

Open Source Cloud: Article

Headnet Improves Drupal Performance and Reliability with TraceView

Not only did Headnet find problems they couldn’t see before, but resolution time dropped

Headnet is a web consultancy firm based in Copenhagen, Demark. Headnet develops and hosts web applications, mainly for government clients, as well as private organizations and NGOs. The company utilizes a broad range of technologies to create custom sites, from CMS-based sites with Drupal and Plone to custom applications in Java and Python. As a one-stop shop for web apps, Headnet is entirely responsible for the creation, modification and upkeep of sites for its customers.

The Challenge
Until November 2012, Headnet relied on “old-school” monitoring tools, such as open-source infrastructure monitoring, to ensure the machines its applications ran on were up to the task. When a problem arose, sometimes they could add more RAM or replace a disk to fix the issue, but more often than not, it was an application issue. In many cases, Headnet engineers were left digging through application logs for days before identifying a resolution.

The Solution:
After installing TraceView, not only did the Headnet team find problems they couldn’t see before, but their resolution time dropped dramatically. “I showed TraceView to one of my colleagues, and he was like ‘Wow.’ It took literally minutes to see issues you could improve with it,” said Anton Stonor, Web Technology Lead at Headnet.

One of the first issues identified by Stonor and his team was a sporadic spike in latency. According to the infrastructure monitoring, there was no problem, and even the application itself seemed to be responding quickly. Looking at the slow requests directly revealed that Apache would occasionally think the backend was down, and those requests would stall in Apache until it re-established the connection. Stonor changed the Apache configuration to eliminate those slow requests, resulting in a much more consistent userexperience.

In another Drupal application, customers complained about a particular page taking two to three seconds to load, but only occasionally. Without a reproducible test case, Stonor’s team struggled to find a fix, as the problem could not be recreated in their environments. With TraceView, they immediately saw that the page in question was synchronously running a maintenance job before rendering the HTML. Moving the job into the background cut the time to consistently less than one second.

Throughout their projects, Stonor’s team found more subtle issues into which they previously had no visibility. In their SQL DBs, not only were there slow queries, but “there would be either way too many queries or [queries] would be fetching too-large data sets. In TraceView, you could see exactly the amount of queries and which one of them was taking a long time, and what code is to blame,” according to Stonor. TraceView also illuminated the impact of slow external calls. In one case, an API call to a reverse DNS lookup service was to blame, and Stonor’s team identifi d the problem in production immediately. “Something that may have taken days, weeks or months to find, we were able to find in just a matter of a few minutes,” said Stonor.

The Result:
Across Headnet’s projects, TraceView significantly improved the MTTR (Mean-Time-to-Resolution), not to mention the MTTPC (Mean-Time-to-Pretty-Chart). In the first two months with TraceView, Headnet dramatically reduced the average and worst-case load times on a number of sites, improving customer satisfaction and decreasing support incidents – all without taking significant time away from new development.

Related Articles

More Stories By TR Jordan

A veteran of MIT’s Lincoln Labs, TR is a reformed physicist and full-stack hacker – for some limited definition of full stack. After a few years as Software Development Lead with Thermopylae Science and Techology, he left to join Tracelytics as its first engineer. Following Tracelytics merger with AppNeta, TR was tapped to run all of its developer and market evangelism efforts. TR still harbors a not-so-secret love for Matlab-esque graphs and half-baked statistics, as well as elegant and highly-performant code. Read more of his articles at www.appneta.com/blog or visit www.appneta.com.

Latest Stories
With more than 30 Kubernetes solutions in the marketplace, it's tempting to think Kubernetes and the vendor ecosystem has solved the problem of operationalizing containers at scale or of automatically managing the elasticity of the underlying infrastructure that these solutions need to be truly scalable. Far from it. There are at least six major pain points that companies experience when they try to deploy and run Kubernetes in their complex environments. In this presentation, the speaker will d...
While DevOps most critically and famously fosters collaboration, communication, and integration through cultural change, culture is more of an output than an input. In order to actively drive cultural evolution, organizations must make substantial organizational and process changes, and adopt new technologies, to encourage a DevOps culture. Moderated by Andi Mann, panelists discussed how to balance these three pillars of DevOps, where to focus attention (and resources), where organizations might...
The deluge of IoT sensor data collected from connected devices and the powerful AI required to make that data actionable are giving rise to a hybrid ecosystem in which cloud, on-prem and edge processes become interweaved. Attendees will learn how emerging composable infrastructure solutions deliver the adaptive architecture needed to manage this new data reality. Machine learning algorithms can better anticipate data storms and automate resources to support surges, including fully scalable GPU-c...
When building large, cloud-based applications that operate at a high scale, it's important to maintain a high availability and resilience to failures. In order to do that, you must be tolerant of failures, even in light of failures in other areas of your application. "Fly two mistakes high" is an old adage in the radio control airplane hobby. It means, fly high enough so that if you make a mistake, you can continue flying with room to still make mistakes. In his session at 18th Cloud Expo, Le...
Machine learning has taken residence at our cities' cores and now we can finally have "smart cities." Cities are a collection of buildings made to provide the structure and safety necessary for people to function, create and survive. Buildings are a pool of ever-changing performance data from large automated systems such as heating and cooling to the people that live and work within them. Through machine learning, buildings can optimize performance, reduce costs, and improve occupant comfort by ...
As Cybric's Chief Technology Officer, Mike D. Kail is responsible for the strategic vision and technical direction of the platform. Prior to founding Cybric, Mike was Yahoo's CIO and SVP of Infrastructure, where he led the IT and Data Center functions for the company. He has more than 24 years of IT Operations experience with a focus on highly-scalable architectures.
CI/CD is conceptually straightforward, yet often technically intricate to implement since it requires time and opportunities to develop intimate understanding on not only DevOps processes and operations, but likely product integrations with multiple platforms. This session intends to bridge the gap by offering an intense learning experience while witnessing the processes and operations to build from zero to a simple, yet functional CI/CD pipeline integrated with Jenkins, Github, Docker and Azure...
The explosion of new web/cloud/IoT-based applications and the data they generate are transforming our world right before our eyes. In this rush to adopt these new technologies, organizations are often ignoring fundamental questions concerning who owns the data and failing to ask for permission to conduct invasive surveillance of their customers. Organizations that are not transparent about how their systems gather data telemetry without offering shared data ownership risk product rejection, regu...
René Bostic is the Technical VP of the IBM Cloud Unit in North America. Enjoying her career with IBM during the modern millennial technological era, she is an expert in cloud computing, DevOps and emerging cloud technologies such as Blockchain. Her strengths and core competencies include a proven record of accomplishments in consensus building at all levels to assess, plan, and implement enterprise and cloud computing solutions. René is a member of the Society of Women Engineers (SWE) and a m...
Dhiraj Sehgal works in Delphix's product and solution organization. His focus has been DevOps, DataOps, private cloud and datacenters customers, technologies and products. He has wealth of experience in cloud focused and virtualized technologies ranging from compute, networking to storage. He has spoken at Cloud Expo for last 3 years now in New York and Santa Clara.
Enterprises are striving to become digital businesses for differentiated innovation and customer-centricity. Traditionally, they focused on digitizing processes and paper workflow. To be a disruptor and compete against new players, they need to gain insight into business data and innovate at scale. Cloud and cognitive technologies can help them leverage hidden data in SAP/ERP systems to fuel their businesses to accelerate digital transformation success.
Containers and Kubernetes allow for code portability across on-premise VMs, bare metal, or multiple cloud provider environments. Yet, despite this portability promise, developers may include configuration and application definitions that constrain or even eliminate application portability. In this session we'll describe best practices for "configuration as code" in a Kubernetes environment. We will demonstrate how a properly constructed containerized app can be deployed to both Amazon and Azure ...
Poor data quality and analytics drive down business value. In fact, Gartner estimated that the average financial impact of poor data quality on organizations is $9.7 million per year. But bad data is much more than a cost center. By eroding trust in information, analytics and the business decisions based on these, it is a serious impediment to digital transformation.
Digital Transformation: Preparing Cloud & IoT Security for the Age of Artificial Intelligence. As automation and artificial intelligence (AI) power solution development and delivery, many businesses need to build backend cloud capabilities. Well-poised organizations, marketing smart devices with AI and BlockChain capabilities prepare to refine compliance and regulatory capabilities in 2018. Volumes of health, financial, technical and privacy data, along with tightening compliance requirements by...
Predicting the future has never been more challenging - not because of the lack of data but because of the flood of ungoverned and risk laden information. Microsoft states that 2.5 exabytes of data are created every day. Expectations and reliance on data are being pushed to the limits, as demands around hybrid options continue to grow.