Welcome!

Related Topics: @DevOpsSummit, Microservices Expo, Linux Containers, Open Source Cloud, @CloudExpo, Apache, FinTech Journal

@DevOpsSummit: Blog Post

Infographic: DevOps Toolkit By @Logentries | @DevOpsSummit [#DevOps]

The top eight must-have technologies for your modern IT and DevOps toolkit

Infographic: The Modern IT and DevOps Toolkit

Over the past year I reckon I have spoken to more than a thousand Developers/IT Os/DevOps folk through customer calls, demos of Logentries, at conferences such as Velocity, DevOpsDays, AWS re:Invent as well as a bunch of other more low key meetups across US and Europe.

Naturally, one of the first questions I tend to ask is: "hey what do you use for logging?"

Quickly followed by: "What other tools do you use?"

Below is a list of tools I frequently come across (note: this is not exhaustive) that I see making up today's modern IT and Dev Ops toolkit.

The Modern IT & DevOps Toolkit?

This is how I think about the modern day IT and Dev management tools that are critical to supporting the distributed and complex environments, applications, and diverse end users today...

Below is an outline of each key category of tools that  need to be in your toolkit, and the leading technologies to consider as you build the toolkit that best supports your team and your organization.

You'll notice a lot of SaasS services; over the past few years people have really started to move away from the on-prem, "roll your own" solutions, and some of the old dinosaurs that tried to provide everything in one box (think Tivoli, Splunk...). Instead they are taking advantage of more specific cloud-based services that are more flexible, require less investment up front, and practically zero management.

Here they are. The top eight must-have technologies for YOUR modern IT and DevOps toolkit:

(Click infographic to enlarge)

Logentries-must-have-technologies-2015-it-devops-toolkit

Share this Infographic On Your Site

Configuration/Automation: Because it is so easy to spin up and down server instances these days, organizations will regularly have 100's or 1000's of instances associated with a specific app or set of services. Furthermore, one thing I have noticed over the past year is the number of organizations with autoscaling in place.

Through 2012/2013 a lot of people were talking about autoscaling, but this year I've noticed a big increase in the percentage of organizations are actually utilizing it. Large and dynamic environments call for orchestration and automation tools such as Chef,Puppet and AnsibleVagrant is a complementary tool which also allows you to easily manage your development environments which is also a common fixture.

Server Monitoring: Keeping an eye on server resource usage and performance metrics has long been a common practice. However, the tool set has moved away from solutions that were traditionally installed on premise (e.g. nagios, Solarwinds' server monitoring) to more lightweight SAAS services that require very little effort to configure and maintain.

Cloudwatch is Amazon's monitoring service which will give you insight into metrics on your service instances and other AWS services and is a very popular choice across the AWS community. Datadog is another popular (SAAS) service that allows you to easily collect all your server and application metrics in once place and can plug into your application components to retrieve metrics as well as other SaaS services and any existing monitoring tools you have in place. Other common SAAS tools for server monitoring include ServerDensity and ScoutApp.

Log Management & Analytics: Logs are important for a range of activities including developer troubleshooting, monitoring production systems, real-time alerting, customer support, application usage analytics... the list goes on. In fact being particularly well positioned to talk about logging use cases :) we see logging use cases limited only by the type of data you choose to log.

Logs have begun to provide very simple way to perform "risk-free analytics";  you do not have to invest heavily in application instrumentation or an expensive BI tool to start to get immediate insights and visibility into your application behavior.

That being said, often the primary reason for organizations requiring a log management solution to to centralize their logs so Development and Ops teams can easily access log data from hundreds or thousands of instances in a single location, without having to manually log into individual boxes.

The open source logging tool of choice tends to be Elasticsearch Logstash Kibana (commonly known as ELK). While this is a great open source tool of choice, as soon as log volumes grow maintenance of ELK can become painful and expensive, and organizations tend to look for a commercial SaaS solution. Organizations with big budgets, dedicated data scientists at hand and time and energy to invest in educating their users have traditionally looked at Splunk for their log management solution. However organizations are frequently looking for a more lightweight, cost effective, and easier to use technologies and without the need to break the bank.

Logentries is a real time log management technology designed for the cloud with more than 35,000 global users. It also provides a unique unlimited logging technology which allows you to send as much data as you like and decide dynamically what data you want to analyze immediately, and what data you route to cold storage for on demand analysis. This can reduce logging costs to a fraction of traditional solutions.

Incident Alert Management: As is evident from this post, the Modern DevOps Toolkit will regularly consist of a number of different lightweight tools that are used side by side rather than one large monolithic solutions of days past. As such, alerting can be a bit of a nightmare with alerts firing from different end points, which can potentially result in a lot of noise.

Furthermore, managing on-call schedules and what team member should get different alerts can also be challenging, especially as teams grow in size. Tools like PagerDuty andVictorOps have been designed to take the pain out of incident alerts and provide a range of capabilities to allow you to filter important alerts from the noise and to route them to the correct team members at the right time.

Data Visualization: Devops teams using lots of different tools to manage their environment often require a centralized dashboard to view and correlate data from different sources. For example almost every tech company office you walk into these days has a number of flat screens containing key performance indicators for the entire team to keep an eye on. Technologies like GeckoboardLibrato, and Graphite (or Hosted Graphite for those not wanting to maintain their own Graphite deployment) are some of the more popular operations dashboards used by DevOps teams today.

Real-Time Messaging: Chat clients are used across almost all organizations for real time comms and have been around for quite some time. Hipchat has likely been the most popular such tool among the Dev and Ops community, with it's nice integrations with Jira and Github as well as its ability to ingest alerts from your different monitoring tools. Slack seems to be the new kid on the block, and I'd personally rank as one of the fastest adopted tools I've come across. Expect slack to be everywhere in 2015 ... if not there already.

APM/End User Experience: Application performance monitoring (APM) is a key technology for developers wanting to optimize and manage the performance of their apps. For Ops teams concerned with overall user experience, APM can give insight into what is happening in your production environments and can be dynamically tuned to provide more information on demand, so that they do not have a constant performance impact on your running systems.

I spent time building APM solutions 10 years ago where they were all on premise solutions that you downloaded, deployed and managed in house. Today, they have largely moved to the cloud with the likes of Smartbear AlertSiteNew RelicAppNeta andAppDynamics leading the charge.

Health Checks: While most server monitoring, logging and APM choices have all moved to the cloud, I still regularly hear that recursive acronym ... NAGIOS ain't gonna insist on sainthood. DevOps teams still have a love/hate relationship with this old reliable, where the common feeling is that while not pretty, but it does the job. When I come across NAGIOS these days it tends to be in the context of health checks for vital services to make sure they are ‘Up', and is usually complemented with tools that provide a deeper dive if investigation is required (e.g. New Relic, Logentries, Cloudwatch...).

Alternatively, health checks are also commonly performed using Pingdom, which gives you a pretty coarse grained view of service uptime and downtime. Another alternative is to perform health checks at the log level, e.g. by using inactivity alerting to get notified when expected behaviors do not behave as expected....

Tell us what you think?

Above is an overview and categorization of the most popular Dev and Ops tools we have regularly come across 2014 as we have engaged with the Logentries Community across customer calls, meetups and conferences. This list is by no mean exhaustive and is our view into what we see as the modern IT and Dev Ops toolkit. Let us know how it lines up with what you see or if you think we are missing any thing?

More Stories By Trevor Parsons

Trevor Parsons is Chief Scientist and Co-founder of Logentries. Trevor has over 10 years experience in enterprise software and, in particular, has specialized in developing enterprise monitoring and performance tools for distributed systems. He is also a research fellow at the Performance Engineering Lab Research Group and was formerly a Scientist at the IBM Center for Advanced Studies. Trevor holds a PhD from University College Dublin, Ireland.

Latest Stories
Darktrace is the world's leading AI company for cyber security. Created by mathematicians from the University of Cambridge, Darktrace's Enterprise Immune System is the first non-consumer application of machine learning to work at scale, across all network types, from physical, virtualized, and cloud, through to IoT and industrial control systems. Installed as a self-configuring cyber defense platform, Darktrace continuously learns what is ‘normal' for all devices and users, updating its understa...
Codete accelerates their clients growth through technological expertise and experience. Codite team works with organizations to meet the challenges that digitalization presents. Their clients include digital start-ups as well as established enterprises in the IT industry. To stay competitive in a highly innovative IT industry, strong R&D departments and bold spin-off initiatives is a must. Codete Data Science and Software Architects teams help corporate clients to stay up to date with the mod...
There are many examples of disruption in consumer space – Uber disrupting the cab industry, Airbnb disrupting the hospitality industry and so on; but have you wondered who is disrupting support and operations? AISERA helps make businesses and customers successful by offering consumer-like user experience for support and operations. We have built the world’s first AI-driven IT / HR / Cloud / Customer Support and Operations solution.
ScaleMP is presenting at CloudEXPO 2019, held June 24-26 in Santa Clara, and we’d love to see you there. At the conference, we’ll demonstrate how ScaleMP is solving one of the most vexing challenges for cloud — memory cost and limit of scale — and how our innovative vSMP MemoryONE solution provides affordable larger server memory for the private and public cloud. Please visit us at Booth No. 519 to connect with our experts and learn more about vSMP MemoryONE and how it is already serving some of...
Platform9, the leader in SaaS-managed hybrid cloud, has announced it will present five sessions at four upcoming industry conferences in June: BCS in London, DevOpsCon in Berlin, HPE Discover and Cloud Computing Expo 2019.
At CloudEXPO Silicon Valley, June 24-26, 2019, Digital Transformation (DX) is a major focus with expanded DevOpsSUMMIT and FinTechEXPO programs within the DXWorldEXPO agenda. Successful transformation requires a laser focus on being data-driven and on using all the tools available that enable transformation if they plan to survive over the long term. A total of 88% of Fortune 500 companies from a generation ago are now out of business. Only 12% still survive. Similar percentages are found throug...
When you're operating multiple services in production, building out forensics tools such as monitoring and observability becomes essential. Unfortunately, it is a real challenge balancing priorities between building new features and tools to help pinpoint root causes. Linkerd provides many of the tools you need to tame the chaos of operating microservices in a cloud native world. Because Linkerd is a transparent proxy that runs alongside your application, there are no code changes required. I...
In his general session at 21st Cloud Expo, Greg Dumas, Calligo’s Vice President and G.M. of US operations, discussed the new Global Data Protection Regulation and how Calligo can help business stay compliant in digitally globalized world. Greg Dumas is Calligo's Vice President and G.M. of US operations. Calligo is an established service provider that provides an innovative platform for trusted cloud solutions. Calligo’s customers are typically most concerned about GDPR compliance, application p...
Modern software design has fundamentally changed how we manage applications, causing many to turn to containers as the new virtual machine for resource management. As container adoption grows beyond stateless applications to stateful workloads, the need for persistent storage is foundational - something customers routinely cite as a top pain point. In his session at @DevOpsSummit at 21st Cloud Expo, Bill Borsari, Head of Systems Engineering at Datera, explored how organizations can reap the bene...
"NetApp's vision is how we help organizations manage data - delivering the right data in the right place, in the right time, to the people who need it, and doing it agnostic to what the platform is," explained Josh Atwell, Developer Advocate for NetApp, in this SYS-CON.tv interview at 20th Cloud Expo, held June 6-8, 2017, at the Javits Center in New York City, NY.
Druva is the global leader in Cloud Data Protection and Management, delivering the industry's first data management-as-a-service solution that aggregates data from endpoints, servers and cloud applications and leverages the public cloud to offer a single pane of glass to enable data protection, governance and intelligence-dramatically increasing the availability and visibility of business critical information, while reducing the risk, cost and complexity of managing and protecting it. Druva's...
Kubernetes as a Container Platform is becoming a de facto for every enterprise. In my interactions with enterprises adopting container platform, I come across common questions: - How does application security work on this platform? What all do I need to secure? - How do I implement security in pipelines? - What about vulnerabilities discovered at a later point in time? - What are newer technologies like Istio Service Mesh bring to table?In this session, I will be addressing these commonly asked ...
BMC has unmatched experience in IT management, supporting 92 of the Forbes Global 100, and earning recognition as an ITSM Gartner Magic Quadrant Leader for five years running. Our solutions offer speed, agility, and efficiency to tackle business challenges in the areas of service management, automation, operations, and the mainframe.
Blockchain has shifted from hype to reality across many industries including Financial Services, Supply Chain, Retail, Healthcare and Government. While traditional tech and crypto organizations are generally male dominated, women have embraced blockchain technology from its inception. This is no more evident than at companies where women occupy many of the blockchain roles and leadership positions. Join this panel to hear three women in blockchain share their experience and their POV on the futu...
The Jevons Paradox suggests that when technological advances increase efficiency of a resource, it results in an overall increase in consumption. Writing on the increased use of coal as a result of technological improvements, 19th-century economist William Stanley Jevons found that these improvements led to the development of new ways to utilize coal. In his session at 19th Cloud Expo, Mark Thiele, Chief Strategy Officer for Apcera, compared the Jevons Paradox to modern-day enterprise IT, examin...