Related Topics: @DevOpsSummit, Microservices Expo, Linux Containers, Open Source Cloud, @CloudExpo, Apache, FinTech Journal

@DevOpsSummit: Blog Post

Infographic: DevOps Toolkit By @Logentries | @DevOpsSummit [#DevOps]

The top eight must-have technologies for your modern IT and DevOps toolkit

Infographic: The Modern IT and DevOps Toolkit

Over the past year I reckon I have spoken to more than a thousand Developers/IT Os/DevOps folk through customer calls, demos of Logentries, at conferences such as Velocity, DevOpsDays, AWS re:Invent as well as a bunch of other more low key meetups across US and Europe.

Naturally, one of the first questions I tend to ask is: "hey what do you use for logging?"

Quickly followed by: "What other tools do you use?"

Below is a list of tools I frequently come across (note: this is not exhaustive) that I see making up today's modern IT and Dev Ops toolkit.

The Modern IT & DevOps Toolkit?

This is how I think about the modern day IT and Dev management tools that are critical to supporting the distributed and complex environments, applications, and diverse end users today...

Below is an outline of each key category of tools that  need to be in your toolkit, and the leading technologies to consider as you build the toolkit that best supports your team and your organization.

You'll notice a lot of SaasS services; over the past few years people have really started to move away from the on-prem, "roll your own" solutions, and some of the old dinosaurs that tried to provide everything in one box (think Tivoli, Splunk...). Instead they are taking advantage of more specific cloud-based services that are more flexible, require less investment up front, and practically zero management.

Here they are. The top eight must-have technologies for YOUR modern IT and DevOps toolkit:

(Click infographic to enlarge)


Share this Infographic On Your Site

Configuration/Automation: Because it is so easy to spin up and down server instances these days, organizations will regularly have 100's or 1000's of instances associated with a specific app or set of services. Furthermore, one thing I have noticed over the past year is the number of organizations with autoscaling in place.

Through 2012/2013 a lot of people were talking about autoscaling, but this year I've noticed a big increase in the percentage of organizations are actually utilizing it. Large and dynamic environments call for orchestration and automation tools such as Chef,Puppet and AnsibleVagrant is a complementary tool which also allows you to easily manage your development environments which is also a common fixture.

Server Monitoring: Keeping an eye on server resource usage and performance metrics has long been a common practice. However, the tool set has moved away from solutions that were traditionally installed on premise (e.g. nagios, Solarwinds' server monitoring) to more lightweight SAAS services that require very little effort to configure and maintain.

Cloudwatch is Amazon's monitoring service which will give you insight into metrics on your service instances and other AWS services and is a very popular choice across the AWS community. Datadog is another popular (SAAS) service that allows you to easily collect all your server and application metrics in once place and can plug into your application components to retrieve metrics as well as other SaaS services and any existing monitoring tools you have in place. Other common SAAS tools for server monitoring include ServerDensity and ScoutApp.

Log Management & Analytics: Logs are important for a range of activities including developer troubleshooting, monitoring production systems, real-time alerting, customer support, application usage analytics... the list goes on. In fact being particularly well positioned to talk about logging use cases :) we see logging use cases limited only by the type of data you choose to log.

Logs have begun to provide very simple way to perform "risk-free analytics";  you do not have to invest heavily in application instrumentation or an expensive BI tool to start to get immediate insights and visibility into your application behavior.

That being said, often the primary reason for organizations requiring a log management solution to to centralize their logs so Development and Ops teams can easily access log data from hundreds or thousands of instances in a single location, without having to manually log into individual boxes.

The open source logging tool of choice tends to be Elasticsearch Logstash Kibana (commonly known as ELK). While this is a great open source tool of choice, as soon as log volumes grow maintenance of ELK can become painful and expensive, and organizations tend to look for a commercial SaaS solution. Organizations with big budgets, dedicated data scientists at hand and time and energy to invest in educating their users have traditionally looked at Splunk for their log management solution. However organizations are frequently looking for a more lightweight, cost effective, and easier to use technologies and without the need to break the bank.

Logentries is a real time log management technology designed for the cloud with more than 35,000 global users. It also provides a unique unlimited logging technology which allows you to send as much data as you like and decide dynamically what data you want to analyze immediately, and what data you route to cold storage for on demand analysis. This can reduce logging costs to a fraction of traditional solutions.

Incident Alert Management: As is evident from this post, the Modern DevOps Toolkit will regularly consist of a number of different lightweight tools that are used side by side rather than one large monolithic solutions of days past. As such, alerting can be a bit of a nightmare with alerts firing from different end points, which can potentially result in a lot of noise.

Furthermore, managing on-call schedules and what team member should get different alerts can also be challenging, especially as teams grow in size. Tools like PagerDuty andVictorOps have been designed to take the pain out of incident alerts and provide a range of capabilities to allow you to filter important alerts from the noise and to route them to the correct team members at the right time.

Data Visualization: Devops teams using lots of different tools to manage their environment often require a centralized dashboard to view and correlate data from different sources. For example almost every tech company office you walk into these days has a number of flat screens containing key performance indicators for the entire team to keep an eye on. Technologies like GeckoboardLibrato, and Graphite (or Hosted Graphite for those not wanting to maintain their own Graphite deployment) are some of the more popular operations dashboards used by DevOps teams today.

Real-Time Messaging: Chat clients are used across almost all organizations for real time comms and have been around for quite some time. Hipchat has likely been the most popular such tool among the Dev and Ops community, with it's nice integrations with Jira and Github as well as its ability to ingest alerts from your different monitoring tools. Slack seems to be the new kid on the block, and I'd personally rank as one of the fastest adopted tools I've come across. Expect slack to be everywhere in 2015 ... if not there already.

APM/End User Experience: Application performance monitoring (APM) is a key technology for developers wanting to optimize and manage the performance of their apps. For Ops teams concerned with overall user experience, APM can give insight into what is happening in your production environments and can be dynamically tuned to provide more information on demand, so that they do not have a constant performance impact on your running systems.

I spent time building APM solutions 10 years ago where they were all on premise solutions that you downloaded, deployed and managed in house. Today, they have largely moved to the cloud with the likes of Smartbear AlertSiteNew RelicAppNeta andAppDynamics leading the charge.

Health Checks: While most server monitoring, logging and APM choices have all moved to the cloud, I still regularly hear that recursive acronym ... NAGIOS ain't gonna insist on sainthood. DevOps teams still have a love/hate relationship with this old reliable, where the common feeling is that while not pretty, but it does the job. When I come across NAGIOS these days it tends to be in the context of health checks for vital services to make sure they are ‘Up', and is usually complemented with tools that provide a deeper dive if investigation is required (e.g. New Relic, Logentries, Cloudwatch...).

Alternatively, health checks are also commonly performed using Pingdom, which gives you a pretty coarse grained view of service uptime and downtime. Another alternative is to perform health checks at the log level, e.g. by using inactivity alerting to get notified when expected behaviors do not behave as expected....

Tell us what you think?

Above is an overview and categorization of the most popular Dev and Ops tools we have regularly come across 2014 as we have engaged with the Logentries Community across customer calls, meetups and conferences. This list is by no mean exhaustive and is our view into what we see as the modern IT and Dev Ops toolkit. Let us know how it lines up with what you see or if you think we are missing any thing?

More Stories By Trevor Parsons

Trevor Parsons is Chief Scientist and Co-founder of Logentries. Trevor has over 10 years experience in enterprise software and, in particular, has specialized in developing enterprise monitoring and performance tools for distributed systems. He is also a research fellow at the Performance Engineering Lab Research Group and was formerly a Scientist at the IBM Center for Advanced Studies. Trevor holds a PhD from University College Dublin, Ireland.

Latest Stories
DXWorldEXPO LLC announced today that Dez Blanchfield joined the faculty of CloudEXPO's "10-Year Anniversary Event" which will take place on November 11-13, 2018 in New York City. Dez is a strategic leader in business and digital transformation with 25 years of experience in the IT and telecommunications industries developing strategies and implementing business initiatives. He has a breadth of expertise spanning technologies such as cloud computing, big data and analytics, cognitive computing, m...
Digital Transformation and Disruption, Amazon Style - What You Can Learn. Chris Kocher is a co-founder of Grey Heron, a management and strategic marketing consulting firm. He has 25+ years in both strategic and hands-on operating experience helping executives and investors build revenues and shareholder value. He has consulted with over 130 companies on innovating with new business models, product strategies and monetization. Chris has held management positions at HP and Symantec in addition to ...
Cloud-enabled transformation has evolved from cost saving measure to business innovation strategy -- one that combines the cloud with cognitive capabilities to drive market disruption. Learn how you can achieve the insight and agility you need to gain a competitive advantage. Industry-acclaimed CTO and cloud expert, Shankar Kalyana presents. Only the most exceptional IBMers are appointed with the rare distinction of IBM Fellow, the highest technical honor in the company. Shankar has also receive...
DXWorldEXPO LLC announced today that Kevin Jackson joined the faculty of CloudEXPO's "10-Year Anniversary Event" which will take place on November 11-13, 2018 in New York City. Kevin L. Jackson is a globally recognized cloud computing expert and Founder/Author of the award winning "Cloud Musings" blog. Mr. Jackson has also been recognized as a "Top 100 Cybersecurity Influencer and Brand" by Onalytica (2015), a Huffington Post "Top 100 Cloud Computing Experts on Twitter" (2013) and a "Top 50 C...
Daniel Jones is CTO of EngineerBetter, helping enterprises deliver value faster. Previously he was an IT consultant, indie video games developer, head of web development in the finance sector, and an award-winning martial artist. Continuous Delivery makes it possible to exploit findings of cognitive psychology and neuroscience to increase the productivity and happiness of our teams.
There is a huge demand for responsive, real-time mobile and web experiences, but current architectural patterns do not easily accommodate applications that respond to events in real time. Common solutions using message queues or HTTP long-polling quickly lead to resiliency, scalability and development velocity challenges. In his session at 21st Cloud Expo, Ryland Degnan, a Senior Software Engineer on the Netflix Edge Platform team, will discuss how by leveraging a reactive stream-based protocol,...
Enterprises have taken advantage of IoT to achieve important revenue and cost advantages. What is less apparent is how incumbent enterprises operating at scale have, following success with IoT, built analytic, operations management and software development capabilities - ranging from autonomous vehicles to manageable robotics installations. They have embraced these capabilities as if they were Silicon Valley startups.
The standardization of container runtimes and images has sparked the creation of an almost overwhelming number of new open source projects that build on and otherwise work with these specifications. Of course, there's Kubernetes, which orchestrates and manages collections of containers. It was one of the first and best-known examples of projects that make containers truly useful for production use. However, more recently, the container ecosystem has truly exploded. A service mesh like Istio addr...
As DevOps methodologies expand their reach across the enterprise, organizations face the daunting challenge of adapting related cloud strategies to ensure optimal alignment, from managing complexity to ensuring proper governance. How can culture, automation, legacy apps and even budget be reexamined to enable this ongoing shift within the modern software factory? In her Day 2 Keynote at @DevOpsSummit at 21st Cloud Expo, Aruna Ravichandran, VP, DevOps Solutions Marketing, CA Technologies, was jo...
Predicting the future has never been more challenging - not because of the lack of data but because of the flood of ungoverned and risk laden information. Microsoft states that 2.5 exabytes of data are created every day. Expectations and reliance on data are being pushed to the limits, as demands around hybrid options continue to grow.
Business professionals no longer wonder if they'll migrate to the cloud; it's now a matter of when. The cloud environment has proved to be a major force in transitioning to an agile business model that enables quick decisions and fast implementation that solidify customer relationships. And when the cloud is combined with the power of cognitive computing, it drives innovation and transformation that achieves astounding competitive advantage.
Poor data quality and analytics drive down business value. In fact, Gartner estimated that the average financial impact of poor data quality on organizations is $9.7 million per year. But bad data is much more than a cost center. By eroding trust in information, analytics and the business decisions based on these, it is a serious impediment to digital transformation.
Digital Transformation: Preparing Cloud & IoT Security for the Age of Artificial Intelligence. As automation and artificial intelligence (AI) power solution development and delivery, many businesses need to build backend cloud capabilities. Well-poised organizations, marketing smart devices with AI and BlockChain capabilities prepare to refine compliance and regulatory capabilities in 2018. Volumes of health, financial, technical and privacy data, along with tightening compliance requirements by...
Andrew Keys is Co-Founder of ConsenSys Enterprise. He comes to ConsenSys Enterprise with capital markets, technology and entrepreneurial experience. Previously, he worked for UBS investment bank in equities analysis. Later, he was responsible for the creation and distribution of life settlement products to hedge funds and investment banks. After, he co-founded a revenue cycle management company where he learned about Bitcoin and eventually Ethereal. Andrew's role at ConsenSys Enterprise is a mul...
"NetApp is known as a data management leader but we do a lot more than just data management on-prem with the data centers of our customers. We're also big in the hybrid cloud," explained Wes Talbert, Principal Architect at NetApp, in this SYS-CON.tv interview at 21st Cloud Expo, held Oct 31 – Nov 2, 2017, at the Santa Clara Convention Center in Santa Clara, CA.