Related Topics: @CloudExpo, @BigDataExpo, @ThingsExpo

@CloudExpo: Article

The #IoT and #Analytics | @ThingsExpo #BigData #BI #AI #MachineLearning

The Internet of Things promises to change everything by enabling “smart” environments and smart products

The Internet of Things (IoT) and Analytics at The Edge

The Internet of Things (IoT) promises to change everything by enabling “smart” environments (homes, cities, hospitals, schools, stores, etc.) and smart products (cars, trucks, airplanes, trains, wind turbines, lawnmowers, etc.). I recently wrote about the importance of moving beyond “connected” to “smart” in a blog titled “Internet of Things: Connected Does Not Equal Smart”. The article discusses the importance of moving beyond just collecting the data, to transitioning to leveraging this new wealth of IoT data to improve the decisions that these smart environments and products need to make: to help these environments and products to self-monitor, self-diagnose and eventually, self-direct.

But one of the key concepts in enabling this transition from connected to smart is the ability to perform “analytics at the edge.” Shawn Rogers, Chief Research Officer at Dell Statistica, had the following quote in an article in Information Management titled “Will the Citizen Data Scientist Inherit the World?”:

“Organizations are fast coming to the realization that IoT implementations are only going to become more vast and more pervasive, and that as that happens, the traditional analytic model of pulling all data in to a centralized source such as a data warehouse or analytic sandbox is going to make less and less sense.

So, most of the conversations I’m having around IoT analytics today revolve around looking at how companies can flip that model on its head and figure out ways to push the analytics out to the edge. If you can run analytics at the edge, you not only can eliminate the time, bandwidth and expense required to transport the data, but you make it possible to take immediate action in response to the insight. You speed up and simplify the analytic process in a way that’s never been done before.”

So I asked Shawn and his boss John Thompson, General Manager of Advanced Analytics at Dell, to help me understand what exactly they mean by “analytics at the edge.” It really boils down to these questions:

  • Are we really developing analytics at the edge?
  • If not, then what sorts of analytics are we performing at the edge?
  • Where are the analytic models actually being built?
  • And finally, what the heck does “at the edge” really mean?
  • So let’s actually start with that last question: What does “at the edge” really mean?

Question #1: What Is “At The Edge”?
“At the edge” refers to the multitude of devices or sensors that are scattered across any network or embedded throughout a product (car, jet engine, CT Scan) that is generating data about the operations and performance of that specific device or sensor.

For example, the current Airbus A350 model has close to 6,000 sensors and generates 2.5 Tb of data per day, while an even newer model – expected to be available in 2020 – will capture more than triple that amount! It is becoming more and more common for everyday common products to have hundreds if not thousands of embedded sensors that are generating readings every couple of seconds on the operations and performance of that particular product (see Figure 1).

Figure 1: Sensors at the Edge

But collecting these huge and real-time volumes of data doesn’t do anything to directly create business advantage. It is what you do with that data that drives the business value, which brings us to…

Question #2: Are We Really Developing Analytics “At The Edge”?
Are we really “performing analytics” (collecting the data, storing the data, preparing the data, running analytic algorithms, validating the analytic goodness of fit and then acting on the results) at the edges, or are we just “executing the analytic models” at the edges? It’s one thing to “execute the analytic models” (e.g., scores, rules, recommendations) at the edges, but something entirely different to actually “perform analytics” at the edges.

Per Shawn and John, “We can deliver analytic models to any end point. We can execute the analytic models in any environment – large or small. We can execute all the steps in performing analytics in a wide range of environments, but there are limits at the edge. The limits are on the robustness of the environment (i.e. cannot deliver an executable to an environment that does not have the memory or processing power to store it or execute it. We cannot change the laws of physics…;-).)”

Question #3: What Sorts Of Analytics Are We Performing At The Edge?
In our airplane example with 6,000 sensors on the plane generating over 2.5 Tb of data per day, how are we performing the analytics at the end?

Per John and Shawn, if the jet engine has a place to house a Java Virtual Machine (JVM) and an analytic model (i.e., lightweight rules based model), then we can execute the model on the engine itself. If the model streams the data to a network, we can execute the analytic model on a gateway, or intermediate server (see Figure 2).

Figure 2: Executing Analytic Models at The Edge

Think of the network as having concentric rings. Each ring can have many servers. Each server can do either – either executing an analytic model or building the analytic models. Now think of many network networks with concentric rings that interlock at various intersections. Analytics can be at any or all levels including at the core, in a data center or in the cloud.

Per Shawn, “By working in tandem with Dell Boomi, we’ve given users the ability to deploy JVM’s with the analytic models on any edge device or gateway anywhere on the network or device. This edge scoring capability enables organizations to address nearly any IoT analytics use case by executing the analytic models at the edge of the network where data is being created.”

Question #4: Where Are The Analytic Models Actually Being Built?
Okay, so we “execute” the pre-built modes at the edge, but we actually build (test, refine, test, refine) the analytic models by bringing the detailed sensor data back to a central data and analytics environment (a.k.a. the Data Lake). Figure 3, courtesy of Joel Dodd of Pivotal, shows the data flow and the supporting analytics execution.

Figure 3: “At the Edge” Analytic Model Execution

Final point, even if you are doing all the sensor/IoT analysis at the edges, you are likely still going to want to bring the raw IoT data back into the data lake for more extensive analysis in order to house the detailed IoT history. For example, we have major economic cycles every 4 to 7 years. You might want to quantify the impact of these economic changes on your network demand and performance. That would eventually require 8 to 14 years of data. And that’s why you are going to want a data lake as the foundation of the transition from a “connected” IoT world to a “smart” IoT world.

The post The Internet of Things (IoT) and Analytics at The Edge appeared first on InFocus.

More Stories By William Schmarzo

Bill Schmarzo, author of “Big Data: Understanding How Data Powers Big Business”, is responsible for setting the strategy and defining the Big Data service line offerings and capabilities for the EMC Global Services organization. As part of Bill’s CTO charter, he is responsible for working with organizations to help them identify where and how to start their big data journeys. He’s written several white papers, avid blogger and is a frequent speaker on the use of Big Data and advanced analytics to power organization’s key business initiatives. He also teaches the “Big Data MBA” at the University of San Francisco School of Management.

Bill has nearly three decades of experience in data warehousing, BI and analytics. Bill authored EMC’s Vision Workshop methodology that links an organization’s strategic business initiatives with their supporting data and analytic requirements, and co-authored with Ralph Kimball a series of articles on analytic applications. Bill has served on The Data Warehouse Institute’s faculty as the head of the analytic applications curriculum.

Previously, Bill was the Vice President of Advertiser Analytics at Yahoo and the Vice President of Analytic Applications at Business Objects.

Latest Stories
More and more brands have jumped on the IoT bandwagon. We have an excess of wearables – activity trackers, smartwatches, smart glasses and sneakers, and more that track seemingly endless datapoints. However, most consumers have no idea what “IoT” means. Creating more wearables that track data shouldn't be the aim of brands; delivering meaningful, tangible relevance to their users should be. We're in a period in which the IoT pendulum is still swinging. Initially, it swung toward "smart for smar...
Governments around the world are adopting Safe Harbor privacy provisions to protect customer data from leaving sovereign territories. Increasingly, global companies are required to create new instances of their server clusters in multiple countries to keep abreast of these new Safe Harbor laws. Is it worth it? In his session at 19th Cloud Expo, Adam Rogers, Managing Director of Anexia, Inc., will discuss how to keep your data legal and still stay in business.
@ThingsExpo has been named the Top 5 Most Influential Internet of Things Brand by Onalytica in the ‘The Internet of Things Landscape 2015: Top 100 Individuals and Brands.' Onalytica analyzed Twitter conversations around the #IoT debate to uncover the most influential brands and individuals driving the conversation. Onalytica captured data from 56,224 users. The PageRank based methodology they use to extract influencers on a particular topic (tweets mentioning #InternetofThings or #IoT in this ...
Successful transition from traditional IT to cloud computing requires three key ingredients: an IT architecture that allows companies to extend their internal best practices to the cloud, a cost point that allows economies of scale, and automated processes that manage risk exposure and maintain regulatory compliance with industry regulations (FFIEC, PCI-DSS, HIPAA, FISMA). The unique combination of VMware, the IBM Cloud, and Cloud Raxak, a 2016 Gartner Cool Vendor in IT Automation, provides a co...
P2P RTC will impact the landscape of communications, shifting from traditional telephony style communications models to OTT (Over-The-Top) cloud assisted & PaaS (Platform as a Service) communication services. The P2P shift will impact many areas of our lives, from mobile communication, human interactive web services, RTC and telephony infrastructure, user federation, security and privacy implications, business costs, and scalability. In his session at @ThingsExpo, Robin Raymond, Chief Architect...
So you think you are a DevOps warrior, huh? Put your money (not really, it’s free) where your metrics are and prove it by taking The Ultimate DevOps Geek Quiz Challenge, sponsored by DevOps Summit. Battle through the set of tough questions created by industry thought leaders to earn your bragging rights and win some cool prizes.
SYS-CON Events announced today that SoftNet Solutions will exhibit at the 19th International Cloud Expo, which will take place on November 1–3, 2016, at the Santa Clara Convention Center in Santa Clara, CA. SoftNet Solutions specializes in Enterprise Solutions for Hadoop and Big Data. It offers customers the most open, robust, and value-conscious portfolio of solutions, services, and tools for the shortest route to success with Big Data. The unique differentiator is the ability to architect and ...
SYS-CON Events announced today that Niagara Networks will exhibit at the 19th International Cloud Expo, which will take place on November 1–3, 2016, at the Santa Clara Convention Center in Santa Clara, CA. Niagara Networks offers the highest port-density systems, and the most complete Next-Generation Network Visibility systems including Network Packet Brokers, Bypass Switches, and Network TAPs.
SYS-CON Events announced today that Embotics, the cloud automation company, will exhibit at the 19th International Cloud Expo, which will take place on November 1–3, 2016, at the Santa Clara Convention Center in Santa Clara, CA. Embotics is the cloud automation company for IT organizations and service providers that need to improve provisioning or enable self-service capabilities. With a relentless focus on delivering a premier user experience and unmatched customer support, Embotics is the fas...
In an era of historic innovation fueled by unprecedented access to data and technology, the low cost and risk of entering new markets has leveled the playing field for business. Today, any ambitious innovator can easily introduce a new application or product that can reinvent business models and transform the client experience. In their Day 2 Keynote at 19th Cloud Expo, Mercer Rowe, IBM Vice President of Strategic Alliances, and Raejeanne Skillern, Intel Vice President of Data Center Group and ...
SYS-CON Events announced today that StarNet Communications will exhibit at the 19th International Cloud Expo, which will take place on November 1–3, 2016, at the Santa Clara Convention Center in Santa Clara, CA. StarNet Communications’ FastX is the industry first cloud-based remote X Windows emulator. Using standard Web browsers (FireFox, Chrome, Safari, etc.) users from around the world gain highly secure access to applications and data hosted on Linux-based servers in a central data center. ...
Virgil consists of an open-source encryption library, which implements Cryptographic Message Syntax (CMS) and Elliptic Curve Integrated Encryption Scheme (ECIES) (including RSA schema), a Key Management API, and a cloud-based Key Management Service (Virgil Keys). The Virgil Keys Service consists of a public key service and a private key escrow service. 

SYS-CON Events announced today that Cemware will exhibit at the 19th International Cloud Expo, which will take place on November 1–3, 2016, at the Santa Clara Convention Center in Santa Clara, CA. Use MATLAB functions by just visiting website mathfreeon.com. MATLAB compatible, freely usable, online platform services. As of October 2016, 80,000 users from 180 countries are enjoying our platform service.
Data is the fuel that drives the machine learning algorithmic engines and ultimately provides the business value. In his session at Cloud Expo, Ed Featherston, a director and senior enterprise architect at Collaborative Consulting, will discuss the key considerations around quality, volume, timeliness, and pedigree that must be dealt with in order to properly fuel that engine.
All clouds are not equal. To succeed in a DevOps context, organizations should plan to develop/deploy apps across a choice of on-premise and public clouds simultaneously depending on the business needs. This is where the concept of the Lean Cloud comes in - resting on the idea that you often need to relocate your app modules over their life cycles for both innovation and operational efficiency in the cloud. In his session at @DevOpsSummit at19th Cloud Expo, Valentin (Val) Bercovici, CTO of So...