Welcome!

Related Topics: Government Cloud, @CloudExpo, @BigDataExpo, @ThingsExpo

Government Cloud: Blog Post

The #IoT and #Analytics | @ThingsExpo #BigData #BI #AI #DX #MachineLearning

The Internet of Things promises to change everything by enabling “smart” environments and smart products

The Internet of Things (IoT) and Analytics at The Edge

The Internet of Things (IoT) promises to change everything by enabling “smart” environments (homes, cities, hospitals, schools, stores, etc.) and smart products (cars, trucks, airplanes, trains, wind turbines, lawnmowers, etc.). I recently wrote about the importance of moving beyond “connected” to “smart” in a blog titled “Internet of Things: Connected Does Not Equal Smart”. The article discusses the importance of moving beyond just collecting the data, to transitioning to leveraging this new wealth of IoT data to improve the decisions that these smart environments and products need to make: to help these environments and products to self-monitor, self-diagnose and eventually, self-direct.

But one of the key concepts in enabling this transition from connected to smart is the ability to perform “analytics at the edge.” Shawn Rogers, Chief Research Officer at Dell Statistica, had the following quote in an article in Information Management titled “Will the Citizen Data Scientist Inherit the World?”:

“Organizations are fast coming to the realization that IoT implementations are only going to become more vast and more pervasive, and that as that happens, the traditional analytic model of pulling all data in to a centralized source such as a data warehouse or analytic sandbox is going to make less and less sense.

So, most of the conversations I’m having around IoT analytics today revolve around looking at how companies can flip that model on its head and figure out ways to push the analytics out to the edge. If you can run analytics at the edge, you not only can eliminate the time, bandwidth and expense required to transport the data, but you make it possible to take immediate action in response to the insight. You speed up and simplify the analytic process in a way that’s never been done before.”

So I asked Shawn and his boss John Thompson, General Manager of Advanced Analytics at Dell, to help me understand what exactly they mean by “analytics at the edge.” It really boils down to these questions:

  • Are we really developing analytics at the edge?
  • If not, then what sorts of analytics are we performing at the edge?
  • Where are the analytic models actually being built?
  • And finally, what the heck does “at the edge” really mean?
  • So let’s actually start with that last question: What does “at the edge” really mean?

Question #1: What Is “At The Edge”?
“At the edge” refers to the multitude of devices or sensors that are scattered across any network or embedded throughout a product (car, jet engine, CT Scan) that is generating data about the operations and performance of that specific device or sensor.

For example, the current Airbus A350 model has close to 6,000 sensors and generates 2.5 Tb of data per day, while an even newer model – expected to be available in 2020 – will capture more than triple that amount! It is becoming more and more common for everyday common products to have hundreds if not thousands of embedded sensors that are generating readings every couple of seconds on the operations and performance of that particular product (see Figure 1).

Figure 1: Sensors at the Edge

But collecting these huge and real-time volumes of data doesn’t do anything to directly create business advantage. It is what you do with that data that drives the business value, which brings us to…

Question #2: Are We Really Developing Analytics “At The Edge”?
Are we really “performing analytics” (collecting the data, storing the data, preparing the data, running analytic algorithms, validating the analytic goodness of fit and then acting on the results) at the edges, or are we just “executing the analytic models” at the edges? It’s one thing to “execute the analytic models” (e.g., scores, rules, recommendations) at the edges, but something entirely different to actually “perform analytics” at the edges.

Per Shawn and John, “We can deliver analytic models to any end point. We can execute the analytic models in any environment – large or small. We can execute all the steps in performing analytics in a wide range of environments, but there are limits at the edge. The limits are on the robustness of the environment (i.e. cannot deliver an executable to an environment that does not have the memory or processing power to store it or execute it. We cannot change the laws of physics…;-).)”

Question #3: What Sorts Of Analytics Are We Performing At The Edge?
In our airplane example with 6,000 sensors on the plane generating over 2.5 Tb of data per day, how are we performing the analytics at the end?

Per John and Shawn, if the jet engine has a place to house a Java Virtual Machine (JVM) and an analytic model (i.e., lightweight rules based model), then we can execute the model on the engine itself. If the model streams the data to a network, we can execute the analytic model on a gateway, or intermediate server (see Figure 2).

Figure 2: Executing Analytic Models at The Edge

Think of the network as having concentric rings. Each ring can have many servers. Each server can do either – either executing an analytic model or building the analytic models. Now think of many network networks with concentric rings that interlock at various intersections. Analytics can be at any or all levels including at the core, in a data center or in the cloud.

Per Shawn, “By working in tandem with Dell Boomi, we’ve given users the ability to deploy JVM’s with the analytic models on any edge device or gateway anywhere on the network or device. This edge scoring capability enables organizations to address nearly any IoT analytics use case by executing the analytic models at the edge of the network where data is being created.”

Question #4: Where Are The Analytic Models Actually Being Built?
Okay, so we “execute” the pre-built modes at the edge, but we actually build (test, refine, test, refine) the analytic models by bringing the detailed sensor data back to a central data and analytics environment (a.k.a. the Data Lake). Figure 3, courtesy of Joel Dodd of Pivotal, shows the data flow and the supporting analytics execution.

Figure 3: “At the Edge” Analytic Model Execution

Final point, even if you are doing all the sensor/IoT analysis at the edges, you are likely still going to want to bring the raw IoT data back into the data lake for more extensive analysis in order to house the detailed IoT history. For example, we have major economic cycles every 4 to 7 years. You might want to quantify the impact of these economic changes on your network demand and performance. That would eventually require 8 to 14 years of data. And that’s why you are going to want a data lake as the foundation of the transition from a “connected” IoT world to a “smart” IoT world.

The post The Internet of Things (IoT) and Analytics at The Edge appeared first on InFocus.

Read the original blog entry...

More Stories By William Schmarzo

Bill Schmarzo, author of “Big Data: Understanding How Data Powers Big Business”, is responsible for setting the strategy and defining the Big Data service line offerings and capabilities for the EMC Global Services organization. As part of Bill’s CTO charter, he is responsible for working with organizations to help them identify where and how to start their big data journeys. He’s written several white papers, avid blogger and is a frequent speaker on the use of Big Data and advanced analytics to power organization’s key business initiatives. He also teaches the “Big Data MBA” at the University of San Francisco School of Management.

Bill has nearly three decades of experience in data warehousing, BI and analytics. Bill authored EMC’s Vision Workshop methodology that links an organization’s strategic business initiatives with their supporting data and analytic requirements, and co-authored with Ralph Kimball a series of articles on analytic applications. Bill has served on The Data Warehouse Institute’s faculty as the head of the analytic applications curriculum.

Previously, Bill was the Vice President of Advertiser Analytics at Yahoo and the Vice President of Analytic Applications at Business Objects.

Latest Stories
DevOps is often described as a combination of technology and culture. Without both, DevOps isn't complete. However, applying the culture to outdated technology is a recipe for disaster; as response times grow and connections between teams are delayed by technology, the culture will die. A Nutanix Enterprise Cloud has many benefits that provide the needed base for a true DevOps paradigm. In his Day 3 Keynote at 20th Cloud Expo, Chris Brown, a Solutions Marketing Manager at Nutanix, will explore t...
Five years ago development was seen as a dead-end career, now it’s anything but – with an explosion in mobile and IoT initiatives increasing the demand for skilled engineers. But apart from having a ready supply of great coders, what constitutes true ‘DevOps Royalty’? It’ll be the ability to craft resilient architectures, supportability, security everywhere across the software lifecycle. In his keynote at @DevOpsSummit at 20th Cloud Expo, Jeffrey Scheaffer, GM and SVP, Continuous Delivery Busine...
SYS-CON Events announced today that Outscale, a global pure play Infrastructure as a Service provider and strategic partner of Dassault Systèmes, will exhibit at SYS-CON's 20th International Cloud Expo®, which will take place on June 6-8, 2017, at the Javits Center in New York City, NY. Founded in 2010, Outscale simplifies infrastructure complexities and boosts the business agility of its customers. Outscale delivers a secure, reliable and industrial strength solution for its customers, which in...
Most DevOps journeys involve several phases of maturity. Research shows that the inflection point where organizations begin to see maximum value is when they implement tight integration deploying their code to their infrastructure. Success at this level is the last barrier to at-will deployment. Storage, for instance, is more capable than where we read and write data. In his session at @DevOpsSummit at 20th Cloud Expo, Josh Atwell, a Developer Advocate for NetApp, will discuss the role and value...
Regardless of what business you’re in, it’s increasingly a software-driven business. Consumers’ rising expectations for connected digital and physical experiences are driving what some are calling the "Customer Experience Challenge.” In his session at @DevOpsSummit at 20th Cloud Expo, Marco Morales, Director of Global Solutions at CollabNet, will discuss how organizations are increasingly adopting a discipline of Value Stream Mapping to ensure that the software they are producing is poised to o...
IBM helps FinTechs and financial services companies build and monetize cognitive-enabled financial services apps quickly and at scale. Hosted on IBM Bluemix, IBM’s platform builds in customer insights, regulatory compliance analytics and security to help reduce development time and testing. In his session at 20th Cloud Expo, Tom Eck, Industry Platforms CTO at IBM Cloud, will discuss how these tools simplify the time-consuming tasks of selection, mapping and data integration, allowing developers ...
In order to meet the rapidly changing demands of today’s customers, companies are continually forced to redefine their business strategies in order to meet these needs, stay relevant and continue to see profitable growth. IoT deployment and development is integral in this transformation, and today businesses are increasingly seeing the value of investing their resources into IoT deployments. These technologies are able increase ROI through projects such as connecting supply chains or enabling sm...
SYS-CON Events announced today that CollabNet, a global leader in enterprise software development, release automation and DevOps solutions, will be a Bronze Sponsor of SYS-CON's 20th International Cloud Expo®, taking place from June 6-8, 2017, at the Javits Center in New York City, NY. CollabNet offers a broad range of solutions with the mission of helping modern organizations deliver quality software at speed. The company’s latest innovation, the DevOps Lifecycle Manager (DLM), supports Value S...
A strange thing is happening along the way to the Internet of Things, namely far too many devices to work with and manage. It has become clear that we'll need much higher efficiency user experiences that can allow us to more easily and scalably work with the thousands of devices that will soon be in each of our lives. Enter the conversational interface revolution, combining bots we can literally talk with, gesture to, and even direct with our thoughts, with embedded artificial intelligence, whic...
SYS-CON Events announced today that Peak 10, Inc., a national IT infrastructure and cloud services provider, will exhibit at SYS-CON's 20th International Cloud Expo®, which will take place on June 6-8, 2017, at the Javits Center in New York City, NY. Peak 10 provides reliable, tailored data center and network services, cloud and managed services. Its solutions are designed to scale and adapt to customers’ changing business needs, enabling them to lower costs, improve performance and focus intern...
Everywhere we turn in our industry we can find strong opinions about the direction, type and nature of cloud’s impact on computing and business. Another word that is used in every context in our industry is “hybrid.” In his session at 20th Cloud Expo, Alvaro Gonzalez, Director of Technical, Partner and Field Marketing at Peak 10, will use a combination of a few conceptual props and some research recently commissioned by Peak 10 to offer a real-world consideration of how the various categories of...
Cloud applications are seeing a deluge of requests to support the exploding advanced analytics market. “Open analytics” is the emerging strategy to deliver that data through an open data access layer, in the cloud, to be directly consumed by external analytics tools and popular programming languages. An increasing number of data engineers and data scientists use a variety of platforms and advanced analytics languages such as SAS, R, Python and Java, as well as frameworks such as Hadoop and Spark...
SYS-CON Events announced today that Super Micro Computer, Inc., a global leader in compute, storage and networking technologies, will exhibit at SYS-CON's 20th International Cloud Expo®, which will take place on June 6-8, 2017, at the Javits Center in New York City, NY. Supermicro (NASDAQ: SMCI), the leading innovator in high-performance, high-efficiency server technology, is a premier provider of advanced server Building Block Solutions® for Data Center, Cloud Computing, Enterprise IT, Hadoop/...
This talk centers around how to automate best practices in a multi-/hybrid-cloud world based on our work with customers like GE, Discovery Communications and Fannie Mae. Today’s enterprises are reaping the benefits of cloud computing, but also discovering many risks and challenges. In the age of DevOps and the decentralization of IT, it’s easy to over-provision resources, forget that instances are running, or unintentionally expose vulnerabilities.
Multiple data types are pouring into IoT deployments. Data is coming in small packages as well as enormous files and data streams of many sizes. Widespread use of mobile devices adds to the total. In this power panel at @ThingsExpo, moderated by Conference Chair Roger Strukhoff, panelists will look at the tools and environments that are being put to use in IoT deployments, as well as the team skills a modern enterprise IT shop needs to keep things running, get a handle on all this data, and deli...