Bill Schmarzo’s Top 2017 Big Data, Data Science and IOT Blogs

To put us on the path for a successful and engaging 2018, here is a quick review of my top 10 blogs from 2017.

#10. Is Data Science Really Science?

Science works within systems of laws, such as the laws of physics, thermodynamics, mathematics, and many others. Scientists can apply these laws to understand why certain actions lead to certain outcomes or why something is going to occur.

While there may never be “laws” that dictate human behaviors, in the world of IOT where organizations are melding analytics (machine learning and artificial intelligence) with physical products, we will see “data science” advancing beyond just “data” science. In IOT, the data science team must expand to include scientists and engineers from the physical sciences so that the team can understand and quantify the “why things happen” aspect of the analytic models. If not, the costs could be catastrophic.

Figure 1: Scientific Method Beliefs and Biases

 

Note: I’m adding Figure 1 to this blog to highlight the importance of the Scientific Method and understanding basic statistical techniques to ensure that one is building their analytics on unbiased data against unbiased hypotheses.

#9. Design Thinking: Future-proof Yourself from AI

While there is a high probability that machine learning and artificial intelligence will play an important role in whatever job you hold in the future, there is one way to “future-proof” your career…embrace the power of design thinking.

Design thinking is defined as human-centric design that builds upon a deep understanding of our users (e.g., their tendencies, propensities, inclinations, behaviors) to generate ideas, build prototypes, share what you’ve made, embrace the art of failure (i.e., fail fast but learn faster) and eventually put your innovative solution out into the world. And fortunately for us humans (who really excel at human-centric things), there is a tight correlation between design thinking and machine learning (see Figure 2).

Figure 2: Design Thinking and Machine Learning Mapping

#8. 5 Steps to Building a Big Data Business Strategy

“The problem is that, in many cases, big data is not used well. Companies are better at collecting data – about their customers, about their products, about competitors – than analyzing that data and designing strategy around it.” “Companies Love Big Data but Lack the Strategy to Use It Effectively,” Harvard Business Review

Build a business strategy that incorporates big data: one that uncovers detailed customer, product, service and operational insights that serve as the foundation for optimizing key operational processes, mitigating compliance and cyber-security risks, uncovering new revenue opportunities, and creating a more compelling, more differentiated customer or partner experience.

#7. What tomorrow’s business leaders need to know about Machine Learning.

Much of what comprises “Machine Learning” is really not all that new. Many of the algorithms that fall into the Machine Learning category are analytic algorithms that have been around for decades. These include clustering, association rules, and decision trees. However, the detailed granularity of the data, the wide variety of data sources, and a massive increase in computing power have re-invigorated many of these mature algorithms.

Machine learning is a type of applied artificial intelligence (AI) that provides computers with the ability to gain knowledge without being explicitly programmed. Machine learning focuses on the development of computer programs that can change when exposed to new data (see Figure 4). How can businesses, and business leaders, take advantage?

Figure 4: Supervised and Unsupervised Machine Learning Algorithms
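
To make the "mature algorithms" point concrete, here is a minimal sketch of one unsupervised technique from the clustering family. It uses k-means (Lloyd's algorithm) on one-dimensional data; the choice of k-means, the `kmeans_1d` helper, and the `monthly_spend` figures are all assumptions made for illustration and do not come from the original blog.

```python
def kmeans_1d(values, centroids, iterations=10):
    """Cluster 1-D values around starting centroids (Lloyd's algorithm)."""
    centroids = list(centroids)
    clusters = [[] for _ in centroids]
    for _ in range(iterations):
        # Assignment step: attach each value to its nearest centroid.
        clusters = [[] for _ in centroids]
        for v in values:
            nearest = min(range(len(centroids)), key=lambda i: abs(v - centroids[i]))
            clusters[nearest].append(v)
        # Update step: move each centroid to the mean of its cluster.
        # An empty cluster keeps its previous centroid.
        centroids = [
            sum(c) / len(c) if c else centroids[i]
            for i, c in enumerate(clusters)
        ]
    return centroids, clusters


# Hypothetical monthly spend per customer; no labels are supplied.
monthly_spend = [12, 15, 14, 10, 95, 102, 98, 110]
centers, groups = kmeans_1d(monthly_spend, centroids=[0, 50])
print(centers)  # [12.75, 101.25]
print(groups)   # [[12, 15, 14, 10], [95, 102, 98, 110]]
```

No one told the algorithm which customers are "low" or "high" spenders; the two groups emerge from the data alone. A supervised algorithm such as a decision tree would instead be trained on examples that already carry a known label.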

#6. Is Blockchain the Ultimate Enabler of Data Monetization?

Blockchain is a data structure that maintains a digital ledger of transactions among a distributed network of entities.  Think of a “distributed ledger” that uses cryptography to allow each participant in the transaction to add to the ledger in a secure way without the need for a central authority or central clearinghouse (see Figure 5).

Figure 5: How to Use Blockchain Technology to Retain More Customers
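
The ledger idea can be sketched in a few lines. The example below is a simplified, single-process illustration and not a real blockchain: it shows only the hash-chaining that makes tampering detectable, and it leaves out the distributed network, consensus, and digital signatures. The function names and the sample transactions are hypothetical.

```python
import hashlib
import json


def block_hash(index, prev_hash, transactions):
    """Return the SHA-256 hex digest of a block's contents."""
    payload = json.dumps(
        {"index": index, "prev_hash": prev_hash, "transactions": transactions},
        sort_keys=True,
    )
    return hashlib.sha256(payload.encode("utf-8")).hexdigest()


def append_block(ledger, transactions):
    """Append a block that commits to the hash of the previous block."""
    prev_hash = ledger[-1]["hash"] if ledger else "0" * 64
    index = len(ledger)
    block = {
        "index": index,
        "prev_hash": prev_hash,
        "transactions": transactions,
        "hash": block_hash(index, prev_hash, transactions),
    }
    ledger.append(block)
    return block


def is_valid(ledger):
    """Check each block's hash against its contents and its predecessor."""
    expected_prev = "0" * 64
    for index, block in enumerate(ledger):
        if block["index"] != index or block["prev_hash"] != expected_prev:
            return False
        recomputed = block_hash(index, block["prev_hash"], block["transactions"])
        if block["hash"] != recomputed:
            return False
        expected_prev = block["hash"]
    return True


ledger = []
append_block(ledger, [{"from": "car-17", "to": "mall-3", "item": "parking data"}])
append_block(ledger, [{"from": "mall-3", "to": "retailer-9", "item": "foot traffic"}])
print(is_valid(ledger))  # True

# Rewriting history in an earlier block breaks the chain.
ledger[0]["transactions"][0]["to"] = "someone-else"
print(is_valid(ledger))  # False
```

Because each block stores the hash of the one before it, changing any past transaction invalidates the chain from that block onward. In a real deployment every participant holds a copy and can run this check independently, which is what removes the need for a central clearinghouse.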

Is blockchain the ultimate enabler of data and analytics monetization, creating marketplaces where companies, individuals and even smart entities (cars, trucks, buildings, airports, malls) can share/sell/trade/barter their data and analytic insights directly with others?

The impact on a company’s financials could be overwhelming, or devastating, depending upon which side of the business model transformation you sit on.

#5. Data is a New Currency

When you insert something, a new demand, into a circular flow, you create an economic concept called the Multiplier Effect. It is a concept that countries use to consider how to invest money and how that investment, as it is distributed through a supply chain like the example above, will impact the economy of their country.

Multiplier Effect Definition: “An effect in economics in which an increase in spending produces an increase in national income and consumption greater than the initial amount spent.”

Figure 6: Economic Multiplier Effect
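
The definition above gives no formula. A common textbook simplification, assumed here, is that each recipient re-spends a fixed fraction of what they receive (the marginal propensity to consume, or MPC), which yields a multiplier of 1 / (1 - MPC). The sketch below works through that arithmetic; the function names and the numbers are hypothetical.

```python
def spending_multiplier(mpc):
    """Simple multiplier 1 / (1 - MPC), for an MPC in the range [0, 1)."""
    if not 0 <= mpc < 1:
        raise ValueError("mpc must be in [0, 1)")
    return 1.0 / (1.0 - mpc)


def total_income_after_rounds(initial_spend, mpc, rounds):
    """Sum the income generated as the initial spend is re-spent each round."""
    total = 0.0
    spend = initial_spend
    for _ in range(rounds):
        total += spend
        spend *= mpc  # only a fraction is passed on to the next round
    return total


# Assumed values: 100 units of new spending, half re-spent at every step.
print(spending_multiplier(0.5))                   # 2.0
print(total_income_after_rounds(100.0, 0.5, 10))  # 199.8046875
```

After ten rounds the 100 units of new spending have generated almost 200 units of income, approaching the limit of 100 x 2.0 implied by the multiplier.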

 

Data exhibits a Network Effect, where the same data can be used at the same time across multiple use cases, thereby increasing its value to the organization. I would contend that this network effect is, in principle, the same thing as the Multiplier Effect.

#4. 5 Questions that Define Your Digital Transformation

I had the opportunity to give a 10-minute keynote at DataWorks Summit 2017. What sort of keynote could I give in just 10 minutes? Ten minutes is not long for a keynote, and to be honest, I struggled with what to say.

But after some brainstorming with my marketing experts, we came up with an idea:  Pose five questions that every organization needs to consider as they prepare themselves for digital transformation.  And while I didn’t have enough time in 10 minutes to answer those questions in a keynote, I certainly did in a blog!

Figure 7: 5 Questions that Frame Your Digital Transformation

 

You can also check out a video of my DataWorks Summit keynote presentation, complete with air guitar at the end so that I could embarrass my daughter (my presentation starts around the 39:30 mark)!

#3. Can Design Thinking Unleash Organizational Innovation?

Design Thinking, or human-centered design, is all about building a deep empathy with the people you’re designing for; generating tons of ideas; building a bunch of prototypes; sharing what you’ve made with the people you’re designing for; and eventually putting your innovative new solution out in the world (see Figure 8).

Figure 8: Stanford d.school Design Thinking Process

 

There is a good reason why Stanford’s d.school does not sit within one of its existing schools. Design thinking is used in almost all of Stanford’s schools including business, computer science, electrical, mechanical, and even healthcare. Design thinking appears to be one of the secret sauces behind Stanford’s success in cultivating the entrepreneurial spirit of its students and faculty (and neighbors, in my case).

#2. The Future Is Intelligent Apps

I have seen the future!  The future is a collision between big data (and data science) and application development that will yield a world of “intelligent apps.”

These “intelligent apps” combine customer, product, and operational insights (uncovered with predictive and prescriptive analytics) with modern application development tools and user-centric design to create a more compelling, more prescriptive user experience.

Intelligent apps will support or enable key user decisions, while continually learning from the user interactions to become even more relevant and valuable to those users.

The journey to building intelligent applications starts by understanding the decisions that key business constituents need to make in supporting their business and operational objectives.

Figure 9: Intelligent Application Stack

 

And my #1 blog of 2017 (drum roll please)…

#1. Difference between Big Data and Internet of Things

What are the differences between big data and IOT analytics? Big data analyzes large amounts of mostly human-generated data to support longer-duration use cases. IOT aggregates and compresses massive amounts of low latency / low duration / high volume machine-generated data coming from a wide variety of sensors to support real-time use cases.

I don’t believe that loading sensor data into a data lake and performing data science to create predictive analytic models qualifies as doing IOT analytics. To me, that’s just big data (and potentially REALLY BIG DATA with all that sensor data). Delivering IOT analytic solutions does require big data (with data science and a data lake), but IOT analytics must also include:

  • Streaming data management with the ability to ingest, aggregate (e.g., mean, median, mode), and compress real-time data coming off a wide variety of sensor devices “at the edge” of the network.
  • Edge analytics that automatically analyzes real-time sensor data and renders real-time decisions (actions) at the edge of the network that optimize operational performance (e.g., blade angle or yaw) or flag unusual performance or behaviors for immediate investigation (e.g., security breaches, fraud detection).
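
The two capabilities above can be sketched together. The example below is a simplified illustration and not a production streaming engine: it keeps a small rolling window of readings, compresses that window into the aggregates named above (mean, median, mode), and flags any reading over a fixed threshold as it arrives. The `EdgeWindow` class, the threshold rule, and the sample readings are assumptions made for illustration.

```python
from collections import deque
from statistics import mean, median, multimode


class EdgeWindow:
    """Keep the last `size` readings and summarize them on demand."""

    def __init__(self, size, limit):
        self.readings = deque(maxlen=size)  # older readings fall off automatically
        self.limit = limit                  # hypothetical alert threshold

    def ingest(self, value):
        """Add one reading; return True if it should be flagged immediately."""
        self.readings.append(value)
        return value > self.limit

    def summary(self):
        """Compress the current (non-empty) window into a few aggregates."""
        window = list(self.readings)
        return {
            "count": len(window),
            "mean": mean(window),
            "median": median(window),
            "mode": multimode(window),
        }


# Hypothetical temperature readings from one sensor.
window = EdgeWindow(size=4, limit=40.0)
flags = [window.ingest(v) for v in [20.0, 21.0, 21.0, 22.0, 48.0]]
print(flags)             # [False, False, False, False, True]
print(window.summary())  # {'count': 4, 'mean': 28.0, 'median': 21.5, 'mode': [21.0]}
```

Only the compact summary needs to travel back to the data lake, while the flag is raised at the edge without waiting for a round trip. A real edge deployment would likely replace the fixed threshold with a trained model.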

Sources:

Figure 1: Scientific Method Beliefs and Biases

Figure 4: Supervised and Unsupervised Machine Learning Algorithms

Figure 5: How to Use Blockchain Technology to Retain More Customers

 

The post Bill Schmarzo’s Top 2017 Big Data, Data Science and IOT Blogs appeared first on InFocus Blog | Dell EMC Services.

More Stories By William Schmarzo

Bill Schmarzo, author of “Big Data: Understanding How Data Powers Big Business” and “Big Data MBA: Driving Business Strategies with Data Science”, is responsible for setting strategy and defining the Big Data service offerings for Dell EMC’s Big Data Practice.

As a CTO within Dell EMC’s 2,000+ person consulting organization, he works with organizations to identify where and how to start their big data journeys. He’s written white papers, is an avid blogger and is a frequent speaker on the use of Big Data and data science to power an organization’s key business initiatives. He is a University of San Francisco School of Management (SOM) Executive Fellow where he teaches the “Big Data MBA” course. Bill also just completed a research paper on “Determining The Economic Value of Data”. Onalytica recently ranked Bill as #4 Big Data Influencer worldwide.

Bill has over three decades of experience in data warehousing, BI and analytics. Bill authored the Vision Workshop methodology that links an organization’s strategic business initiatives with their supporting data and analytic requirements. Bill serves on the City of San Jose’s Technology Innovation Board, and on the faculties of The Data Warehouse Institute and Strata.

Previously, Bill was vice president of Analytics at Yahoo where he was responsible for the development of Yahoo’s Advertiser and Website analytics products, including the delivery of “actionable insights” through a holistic user experience. Before that, Bill oversaw the Analytic Applications business unit at Business Objects, including the development, marketing and sales of their industry-defining analytic applications.

Bill holds a Master of Business Administration from the University of Iowa and a Bachelor of Science degree in Mathematics, Computer Science and Business Administration from Coe College.
