Welcome!

Related Topics: @DXWorldExpo, @CloudExpo, @ThingsExpo

@DXWorldExpo: Blog Feed Post

Whose Data Is It? | @CloudExpo #IoT #AI #ML #DL #M2M #BigData #Analytics

It now seems that sports, in this case the NBA, are breaking new ground with another data analytics topic: who owns the data?

Many times, sports have been at the leading edge of data analytics.  The book “Moneyball” was one of the first popular books to bring the basic concepts behind data analytics and data science to the general audience.  Fantasy leagues, sabermetrics and even games like “Strat-O-Matic” baseball and basketball provided an introduction into basic statistical concepts.

And it now seems that sports, in this case the National Basketball Association (NBA), are breaking new ground with another data analytics topic: who owns the data?  The National Basketball Players Association recently banned NBA teams from using a player’s wearable data in contract negotiations or other transactions (see “NBA Bans Teams From Using Wearable Data In Contract Negotiations”).

Maybe after the bitter fights professional and college athletes had about their “likeness” being used for advertising and promotions (think College Hoops 2K8), the players association wanted to get ahead of the curve on the data ownership issue.  If that’s the case, then that’s a very smart move – and a very telling move.  It brings to light a very interesting question:  who owns the personal data coming off wearables and other “intelligent” devices, and when and how can that personal data be used?

It’s easy to imagine how the NBA owners, agents and coaches could use the wearables data.  But there is already a plethora of data available on player performance.  Do I really need wearables data to tell me that Carmelo Anthony (over-rated New York Knicks forward) doesn’t hustle back on defense (or maybe even play defense)?  I can just look at some basic statistics to uncover that insight (see Table 1)

Offensive Real Plus-Minus

Defensive Real Plus-Minus

Carmelo Anthony, New York Knicks 2.18 -1.84
League Rank #35 out of 445 #403 out of 445

Table 1:  Source: http://www.espn.com/nba/statistics/rpm/_/sort/ORPM

From Table 1, we can see that Carmelo Anthony ranks #35 out of 445 NBA players for offensive effectiveness; however, he only ranks #403 out of 445 players on defensive effectiveness.  Again, I don’t need to see wearables data to understand where during the game Carmelo Anthony is putting his effort and hustle[1].  Plus there are other ways to get much of the same performance and effort data, such as video analytics.

Wearables data could be very beneficial to teams and players by scientifically flagging when a player is gassed and needs a rest, or whose body might be breaking down and needs to take a game off.  Wearables data could be used to create personalized training programs that optimize an individual athlete’s strength, endurance and agility capabilities.  Wearables data could be used to minimize training injuries and speed injury recovery.  The number of ways that wearables data, especially combined with in-game performance numbers and other external sources such as weather (temperatures, humidity, precipitation), social media and location data, could improve individual athlete as well as team performance is only being scratched.

By the way, check out the twitter account https://twitter.com/strong_science for examples as to how leading edge sports teams and athletes are combining data and analytics to achieve superior player development and in-game performance.

So Who Owns the Nest Data?
In order to take this conversation to the next level, I wanted to get a feel for the privacy statements that shield our personal information from being exploited for nefarious uses.  So I checked out a sample Privacy Policy.  And given the growing explosion of in-home Internet of Things (IOT) devices, I thought I’d start with the industry leader in home-based sensors and devices…Google Nest.

Just imagine all the insights that Google Nest could glean from having sensors placed throughout your house.  Here are just a few examples of the types of insights that Nest could glean from that data:

  • How many people live in the house?
  • When those people are typically home during the week?
  • When those people are typically home during the weekend or holidays?
  • When do they typically go to bed?
  • When do they typically get up in the morning?
  • Is there movement during the sleep period and if so, when and where is that movement?
  • When are the residents on vacation? When do they typically go on vacation?  How long are they typically gone while on vacation?
  • Do they take weekends away from the home (like ski weekends) and does that correlate to any holidays or
  • Does the time away from the home correlate to turning down the heat?
  • And more!

Boy, the Google Nest could know an awful lot about your home living patterns and tendencies.  That could be quite dangerous if all that were to get into the wrong hands.  So let’s see how Google Nest is protecting our personal and residence data via their privacy policy.

Here is the Google Nest Privacy Policy:

Device Usage information: If you are logged into your Nest account, we record the IP address you visit our website from, and if you have a Nest device or other connected device, we record adjustments you make to the product through the website interface. We store this data along with your email address, information about your Nest device, data collected directly by the device, a history of your device settings, and any other information we have collected about your use of Nest products and services. See our Privacy Statement for Nest Products and Services to learn more about the usage information collected through our products.

Okay, so not much comfort here that my personal data is being protected and won’t be used for whatever purpose Google decides.  But here is what I found even more concerning, selecting the “See our Privacy Statement for Nest Products and Services” link took me right back to this statement.  Yea, a circular reference to a privacy statement that says nothing about how they are going to protect your information.  If you have a Nest device and are not concerned, well I got a bridge in San Francisco to sell you…

Summary
As IOT devices continue to invade our homes, cars, work areas, shopping malls, movie theaters, coffee shops, grocery stores, sporting arenas, concert halls and airports, the multitude of different organizations that “own” that data will only become more confusing.  And while that data has great potential to do good for the individual, it also has the potential for much bad as well.  This issue is only going to grow as IOT continues its meteoritic growth in all aspects of our lives.  Read a few privacy policies and you will soon realize that the only thing that separates good from bad is only a few words in a paper-thin privacy policy.

And that should scare you.

[1] Real Plus-Minus (RPM) Real Plus-Minus is meant to be predictive.  RPM can help coaches (and agents) differentiate between players who have been consistently good (and will likely keep being good) and players who are merely going through a hot streak (and will likely regress to their mean).

The post Whose Data is it? appeared first on InFocus Blog | Dell EMC Services.

More Stories By William Schmarzo

Bill Schmarzo, author of “Big Data: Understanding How Data Powers Big Business” and “Big Data MBA: Driving Business Strategies with Data Science”, is responsible for setting strategy and defining the Big Data service offerings for Hitachi Vantara as CTO, IoT and Analytics.

Previously, as a CTO within Dell EMC’s 2,000+ person consulting organization, he works with organizations to identify where and how to start their big data journeys. He’s written white papers, is an avid blogger and is a frequent speaker on the use of Big Data and data science to power an organization’s key business initiatives. He is a University of San Francisco School of Management (SOM) Executive Fellow where he teaches the “Big Data MBA” course. Bill also just completed a research paper on “Determining The Economic Value of Data”. Onalytica recently ranked Bill as #4 Big Data Influencer worldwide.

Bill has over three decades of experience in data warehousing, BI and analytics. Bill authored the Vision Workshop methodology that links an organization’s strategic business initiatives with their supporting data and analytic requirements. Bill serves on the City of San Jose’s Technology Innovation Board, and on the faculties of The Data Warehouse Institute and Strata.

Previously, Bill was vice president of Analytics at Yahoo where he was responsible for the development of Yahoo’s Advertiser and Website analytics products, including the delivery of “actionable insights” through a holistic user experience. Before that, Bill oversaw the Analytic Applications business unit at Business Objects, including the development, marketing and sales of their industry-defining analytic applications.

Bill holds a Masters Business Administration from University of Iowa and a Bachelor of Science degree in Mathematics, Computer Science and Business Administration from Coe College.

Latest Stories
With more than 30 Kubernetes solutions in the marketplace, it's tempting to think Kubernetes and the vendor ecosystem has solved the problem of operationalizing containers at scale or of automatically managing the elasticity of the underlying infrastructure that these solutions need to be truly scalable. Far from it. There are at least six major pain points that companies experience when they try to deploy and run Kubernetes in their complex environments. In this presentation, the speaker will d...
While DevOps most critically and famously fosters collaboration, communication, and integration through cultural change, culture is more of an output than an input. In order to actively drive cultural evolution, organizations must make substantial organizational and process changes, and adopt new technologies, to encourage a DevOps culture. Moderated by Andi Mann, panelists discussed how to balance these three pillars of DevOps, where to focus attention (and resources), where organizations might...
The deluge of IoT sensor data collected from connected devices and the powerful AI required to make that data actionable are giving rise to a hybrid ecosystem in which cloud, on-prem and edge processes become interweaved. Attendees will learn how emerging composable infrastructure solutions deliver the adaptive architecture needed to manage this new data reality. Machine learning algorithms can better anticipate data storms and automate resources to support surges, including fully scalable GPU-c...
When building large, cloud-based applications that operate at a high scale, it's important to maintain a high availability and resilience to failures. In order to do that, you must be tolerant of failures, even in light of failures in other areas of your application. "Fly two mistakes high" is an old adage in the radio control airplane hobby. It means, fly high enough so that if you make a mistake, you can continue flying with room to still make mistakes. In his session at 18th Cloud Expo, Le...
Machine learning has taken residence at our cities' cores and now we can finally have "smart cities." Cities are a collection of buildings made to provide the structure and safety necessary for people to function, create and survive. Buildings are a pool of ever-changing performance data from large automated systems such as heating and cooling to the people that live and work within them. Through machine learning, buildings can optimize performance, reduce costs, and improve occupant comfort by ...
As Cybric's Chief Technology Officer, Mike D. Kail is responsible for the strategic vision and technical direction of the platform. Prior to founding Cybric, Mike was Yahoo's CIO and SVP of Infrastructure, where he led the IT and Data Center functions for the company. He has more than 24 years of IT Operations experience with a focus on highly-scalable architectures.
CI/CD is conceptually straightforward, yet often technically intricate to implement since it requires time and opportunities to develop intimate understanding on not only DevOps processes and operations, but likely product integrations with multiple platforms. This session intends to bridge the gap by offering an intense learning experience while witnessing the processes and operations to build from zero to a simple, yet functional CI/CD pipeline integrated with Jenkins, Github, Docker and Azure...
The explosion of new web/cloud/IoT-based applications and the data they generate are transforming our world right before our eyes. In this rush to adopt these new technologies, organizations are often ignoring fundamental questions concerning who owns the data and failing to ask for permission to conduct invasive surveillance of their customers. Organizations that are not transparent about how their systems gather data telemetry without offering shared data ownership risk product rejection, regu...
René Bostic is the Technical VP of the IBM Cloud Unit in North America. Enjoying her career with IBM during the modern millennial technological era, she is an expert in cloud computing, DevOps and emerging cloud technologies such as Blockchain. Her strengths and core competencies include a proven record of accomplishments in consensus building at all levels to assess, plan, and implement enterprise and cloud computing solutions. René is a member of the Society of Women Engineers (SWE) and a m...
Dhiraj Sehgal works in Delphix's product and solution organization. His focus has been DevOps, DataOps, private cloud and datacenters customers, technologies and products. He has wealth of experience in cloud focused and virtualized technologies ranging from compute, networking to storage. He has spoken at Cloud Expo for last 3 years now in New York and Santa Clara.
Enterprises are striving to become digital businesses for differentiated innovation and customer-centricity. Traditionally, they focused on digitizing processes and paper workflow. To be a disruptor and compete against new players, they need to gain insight into business data and innovate at scale. Cloud and cognitive technologies can help them leverage hidden data in SAP/ERP systems to fuel their businesses to accelerate digital transformation success.
Containers and Kubernetes allow for code portability across on-premise VMs, bare metal, or multiple cloud provider environments. Yet, despite this portability promise, developers may include configuration and application definitions that constrain or even eliminate application portability. In this session we'll describe best practices for "configuration as code" in a Kubernetes environment. We will demonstrate how a properly constructed containerized app can be deployed to both Amazon and Azure ...
Poor data quality and analytics drive down business value. In fact, Gartner estimated that the average financial impact of poor data quality on organizations is $9.7 million per year. But bad data is much more than a cost center. By eroding trust in information, analytics and the business decisions based on these, it is a serious impediment to digital transformation.
Digital Transformation: Preparing Cloud & IoT Security for the Age of Artificial Intelligence. As automation and artificial intelligence (AI) power solution development and delivery, many businesses need to build backend cloud capabilities. Well-poised organizations, marketing smart devices with AI and BlockChain capabilities prepare to refine compliance and regulatory capabilities in 2018. Volumes of health, financial, technical and privacy data, along with tightening compliance requirements by...
Predicting the future has never been more challenging - not because of the lack of data but because of the flood of ungoverned and risk laden information. Microsoft states that 2.5 exabytes of data are created every day. Expectations and reliance on data are being pushed to the limits, as demands around hybrid options continue to grow.