|By Haim Koshchitzky||
|July 28, 2014 07:45 AM EDT||
Sharing data is a cornerstone of the scientific method because it makes it possible to replicate work. That foundation is mostly absent from data science, which makes obtaining and reusing knowledge more difficult than it should be.
Job postings for data scientists increased 15,000 percent between 2011 and 2012, and Gartner predicted that 63% of organizations would invest in Big Data this year. The communications, consumer, education, financial, healthcare, government, manufacturing, and retail sectors are all adopting business practices that are using data science to inform their activities and improve operations.
There are a number of companies creating solutions to visualize and uncover insights from large volumes of data with robust platforms in operation worldwide. Vast volumes of data from applications logs to the network and business activities are well served by today's analytics technologies - computation isn't the issue. The ability to model data into experiments that act on data with data sources and conclusions is what's missing, and it's an emerging problem for businesses.
Gartner has observed that those same organizations are now "struggling" with deriving value from and managing Big Data (depending on organizational maturity). That could be due to what famed microbiologist Ludwik Fleck deemed an "empty mind" as he explored the sociology of science during the 1930s. What is that exactly? Fleck postulated that a mind must be filled with initial knowledge before it can perceive or think. This logic applies to organizations too.
Fleck's theory was that participating in a "thought collective" of institutional knowledge would fill minds. His works concluded that cognition is a collaborative activity because a body of knowledge is acquired from a group. It could be argued that making it possible to reuse data experiments would have the same effect. Organizations that can't find value in data have an empty mind.
Big Data should follow the lead of the scientific method (which was influenced by Fleck's ideas) to put greater emphasis on sharing and reusing data. Why is that important for businesses? Scientific data is easy to share among different organizations. Having the ability to do the same with data science could solve what's emerging as a major pain point. Employees change roles and organizations, but what happens to the knowledge, experiments, and patterns?
Whether the academic model would also function in the enterprise is a fascinating question for data scientists, operations professionals and industries. The next great "open source" horizon could be the exchange of knowledge.
It would be interesting to see companies take on the challenge of building systems that organize and share experiments more liberally to put an end to the empty brain problem. After all, data science is still science. Why should it be treated differently?
Dec. 1, 2015 08:00 AM EST Reads: 250
Dec. 1, 2015 08:00 AM EST Reads: 395
Dec. 1, 2015 08:00 AM EST Reads: 396
Dec. 1, 2015 06:30 AM EST Reads: 514
Dec. 1, 2015 06:00 AM EST Reads: 271
Dec. 1, 2015 05:00 AM EST Reads: 620
Dec. 1, 2015 05:00 AM EST Reads: 359
Dec. 1, 2015 04:45 AM EST Reads: 458
Dec. 1, 2015 04:15 AM EST
Dec. 1, 2015 03:30 AM EST Reads: 532
Dec. 1, 2015 03:00 AM EST Reads: 470
Dec. 1, 2015 03:00 AM EST Reads: 111
Dec. 1, 2015 03:00 AM EST Reads: 469
Dec. 1, 2015 01:15 AM EST Reads: 125
Dec. 1, 2015 01:00 AM EST Reads: 437