|By Haim Koshchitzky||
|July 28, 2014 07:45 AM EDT||
Sharing data is a cornerstone of the scientific method because it makes it possible to replicate work. That foundation is mostly absent from data science, which makes obtaining and reusing knowledge more difficult than it should be.
Job postings for data scientists increased 15,000 percent between 2011 and 2012, and Gartner predicted that 63% of organizations would invest in Big Data this year. The communications, consumer, education, financial, healthcare, government, manufacturing, and retail sectors are all adopting business practices that are using data science to inform their activities and improve operations.
There are a number of companies creating solutions to visualize and uncover insights from large volumes of data with robust platforms in operation worldwide. Vast volumes of data from applications logs to the network and business activities are well served by today's analytics technologies - computation isn't the issue. The ability to model data into experiments that act on data with data sources and conclusions is what's missing, and it's an emerging problem for businesses.
Gartner has observed that those same organizations are now "struggling" with deriving value from and managing Big Data (depending on organizational maturity). That could be due to what famed microbiologist Ludwik Fleck deemed an "empty mind" as he explored the sociology of science during the 1930s. What is that exactly? Fleck postulated that a mind must be filled with initial knowledge before it can perceive or think. This logic applies to organizations too.
Fleck's theory was that participating in a "thought collective" of institutional knowledge would fill minds. His works concluded that cognition is a collaborative activity because a body of knowledge is acquired from a group. It could be argued that making it possible to reuse data experiments would have the same effect. Organizations that can't find value in data have an empty mind.
Big Data should follow the lead of the scientific method (which was influenced by Fleck's ideas) to put greater emphasis on sharing and reusing data. Why is that important for businesses? Scientific data is easy to share among different organizations. Having the ability to do the same with data science could solve what's emerging as a major pain point. Employees change roles and organizations, but what happens to the knowledge, experiments, and patterns?
Whether the academic model would also function in the enterprise is a fascinating question for data scientists, operations professionals and industries. The next great "open source" horizon could be the exchange of knowledge.
It would be interesting to see companies take on the challenge of building systems that organize and share experiments more liberally to put an end to the empty brain problem. After all, data science is still science. Why should it be treated differently?
Sep. 25, 2016 06:30 PM EDT Reads: 1,802
Sep. 25, 2016 06:15 PM EDT Reads: 2,406
Sep. 25, 2016 05:30 PM EDT Reads: 1,536
Sep. 25, 2016 05:15 PM EDT Reads: 1,522
Sep. 25, 2016 05:00 PM EDT Reads: 1,748
Sep. 25, 2016 04:15 PM EDT Reads: 2,558
Sep. 25, 2016 03:15 PM EDT Reads: 4,386
Sep. 25, 2016 03:00 PM EDT Reads: 1,575
Sep. 25, 2016 02:30 PM EDT Reads: 1,514
Sep. 25, 2016 02:00 PM EDT Reads: 1,516
Sep. 25, 2016 02:00 PM EDT Reads: 2,602
Sep. 25, 2016 01:00 PM EDT Reads: 830
Sep. 25, 2016 12:45 PM EDT Reads: 2,459
Sep. 25, 2016 12:15 PM EDT Reads: 3,381
Sep. 25, 2016 12:15 PM EDT Reads: 1,115