Blog Feed Post

Heart of the Data Science Revolution… Mankato, MN?

My job has the great advantage of bringing me in touch with some outstanding and truly brilliant people.  As an example, last year I spoke at the PrecisionAg Vision Conference in Phoenix (and I’ll be speaking there again this fall on October 12), and ran into a number of companies that were applying data science and big data principals to the world of agriculture.  Having grown up in Charles City, Iowa, I have a special place in my heart for farming and the importance of agriculture to the health and financial success of our country.

One company that I ran into – Aglytix – really impressed me with their application of data science to some of the fundamental decisions that farmers need to make in order to optimize yield while minimizing costs.  Aglytix’s approach to first identify the farmer’s most important decisions and then apply data science to optimize those decisions plays right to the heart of the approach that we teach our customers (see Figure 1).

Big data and data science - focus on What's Important to the Business

Figure 1: Focus on What’s Important to the Business

And while I am totally fascinated by Aglytix’s analytic solutions, here is what I was even more impressed with:

  • Aglytix is a 15-person company. This is NOT a Fortune 500 company with an over-flowing abundance of business analysts, data engineers and data scientists.  15 people.  15 people with deep subject matter expertise in the area of farming (several came from farming communities like me) who have learned data science from leading data science hubs like North Dakota, Minnesota State, Mankato…not exactly Stanford, Cal or MIT.
  • Located in Mankato, Minnesota. That’s 100 miles away from Minneapolis, and 120 miles away from the Big Data Mecca of Charles City, Iowa.

It is exhilarating to see that big data is alive and well in the Heartland of America.  In fact, the “democratization of analytics” driven by economics of big data is enabling small communities and small businesses to thrive and compete against much larger and better-financed organizations.

I had the opportunity to spend some time with Jerry Johnson, the founder and CEO of Aglytix. Here is what I learned from our conversation.

Lesson #1:  Focus On The Customers’ Most Important Decisions

“Organizations don’t fail at big data because of a lack of opportunities; they fail because they have too many.”

That’s one of my most important messages, and Aglytix has taken it to heart.  Aglytix has focused on one use case at a time to build out its suite of lean farming analytics.  For example: quantifying the impact of crop stress. Crop stress decreases yield by inhibiting crop growth and decreasing the maximum potential crop yield.  Crop stress is caused by factors such as diseases, insects, drought, and salinity and excesses of trace elements in the soil.

Aglytix provides crop stress analytics to help farmers understand the financial impact of crop stress on the farmer’s yield and finances (see Figure 2).

Big data and data science - Impact of Crop Stress

Figure 2: Impact of Crop Stress

The analytics in Figure 2 show the impact of crop stress over time, both from the perspective of the field (the bottom progression of crop stress on the fields) as well as a quantitative perspective from the chart in the upper right of Figure 1. The types of actions the analytics can recommend to reduce crop stress include the optimal application of water, fertilizers, herbicides and pesticides (both too much and too little can bad for the crops).

Lesson #2: Lead with Recommendations

“Don’t just throw more charts and data at me.  Tell me what I need to do!”

This was one of my early lessons from my Yahoo Advertiser Analytics experience.  Farmers, like Yahoo Media planners and campaign managers, don’t want “informative” charts and dashboards; they wanted “actionable” insights and recommendations (prescriptive analytics) regarding what they need to do to improve yield and control costs.

Another Aglytix use cases focuses on optimizing weed control.  There are numerous variables that a farmer needs to consider in order to optimize weed control from a yield versus cost perspective.  And as is typically in any real-world situation, those variables are in constant flux.  So instead of trying to turn the farmer into a data scientist by giving them charts and reports, instead Aglytix helps the farmer become a more effective farmer by providing recommendations as to what actions to take (see Figure 3).

big data and data science - Weed Control Recommendations

Figure 3: Weed Control Recommendations

The prescriptive analytics in Figure 3 provides the farmer or agronomist with the analysis of what fields to spray, with what herbicides, at what rates and with what pressure in order to optimize weed control.  With prescriptive analytics, we don’t need to force the users to guess what to do; we can just tell them what to do with recommendations (think Netflix recommending movies to watch or Amazon recommending products to buy).

Of course the key to successful prescriptive analytics is to measure the effectiveness of the recommendations.  In this case, we’d want to know the results of the farmer’s weed spraying campaign so that we can measure how effective it was, and use the results to fine-tune our analytic models.

Lesson #3: Create a Use Case Roadmap

“The best way to guarantee that you don’t solve any problems, is to try to solve them all”

Because of the bounty of opportunities against which to apply data and analytics in order to optimize key operational processes, organizations need to embrace the power of prioritization and focus (see the blog “Big Data Success: Prioritize Important Over Urgent”).  The results of prioritizing and focusing will yield a use case roadmap that addresses both cost savings and revenue (yield) improvement, but on a use case-by-use case basis (see Figure 4).

big data and data science - Learning Farming Use Case Roadmap

Figure 4: Learning Farming Use Case Roadmap

Figure 4 provides a use case roadmap for how farmers can leverage big data to improve yields (increase yields by 40%) and reduces costs (decrease costs by 20%), but does it one a use case-by-use case basis.  The big bang effect – spending 5+ years and $15M+ before realizing any financial benefits – have gone the way of ERP implementations.

Instead, organizations have an opportunity to build out their big data architectures, technologies, data science and data lake one ROI-positive use case at a time.

Lesson #4: Make it about the money!

“I don’t care about the 3 V’s of Big Data; show me the 4 M’s of Big Data:  Make me more money!”

In the end, big data is about the 4 M’s of Big Data:  “Make Me More Money” (see Figure 5).

big data and data science - Cartoon courtesy of Timo Elliott (@timoelliott)

Figure 5: Cartoon courtesy of Timo Elliott (@timoelliott)


For example, crop establishment requires farmers to make critical planting decisions including how to plant (e.g., pre-germination, seeding depth, broadcasting) given different soil situations (e.g., wet, dry, soil acidity, soil composition), seed rates, replanting options and pest control.

big data and data science - Financial Impact of Crop Stand Establishment

Figure 6: Financial Impact of Crop Stand Establishment

Figure 6 shows the analytics that not only provides recommendations across the different crop stand variables and decisions, but even shows an estimate as to the financial impact of those decisions.

Now if that’s not a “Make Me More Money” analysis, then I don’t know what is!

Aglytix Summary

Big Data is starting to show up everywhere, and is no longer just the dominion of large organizations. This is the democratization of big data, where organizations of any size can leverage data and analytics to power their business models and win against larger competitors. The big data revolution will truly be in full swing when organizations of all sizes embrace analytics as a business discipline (think “Big Data MBA”), and not just something that IT does for them.

Also, I will be speaking at the PrecisionAg Conference October 12 in Phoenix.  I expect to again be meeting some truly brilliant people!


The post Heart of the Data Science Revolution… Mankato, MN? appeared first on InFocus Blog | Dell EMC Services.

Read the original blog entry...

More Stories By William Schmarzo

Bill Schmarzo, author of “Big Data: Understanding How Data Powers Big Business” and “Big Data MBA: Driving Business Strategies with Data Science”, is responsible for setting strategy and defining the Big Data service offerings for Dell EMC’s Big Data Practice. As a CTO within Dell EMC’s 2,000+ person consulting organization, he works with organizations to identify where and how to start their big data journeys. He’s written white papers, is an avid blogger and is a frequent speaker on the use of Big Data and data science to power an organization’s key business initiatives. He is a University of San Francisco School of Management (SOM) Executive Fellow where he teaches the “Big Data MBA” course. Bill also just completed a research paper on “Determining The Economic Value of Data”. Onalytica recently ranked Bill as #4 Big Data Influencer worldwide. Bill has over three decades of experience in data warehousing, BI and analytics. Bill authored the Vision Workshop methodology that links an organization’s strategic business initiatives with their supporting data and analytic requirements. Bill serves on the City of San Jose’s Technology Innovation Board, and on the faculties of The Data Warehouse Institute and Strata. Previously, Bill was vice president of Analytics at Yahoo where he was responsible for the development of Yahoo’s Advertiser and Website analytics products, including the delivery of “actionable insights” through a holistic user experience. Before that, Bill oversaw the Analytic Applications business unit at Business Objects, including the development, marketing and sales of their industry-defining analytic applications. Bill holds a Masters Business Administration from University of Iowa and a Bachelor of Science degree in Mathematics, Computer Science and Business Administration from Coe College.

Latest Stories
DX World EXPO, LLC, a Lighthouse Point, Florida-based startup trade show producer and the creator of "DXWorldEXPO® - Digital Transformation Conference & Expo" has announced its executive management team. The team is headed by Levent Selamoglu, who has been named CEO. "Now is the time for a truly global DX event, to bring together the leading minds from the technology world in a conversation about Digital Transformation," he said in making the announcement.
"Space Monkey by Vivent Smart Home is a product that is a distributed cloud-based edge storage network. Vivent Smart Home, our parent company, is a smart home provider that places a lot of hard drives across homes in North America," explained JT Olds, Director of Engineering, and Brandon Crowfeather, Product Manager, at Vivint Smart Home, in this SYS-CON.tv interview at @ThingsExpo, held Oct 31 – Nov 2, 2017, at the Santa Clara Convention Center in Santa Clara, CA.
SYS-CON Events announced today that Conference Guru has been named “Media Sponsor” of the 22nd International Cloud Expo, which will take place on June 5-7, 2018, at the Javits Center in New York, NY. A valuable conference experience generates new contacts, sales leads, potential strategic partners and potential investors; helps gather competitive intelligence and even provides inspiration for new products and services. Conference Guru works with conference organizers to pass great deals to gre...
DevOps is under attack because developers don’t want to mess with infrastructure. They will happily own their code into production, but want to use platforms instead of raw automation. That’s changing the landscape that we understand as DevOps with both architecture concepts (CloudNative) and process redefinition (SRE). Rob Hirschfeld’s recent work in Kubernetes operations has led to the conclusion that containers and related platforms have changed the way we should be thinking about DevOps and...
The Internet of Things will challenge the status quo of how IT and development organizations operate. Or will it? Certainly the fog layer of IoT requires special insights about data ontology, security and transactional integrity. But the developmental challenges are the same: People, Process and Platform. In his session at @ThingsExpo, Craig Sproule, CEO of Metavine, demonstrated how to move beyond today's coding paradigm and shared the must-have mindsets for removing complexity from the develop...
In his Opening Keynote at 21st Cloud Expo, John Considine, General Manager of IBM Cloud Infrastructure, led attendees through the exciting evolution of the cloud. He looked at this major disruption from the perspective of technology, business models, and what this means for enterprises of all sizes. John Considine is General Manager of Cloud Infrastructure Services at IBM. In that role he is responsible for leading IBM’s public cloud infrastructure including strategy, development, and offering m...
The next XaaS is CICDaaS. Why? Because CICD saves developers a huge amount of time. CD is an especially great option for projects that require multiple and frequent contributions to be integrated. But… securing CICD best practices is an emerging, essential, yet little understood practice for DevOps teams and their Cloud Service Providers. The only way to get CICD to work in a highly secure environment takes collaboration, patience and persistence. Building CICD in the cloud requires rigorous ar...
Companies are harnessing data in ways we once associated with science fiction. Analysts have access to a plethora of visualization and reporting tools, but considering the vast amount of data businesses collect and limitations of CPUs, end users are forced to design their structures and systems with limitations. Until now. As the cloud toolkit to analyze data has evolved, GPUs have stepped in to massively parallel SQL, visualization and machine learning.
"Evatronix provides design services to companies that need to integrate the IoT technology in their products but they don't necessarily have the expertise, knowledge and design team to do so," explained Adam Morawiec, VP of Business Development at Evatronix, in this SYS-CON.tv interview at @ThingsExpo, held Oct 31 – Nov 2, 2017, at the Santa Clara Convention Center in Santa Clara, CA.
To get the most out of their data, successful companies are not focusing on queries and data lakes, they are actively integrating analytics into their operations with a data-first application development approach. Real-time adjustments to improve revenues, reduce costs, or mitigate risk rely on applications that minimize latency on a variety of data sources. In his session at @BigDataExpo, Jack Norris, Senior Vice President, Data and Applications at MapR Technologies, reviewed best practices to ...
Widespread fragmentation is stalling the growth of the IIoT and making it difficult for partners to work together. The number of software platforms, apps, hardware and connectivity standards is creating paralysis among businesses that are afraid of being locked into a solution. EdgeX Foundry is unifying the community around a common IoT edge framework and an ecosystem of interoperable components.
"ZeroStack is a startup in Silicon Valley. We're solving a very interesting problem around bringing public cloud convenience with private cloud control for enterprises and mid-size companies," explained Kamesh Pemmaraju, VP of Product Management at ZeroStack, in this SYS-CON.tv interview at 21st Cloud Expo, held Oct 31 – Nov 2, 2017, at the Santa Clara Convention Center in Santa Clara, CA.
Large industrial manufacturing organizations are adopting the agile principles of cloud software companies. The industrial manufacturing development process has not scaled over time. Now that design CAD teams are geographically distributed, centralizing their work is key. With large multi-gigabyte projects, outdated tools have stifled industrial team agility, time-to-market milestones, and impacted P&L stakeholders.
"Akvelon is a software development company and we also provide consultancy services to folks who are looking to scale or accelerate their engineering roadmaps," explained Jeremiah Mothersell, Marketing Manager at Akvelon, in this SYS-CON.tv interview at 21st Cloud Expo, held Oct 31 – Nov 2, 2017, at the Santa Clara Convention Center in Santa Clara, CA.
Enterprises are adopting Kubernetes to accelerate the development and the delivery of cloud-native applications. However, sharing a Kubernetes cluster between members of the same team can be challenging. And, sharing clusters across multiple teams is even harder. Kubernetes offers several constructs to help implement segmentation and isolation. However, these primitives can be complex to understand and apply. As a result, it’s becoming common for enterprises to end up with several clusters. Thi...