Bridging the Gap between Field Experiments and Machine Learning: The EC H2020 B-GOOD Project as a Case Study towards Automated Predictive Health Monitoring of Honey Bee Colonies

Coby van Dooremalen, Zeynep N. Ulgezen, Raffaele Dall'Olio, Ugoline Godeau, Xiaodong Duan, José Paulo Sousa, Marc Schäfer, Alexis Beaurepaire, Pim van Gennip, Marten Schoonman, Claude Flener, Severine Matthijs, David Claeys Boúúaert, Wim Verbeke, Dana Freshley, Dirk-Jan Valkenburg, Trudy van den Bosch, Famke Schaafsma, Jeroen Peters, Mang XuYves Le Conte, Cedric Alaux, Anne Dalmon, Robert J Paxton, Anja Tehel, Tabea Streicher, Daniel S. Dezmirean, Alexandru L. Giurgiu, Christopher John Topping, James Henty Williams, Nuno Capela, Sara Lopes, Fátima Alves, Joana Alves, João Bica, Sandra Simöes, António Alves da Silva, Silvia Castro, João Loureiro, Eva Horčičková, Martin Bencsik, Adam McVeigh, Tarun Kumar, Arrigo Moro, April van Delden, Elżbieta Ziółkowska, Michał Filipiak, Łukasz Mikołajczyk, Kirsten Leufgen, Lina de Smet, Dirk C. de Graaf

Research output: Contribution to journal/Conference contribution in journal/Contribution to newspaperJournal articleResearchpeer-review

Abstract

Honey bee colonies have great societal and economic importance. The main challenge that beekeepers face is keeping bee colonies healthy under ever-changing environmental conditions. In the past two decades, beekeepers that manage colonies of Western honey bees (Apis mellifera) have become increasingly concerned by the presence of parasites and pathogens affecting the bees, the reduction in pollen and nectar availability, and the colonies’ exposure to pesticides, among others. Hence, beekeepers need to know the health condition of their colonies and how to keep them alive and thriving, which creates a need for a new holistic data collection method to harmonize the flow of information from various sources that can be linked at the colony level for different health determinants, such as bee colony, environmental, socioeconomic, and genetic statuses. For this purpose, we have developed and implemented the B-GOOD (Giving Beekeeping Guidance by computational-assisted Decision Making) project as a case study to categorize the colony’s health condition and find a Health Status Index (HSI). Using a 3-tier setup guided by work plans and standardized protocols, we have collected data from inside the colonies (amount of brood, disease load, honey harvest, etc.) and from their environment (floral resource availability). Most of the project’s data was automatically collected by the BEEP Base Sensor System. This continuous stream of data served as the basis to determine and validate an algorithm to calculate the HSI using machine learning. In this article, we share our insights on this holistic methodology and also highlight the importance of using a standardized data language to increase the compatibility between different current and future studies. We argue that the combined management of big data will be an essential building block in the development of targeted guidance for beekeepers and for the future of sustainable beekeeping.
Original languageEnglish
Article number76
JournalInsects
Volume15
Issue1
Pages (from-to)1-22
Number of pages22
ISSN2075-4450
DOIs
Publication statusPublished - Jan 2024

Keywords

  • bee data portal
  • beekeeping
  • big data on honey bee colonies
  • data collection method
  • data standardization and harmonization
  • honey bee automated health monitoring
  • stakeholder involvement in research
  • work plans and protocols

Fingerprint

Dive into the research topics of 'Bridging the Gap between Field Experiments and Machine Learning: The EC H2020 B-GOOD Project as a Case Study towards Automated Predictive Health Monitoring of Honey Bee Colonies'. Together they form a unique fingerprint.

Cite this