SUN Jianqiang
- Research Center for Agricultural Information Technology, National Agriculture and Food Research Organization, Tokyo, Japan
- Botany
Recommendations: 0
Review: 1
Review: 1
Hierarchizing multi-scale environmental effects on agricultural pest population dynamics: a case study on the annual onset of Bactrocera dorsalis population growth in Senegalese orchards
Uncovering the ecology in big-data by hierarchizing multi-scale environmental effects
Recommended by Elodie Vercken based on reviews by Kévin Tougeron and Jianqiang SunAlong with the generalization of open-access practices, large, heterogeneous datasets are becoming increasingly available to ecologists (Farley et al. 2018). While such data offer exciting opportunities for unveiling original patterns and trends, they also raise new challenges regarding how to extract relevant information and actually improve our knowledge of complex ecological systems, beyond purely descriptive correlations (Dietze 2017, Farley et al. 2018).
In this work, Caumette et al. (2024) develop an original ecoinformatics approach to relate multi-scale environmental factors to the temporal dynamics of a major pest in mango orchards. Their method relies on the recent tree-boosting method GPBoost (Sigrist 2022) to hierarchize the influence of environmental factors of heterogeneous nature (e.g., orchard composition and management; landscape structure; climate) on the emergence date of the oriental fruit fly, Bactrocera dorsalis. As boosting methods allows the analysis of high-dimensional data, they are particularly adapted to the exploration of such datasets, to uncover unexpected, potentially complex dependencies between ecological dynamics and multiple environmental factors (Farley et al. 2018). In this article, Caumette et al. (2024) make a special effort to guide the reader step by step through their complex analysis pipeline to make it broadly understandable to the average ecologist, which is no small feat. I particularly welcome this commitment, as making new, cutting-edge analytical methods accessible to a large community of science practitioners with varying degrees of statistical or programming expertise is a major challenge for the future of quantitative ecology.
The main result of Caumette et al. (2024) is that temperature and humidity conditions both at the local and regional scales are the main predictors of B. dorsalis emergence date, while orchard management practices seem to have relatively little influence. This suggests that favourable climatic conditions may allow the persistence of small populations of B. dorsalis over the dry season, which may then act as a propagule source for early re-infestations. However, as the authors explain, the resulting regression model is not designed for predictive purposes and should not at this stage be used for decision-making in pest management. Its main interest rather resides in identifying potential key factors favoring early infestations of B. dorsalis, and help focusing future experimental field studies on the most relevant levers for integrated pest management in mango orchards.
In a wider perspective, this work also provides a convincing proof-of-concept for the use of boosting methods to identify the most influential factors in large, multivariate datasets in a variety of ecological systems. It is also crucial to keep in mind that the current exponential growth in high-throughput environmental data (Lucivero 2020) could quickly come into conflict with the need to reduce the environmental footprint of research (Mariette et al. 2022). In this context, robust and accessible methods for extracting and exploiting all the information available in already existing datasets might prove essential to a sustainable pursuit of science.
References
Caumette C, Diatta P, Piry S, Chapuis M-P, Faye E, Sigrist F, Martin O, Papaïx J, Brévault T, Berthier K. 2024. Hierarchizing multi-scale environmental effects on agricultural pest population dynamics: a case study on the annual onset of Bactrocera dorsalis population growth in Senegalese orchards. bioRxiv 2023.11.10.566583, ver. 3 peer-reviewed and recommended by Peer Community in Ecology. https://doi.org/10.1101/2023.11.10.566583
Dietze MC. 2017. Ecological Forecasting. Princeton University Press
Farley SS, Dawson A, Goring SJ, Williams JW. 2018. Situating Ecology as a Big-Data Science: Current Advances, Challenges, and Solutions. BioScience, 68, 563–576, https://doi.org/10.1093/biosci/biy068
Lucivero F. 2020. Big Data, Big Waste? A Reflection on the Environmental Sustainability of Big Data Initiatives. Science and Engineering Ethics 26, 1009–1030. https://doi.org/10.1007/s11948-019-00171-7
Mariette J, Blanchard O, Berné O, Aumont O, Carrey J, Ligozat A-L, Lellouch E, Roche P-E, Guennebaud G, Thanwerdas J, Bardou P, Salin G, Maigne E, Servan S, Ben-Ari T 2022. An open-source tool to assess the carbon footprint of research. Environmental Research: Infrastructure and Sustainability, 2022. https://dx.doi.org/10.1088/2634-4505/ac84a4
Sigrist F. 2022. Gaussian process boosting. The Journal of Machine Learning Research, 23, 10565-10610. https://jmlr.org/papers/v23/20-322.html