Recommendation

Spatial patterns and autocorrelation challenges in ecological conservation

Eric Goberville based on reviews by Nigel Yoccoz and Charles J Marsh

A recommendation of:

Efficient sampling designs to assess biodiversity spatial autocorrelation : should we go fractal?

Fabien Laroche (2023), bioRxiv, ver.4, peer-reviewed and recommended by PCI Ecology https://doi.org/10.1101/2022.07.29.501974

Read preprint in preprint server Now published in Peer Community Journal

Codes used in this study

Scripts used to obtain or analyze results

Abstract

EN

AR

ES

FR

HI

JA

PT

RU

ZH-CN

Efficient sampling designs to assess biodiversity spatial autocorrelation : should we go fractal?

Quantifying the autocorrelation range of species distribution in space is necessary for applied ecological questions, like implementing protected area networks or monitoring programs. However, the power of spatial sampling designs to estimate this range is negatively related with other objectives such as estimating environmental effects acting upon species distribution. Mixing random sampling points and systematic grid (`hybrid' designs) is a classic solution to make a trade-off. However, fractal designs (i.e. self-similar designs with well-identified scales) could make an even better compromise, because they cover a wide array of possible autocorrelation range values across scales. Using maximum likelihood estimation in an optimal design of experiments approach, we compared errors of hybrid and fractal designs when simultaneously estimating an effect acting upon a response variable and the residual autocorrelation range. We found that Pareto-optimal sampling strategies depended on the feasible grid mesh size (FGMS) over the study area, given the sampling budget. When the FMGS was shorter than expected autocorrelation range values, grid design was the best option on all criteria. When the FMGS was around or larger than expected autocorrelation range values, the choice of designs depended on the effect under study. Fractal designs outperformed hybrid designs when studying the effect of a monotonic environmental gradient across space, while grid design was more efficient for other types of question. Beyond the niche identified in our analysis, fractal designs may also appear interesting when studying response variables with more heterogeneous spatial structure across scales, and when considering more practical criteria of performance such as the distance needed to cover the design.

beta-diversity; distance-decay; fractal; maximum likelihood; model-based inference; optimal design; sampling design; spatial autocorrelation

This is an automatically generated version. The authors and PCI decline all responsibility concerning its content

تصاميم فعالة لأخذ العينات لتقييم الارتباط الذاتي المكاني للتنوع البيولوجي: هل يجب أن نذهب إلى النمط الكسري؟

يعد قياس نطاق الارتباط الذاتي لتوزيع الأنواع في الفضاء أمرًا ضروريًا للمسائل البيئية التطبيقية، مثل تنفيذ شبكات المناطق المحمية أو برامج المراقبة. ومع ذلك، فإن قوة تصاميم أخذ العينات المكانية لتقدير هذا النطاق ترتبط سلبًا بأهداف أخرى مثل تقدير التأثيرات البيئية التي تؤثر على توزيع الأنواع. يعد خلط نقاط أخذ العينات العشوائية والشبكة المنهجية (التصميمات "المختلطة") حلاً كلاسيكيًا لإجراء مقايضة. ومع ذلك، فإن التصميمات الكسورية (أي التصاميم المتشابهة ذاتيًا ذات المقاييس المحددة جيدًا) يمكن أن تقدم حلاً وسطًا أفضل، لأنها تغطي مجموعة واسعة من قيم نطاق الارتباط الذاتي المحتملة عبر المقاييس. باستخدام تقدير الاحتمال الأقصى في التصميم الأمثل لنهج التجارب، قمنا بمقارنة أخطاء التصميمات الهجينة والكسورية عند تقدير التأثير الذي يعمل على متغير الاستجابة ونطاق الارتباط الذاتي المتبقي في نفس الوقت. لقد وجدنا أن استراتيجيات أخذ العينات باريتو الأمثل تعتمد على حجم شبكة الشبكة الممكنة (FGMS) على منطقة الدراسة، نظرا لميزانية أخذ العينات. عندما كان FMGS أقصر من قيم نطاق الارتباط التلقائي المتوقع، كان تصميم الشبكة هو الخيار الأفضل في جميع المعايير. عندما كانت FMGS حول أو أكبر من قيم نطاق الارتباط الذاتي المتوقع، فإن اختيار التصاميم يعتمد على التأثير قيد الدراسة. تفوقت التصاميم الكسورية على التصاميم الهجينة عند دراسة تأثير التدرج البيئي الرتيب عبر الفضاء، في حين كان تصميم الشبكة أكثر كفاءة لأنواع أخرى من الأسئلة. وبعيدًا عن المجال المحدد في تحليلنا، قد تبدو التصميمات الكسورية أيضًا مثيرة للاهتمام عند دراسة متغيرات الاستجابة ذات البنية المكانية غير المتجانسة عبر المقاييس، وعند النظر في معايير أكثر عملية للأداء مثل المسافة اللازمة لتغطية التصميم.

التنوع بيتا؛ اضمحلال المسافة؛ كسورية. أقصى احتمال؛ الاستدلال القائم على النموذج؛ التصميم الأمثل تصميم العينات؛ الارتباط الذاتي المكاني

This is an automatically generated version. The authors and PCI decline all responsibility concerning its content

Diseños de muestreo eficientes para evaluar la autocorrelación espacial de la biodiversidad: ¿deberíamos volvernos fractales?

Cuantificar el rango de autocorrelación de la distribución de especies en el espacio es necesario para cuestiones ecológicas aplicadas, como la implementación de redes de áreas protegidas o programas de monitoreo. Sin embargo, el poder de los diseños de muestreo espacial para estimar este rango está negativamente relacionado con otros objetivos como la estimación de los efectos ambientales que actúan sobre la distribución de las especies. Combinar puntos de muestreo aleatorios y cuadrículas sistemáticas (diseños "híbridos") es una solución clásica para llegar a un acuerdo. Sin embargo, los diseños fractales (es decir, diseños autosemejantes con escalas bien identificadas) podrían lograr un compromiso aún mejor, porque cubren una amplia gama de posibles valores de rango de autocorrelación entre escalas. Utilizando la estimación de máxima verosimilitud en un enfoque de diseño óptimo de experimentos, comparamos los errores de diseños híbridos y fractales al estimar simultáneamente un efecto que actúa sobre una variable de respuesta y el rango de autocorrelación residual. Descubrimos que las estrategias de muestreo óptimas de Pareto dependían del tamaño de malla de la cuadrícula factible (FGMS) en el área de estudio, dado el presupuesto de muestreo. Cuando el FMGS era más corto que los valores del rango de autocorrelación esperados, el diseño de cuadrícula fue la mejor opción en todos los criterios. Cuando el FMGS estaba alrededor o era mayor que los valores del rango de autocorrelación esperados, la elección de los diseños dependía del efecto bajo estudio. Los diseños fractales superaron a los diseños híbridos al estudiar el efecto de un gradiente ambiental monótono en el espacio, mientras que el diseño de cuadrícula fue más eficiente para otros tipos de preguntas. Más allá del nicho identificado en nuestro análisis, los diseños fractales también pueden parecer interesantes cuando se estudian variables de respuesta con una estructura espacial más heterogénea en todas las escalas y cuando se consideran criterios de desempeño más prácticos, como la distancia necesaria para cubrir el diseño.

diversidad beta; decadencia de distancia; fractal; máxima verosimilitud; inferencia basada en modelos; diseño óptimo; diseño de muestreo; autocorrelación espacial

This is an automatically generated version. The authors and PCI decline all responsibility concerning its content

Des plans d’échantillonnage efficaces pour évaluer l’autocorrélation spatiale de la biodiversité : faut-il passer au fractal ?

Quantifier la plage d'autocorrélation de la répartition des espèces dans l'espace est nécessaire pour les questions écologiques appliquées, comme la mise en œuvre de réseaux de zones protégées ou de programmes de surveillance. Cependant, la puissance des plans d'échantillonnage spatial pour estimer cette plage est négativement liée à d'autres objectifs tels que l'estimation des effets environnementaux agissant sur la répartition des espèces. Mélanger des points d'échantillonnage aléatoires et une grille systématique (conceptions « hybrides ») est une solution classique pour faire un compromis. Cependant, les conceptions fractales (c'est-à-dire les conceptions auto-similaires avec des échelles bien identifiées) pourraient constituer un compromis encore meilleur, car elles couvrent un large éventail de valeurs de plage d'autocorrélation possibles à travers les échelles. En utilisant l'estimation du maximum de vraisemblance dans une approche de conception optimale d'expériences, nous avons comparé les erreurs de conceptions hybrides et fractales lors de l'estimation simultanée d'un effet agissant sur une variable de réponse et sur la plage d'autocorrélation résiduelle. Nous avons constaté que les stratégies d'échantillonnage Pareto-optimales dépendaient de la taille réalisable du maillage de la grille (FGMS) sur la zone d'étude, compte tenu du budget d'échantillonnage. Lorsque le FMGS était plus court que les valeurs attendues de la plage d’autocorrélation, la conception de la grille était la meilleure option pour tous les critères. Lorsque le FMGS était proche ou supérieur aux valeurs attendues de la plage d'autocorrélation, le choix des modèles dépendait de l'effet étudié. Les conceptions fractales ont surpassé les conceptions hybrides lors de l’étude de l’effet d’un gradient environnemental monotone dans l’espace, tandis que la conception en grille s’est révélée plus efficace pour d’autres types de questions. Au-delà de la niche identifiée dans notre analyse, les conceptions fractales peuvent également sembler intéressantes lors de l'étude de variables de réponse avec une structure spatiale plus hétérogène à travers les échelles, et lorsque l'on considère des critères de performance plus pratiques tels que la distance nécessaire pour parcourir la conception.

bêta-diversité ; décroissance de la distance ; fractale; plausibilité maximum; inférence basée sur un modèle ; conception optimale ; plan d'échantillonnage ; autocorrélation spatiale

This is an automatically generated version. The authors and PCI decline all responsibility concerning its content

जैव विविधता स्थानिक स्वसहसंबंध का आकलन करने के लिए कुशल नमूना डिजाइन: क्या हमें फ्रैक्टल जाना चाहिए?

संरक्षित क्षेत्र नेटवर्क या निगरानी कार्यक्रमों को लागू करने जैसे व्यावहारिक पारिस्थितिक प्रश्नों के लिए अंतरिक्ष में प्रजातियों के वितरण की स्वत: सहसंबंध सीमा की मात्रा निर्धारित करना आवश्यक है। हालाँकि, इस सीमा का अनुमान लगाने के लिए स्थानिक नमूनाकरण डिज़ाइन की शक्ति अन्य उद्देश्यों से नकारात्मक रूप से संबंधित है जैसे कि प्रजातियों के वितरण पर कार्य करने वाले पर्यावरणीय प्रभावों का अनुमान लगाना। यादृच्छिक नमूनाकरण बिंदुओं और व्यवस्थित ग्रिड (`हाइब्रिड' डिज़ाइन) को मिलाना एक व्यापार-बंद बनाने का एक क्लासिक समाधान है। हालाँकि, फ्रैक्टल डिज़ाइन (यानी अच्छी तरह से पहचाने गए पैमानों के साथ स्व-समान डिज़ाइन) एक और भी बेहतर समझौता कर सकते हैं, क्योंकि वे सभी पैमानों पर संभावित ऑटोसहसंबंध रेंज मानों की एक विस्तृत श्रृंखला को कवर करते हैं। प्रयोग दृष्टिकोण के इष्टतम डिजाइन में अधिकतम संभावना अनुमान का उपयोग करते हुए, हमने एक प्रतिक्रिया चर और अवशिष्ट ऑटोसहसंबंध सीमा पर अभिनय करने वाले प्रभाव का अनुमान लगाते समय हाइब्रिड और फ्रैक्टल डिजाइन की त्रुटियों की तुलना की। हमने पाया कि पेरेटो-इष्टतम नमूनाकरण रणनीतियाँ नमूना बजट को देखते हुए, अध्ययन क्षेत्र में व्यवहार्य ग्रिड जाल आकार (एफजीएमएस) पर निर्भर करती हैं। जब एफएमजीएस अपेक्षित ऑटोसहसंबंध सीमा मूल्यों से कम था, तो ग्रिड डिजाइन सभी मानदंडों पर सबसे अच्छा विकल्प था। जब एफएमजीएस अपेक्षित ऑटोसहसंबंध सीमा मूल्यों के आसपास या उससे बड़ा था, तो डिजाइन की पसंद अध्ययन के तहत प्रभाव पर निर्भर करती थी। अंतरिक्ष में एक मोनोटोनिक पर्यावरणीय ढाल के प्रभाव का अध्ययन करते समय फ्रैक्टल डिज़ाइन ने हाइब्रिड डिज़ाइन से बेहतर प्रदर्शन किया, जबकि ग्रिड डिज़ाइन अन्य प्रकार के प्रश्नों के लिए अधिक कुशल था। हमारे विश्लेषण में पहचाने गए आला से परे, स्केल में अधिक विषम स्थानिक संरचना के साथ प्रतिक्रिया चर का अध्ययन करते समय, और डिज़ाइन को कवर करने के लिए आवश्यक दूरी जैसे प्रदर्शन के अधिक व्यावहारिक मानदंडों पर विचार करते समय फ्रैक्टल डिज़ाइन भी दिलचस्प लग सकते हैं।

बीटा-विविधता; दूरी-क्षय; भग्न; अधिकतम संभाव्यता; मॉडल-आधारित अनुमान; इष्टतम डिजाइन; नमूना डिजाइन; स्थानिक स्वसहसंबंध

This is an automatically generated version. The authors and PCI decline all responsibility concerning its content

生物多様性の空間的自己相関を評価するための効率的なサンプリング設計: フラクタルにする必要がありますか?

宇宙における種の分布の自己相関範囲を定量化することは、保護地域ネットワークの導入や監視プログラムなどの応用生態学的問題に必要です。しかし、この範囲を推定するための空間サンプリング設計の力は、種の分布に作用する環境影響の推定などの他の目的とは負の関係にあります。ランダムなサンプリングポイントと体系的なグリッド (「ハイブリッド」設計) を混合することは、トレードオフを実現する古典的な解決策です。ただし、フラクタル設計 (つまり、スケールが明確に識別された自己相似設計) は、スケール全体で考えられる自己相関範囲の値を広範囲にカバーしているため、さらに良い妥協策を講じることができます。最適実験計画法の最尤推定を使用して、応答変数に作用する効果と残差自己相関範囲を同時に推定する際のハイブリッド計画とフラクタル計画の誤差を比較しました。パレート最適サンプリング戦略は、サンプリング予算が与えられた場合、調査領域全体の実行可能なグリッドメッシュサイズ (FGMS) に依存することがわかりました。 FMGS が予想される自己相関範囲の値よりも短かった場合、グリッド設計がすべての基準において最良の選択肢でした。 FMGS が予想される自己相関範囲の値付近またはそれよりも大きい場合、設計の選択は研究対象の効果に依存します。空間にわたる単調な環境勾配の影響を研究する場合、フラクタルデザインはハイブリッドデザインよりも優れたパフォーマンスを発揮しましたが、他の種類の質問ではグリッドデザインの方が効率的でした。フラクタル設計は、分析で特定されたニッチを超えて、スケール全体でより不均一な空間構造を持つ応答変数を研究する場合や、設計をカバーするために必要な距離などのより実用的なパフォーマンス基準を考慮する場合にも興味深いと思われる可能性があります。

ベータ多様性。距離減衰;フラクタル;最大の可能性。モデルベースの推論。最適な設計。サンプリング設計。空間的自己相関

This is an automatically generated version. The authors and PCI decline all responsibility concerning its content

Desenhos amostrais eficientes para avaliar a autocorrelação espacial da biodiversidade: devemos ir para o fractal?

Quantificar a faixa de autocorrelação da distribuição de espécies no espaço é necessário para questões ecológicas aplicadas, como a implementação de redes de áreas protegidas ou programas de monitoramento. No entanto, o poder dos desenhos de amostragem espacial para estimar esta amplitude está negativamente relacionado com outros objectivos, tais como estimar os efeitos ambientais que actuam sobre a distribuição das espécies. Misturar pontos de amostragem aleatórios e grades sistemáticas (desenhos “híbridos”) é uma solução clássica para fazer uma compensação. No entanto, designs fractais (ou seja, designs auto-semelhantes com escalas bem identificadas) poderiam constituir um compromisso ainda melhor, porque cobrem uma ampla gama de possíveis valores de faixa de autocorrelação entre escalas. Usando estimativa de máxima verossimilhança em uma abordagem de planejamento ideal de experimentos, comparamos erros de projetos híbridos e fractais ao estimar simultaneamente um efeito agindo sobre uma variável de resposta e a faixa de autocorrelação residual. Descobrimos que as estratégias de amostragem ótimas de Pareto dependiam do tamanho viável da malha da grade (FGMS) na área de estudo, dado o orçamento de amostragem. Quando o FMGS foi menor que os valores esperados do intervalo de autocorrelação, o desenho da grade foi a melhor opção em todos os critérios. Quando o FMGS estava próximo ou maior que os valores esperados da faixa de autocorrelação, a escolha dos desenhos dependeu do efeito em estudo. Os projetos fractais superaram os projetos híbridos ao estudar o efeito de um gradiente ambiental monotônico no espaço, enquanto o projeto de grade foi mais eficiente para outros tipos de questões. Além do nicho identificado em nossa análise, os designs fractais também podem parecer interessantes ao estudar variáveis de resposta com estrutura espacial mais heterogênea entre escalas e ao considerar critérios de desempenho mais práticos, como a distância necessária para cobrir o design.

diversidade beta; decadência da distância; fractal; probabilidade máxima; inferência baseada em modelos; design ideal; desenho amostral; autocorrelação espacial

This is an automatically generated version. The authors and PCI decline all responsibility concerning its content

Эффективные схемы выборки для оценки пространственной автокорреляции биоразнообразия: стоит ли нам идти на фрактал?

Количественная оценка диапазона автокорреляции распределения видов в космосе необходима для решения прикладных экологических вопросов, таких как создание сетей охраняемых территорий или программ мониторинга. Однако способность пространственной выборки оценить этот диапазон отрицательно связана с другими целями, такими как оценка воздействия окружающей среды на распределение видов. Смешение точек случайной выборки и систематической сетки («гибридные» схемы) является классическим решением, позволяющим найти компромисс. Однако фрактальные схемы (т. е. самоподобные схемы с четко определенными масштабами) могут стать еще лучшим компромиссом, поскольку они охватывают широкий спектр возможных значений диапазона автокорреляции в разных масштабах. Используя оценку максимального правдоподобия в подходе оптимального планирования экспериментов, мы сравнили ошибки гибридного и фрактального планов при одновременной оценке эффекта, действующего на переменную отклика и диапазон остаточной автокорреляции. Мы обнаружили, что оптимальные по Парето стратегии выборки зависят от возможного размера сетки сетки (FGMS) на исследуемой территории с учетом бюджета выборки. Когда FMGS был короче ожидаемых значений диапазона автокорреляции, лучшим вариантом по всем критериям была конструкция сетки. Когда FMGS был около или превышал ожидаемые значения диапазона автокорреляции, выбор планов зависел от изучаемого эффекта. Фрактальные конструкции превзошли гибридные схемы при изучении эффекта монотонного градиента окружающей среды в пространстве, тогда как сеточная конструкция оказалась более эффективной для других типов вопросов. Помимо ниши, определенной в нашем анализе, фрактальные конструкции также могут оказаться интересными при изучении переменных отклика с более неоднородной пространственной структурой в разных масштабах, а также при рассмотрении более практических критериев производительности, таких как расстояние, необходимое для покрытия конструкции.

бета-разнообразие; расстояние-распад; фрактал; максимальная вероятность; вывод на основе модели; оптимальная конструкция; дизайн выборки; пространственная автокорреляция

This is an automatically generated version. The authors and PCI decline all responsibility concerning its content

评估生物多样性空间自相关的有效抽样设计：我们应该采用分形吗？

量化物种在空间分布的自相关范围对于应用生态问题（例如实施保护区网络或监测计划）是必要的。然而，空间抽样设计估计该范围的能力与其他目标（例如估计对物种分布的环境影响）负相关。混合随机采样点和系统网格（“混合”设计）是进行权衡的经典解决方案。然而，分形设计（即具有明确尺度的自相似设计）可以做出更好的妥协，因为它们涵盖了跨尺度的各种可能的自相关范围值。在最佳实验设计方法中使用最大似然估计，我们在同时估计作用于响应变量和残差自相关范围的效应时比较了混合设计和分形设计的误差。我们发现，在给定抽样预算的情况下，帕累托最优抽样策略取决于研究区域的可行网格尺寸（FGMS）。当 FMGS 短于预期自相关范围值时，网格设计是所有标准的最佳选择。当 FMGS 接近或大于预期自相关范围值时，设计的选择取决于所研究的效果。在研究跨空间的单调环境梯度的影响时，分形设计优于混合设计，而网格设计对于其他类型的问题更有效。除了我们分析中确定的利基之外，在研究跨尺度具有更异构空间结构的响应变量时，以及在考虑更实际的性能标准（例如覆盖设计所需的距离）时，分形设计也可能显得有趣。

β-多样性；距离衰减；分形；最大似然;基于模型的推理；优化设计；抽样设计；空间自相关

Submission: posted 21 April 2023, validated 24 April 2023
Recommendation: posted 02 January 2024, validated 03 January 2024

Cite this recommendation as:
Goberville, E. (2024) Spatial patterns and autocorrelation challenges in ecological conservation. Peer Community in Ecology, 100536. https://doi.org/10.24072/pci.ecology.100536

Recommendation

“Pattern, like beauty, is to some extent in the eye of the beholder” (Grant 1977 in Wiens, 1989)

Ecologists are immersed in unraveling the complex spatial patterns that govern species diversity, driven by both practical and theoretical imperatives (Rahbek, 2005; Wang et al., 2019). This dual focus necessitates a practical imperative for strategic biodiversity conservation, requiring a nuanced understanding of locations with peak species richness and dynamic shifts in species assemblages (Chase et al., 2020). Simultaneously, there is a theoretical interest in using diversity patterns as empirical testing grounds for theories explaining factors influencing diversity disparities and the associated increase in species turnover correlated with inter-site distance (Condit et al., 2002).

McGill (2010), in his paper "Matters of Scale", highlights the scale-dependent nature of ecology, aligning with the recognition that spatial autocorrelation is inherent in biogeographical data and often correlated with sample size (Rahbek, 2005). Spatial autocorrelation, often underestimated in ecological studies (Dormann, 2007), occurs when proximate locations exhibit similarities in ecological attributes (Tobler, 1970; Getis, 2010), introducing a latent bias that compromises the robustness of ecological findings (Dormann, 2007; Dormann et al., 2007). This phenomenon serves as both an asset, providing valuable information for inferring processes from patterns (Palma et al. 1999), and a challenge, imposing limitations on hypothesis testing and prediction (Dormann et al., 2007 and references therein). Various factors contribute to spatial autocorrelation, with three primary contributors (Dormann et al., 2007; Legendre, 1993; Legendre and Fortin, 1989; Legendre and Legendre, 2012): (i) distance-related effects in biological processes, (ii) misrepresentation of non-linear relationships between the environment and species as linear and (iii) the oversight of a crucial spatially structured environmental determinant in the statistical model, leading to spatial structuring in the response (Dormann et al., 2007).

Recognising the pivotal role of spatial heterogeneity in ecological theories (Wang et al., 2019), it becomes imperative to discern and address the limitations introduced by spatial autocorrelation (Legendre, 1993). McGill (2011) emphasises that the ultimate goal of biodiversity pattern studies should be to develop a quantitative predictive theory useful for conservation. The spatial dimension's importance in study planning, determining the system's scale, appropriate quadrat size, and spacing between sampling stations, is paramount (Fortin, 1999a,b). Responses to these considerations are intricately linked with study objectives and insights from pre-sampling campaigns, underscoring the need for a nuanced and rigorous approach (Delmelle, 2021).

Understanding statistical techniques and nested sampling designs is crucial to answering fundamental ecological questions (Dormann et al., 2007; McDonald, 2012). In addressing spatial autocorrelation challenges, ecologists must recognize the limitations of many standard statistical methods in ecological studies (Dale and Fortin, 2002; Legendre and Fortin, 1989; Steel et al., 2013). In the initial phases of description or hypothesis generation, ecologists should proactively acknowledge the spatial structure in their data and conduct tests for spatial autocorrelation (for a comprehensive description, see Legendre and Fortin, 1989): various tools, including correlograms, spectral analysis, the Mantel test, and clustering methods, facilitate the assessment and description of spatial structures. The partial Mantel test enables the study of causal models with space as an explanatory variable. Techniques for mapping ecological variables, such as interpolation, trend surface analysis, and constrained clustering, yield maps providing valuable insights into the spatial dynamics of ecological systems.

This refined consideration of spatial autocorrelation emerges as an imperative in ecological research, fostering a deeper and more precise understanding of the intricate interplay between species diversity, spatial patterns, and the inherent limitations imposed by spatial autocorrelation (Legendre et al., 2002). This not only contributes significantly to the scientific discourse in ecology but also aligns with McGill's vision of developing predictive theories for effective conservation (Bacaro et al., 2016; McGill, 2011).

In this study by Fabien Laroche (2023), titled “Efficient sampling designs to assess biodiversity spatial autocorrelation: should we go fractal?” the primary focus was on addressing the challenges associated with estimating the autocorrelation range of species distribution across spatial scales. The study aimed to explore alternative sampling designs, with a particular focus on the application of fractal designs—self-similar designs with well-identified scales. The overarching goal was to evaluate whether fractal designs could offer a more efficient compromise compared to traditional hybrid designs, which involve mixing random sampling points with a systematic grid.

Virtual ecology provides a way to test whether sampling designs can accurately detect or quantify effects of interest before implementing them in the field. Beyond the question of assessing the power of empirical designs, a virtual ecology analysis contributes to clearly formulating the set of questions associated with a design. However, only a few virtual studies have focused on efficient designs to accurately estimate the autocorrelation range of biodiversity variables. In this study, the statistical framework of optimal design of experiments was employed—a methodology often used in building and comparing designs of temporal or spatiotemporal biodiversity surveys but rarely applied to the specific problem of quantifying spatial autocorrelation.

Key findings from the study shed light on optimal sampling strategies, with a notable dependence on the feasible grid mesh size over the study area in relation to expected autocorrelation range values. The results demonstrated that the efficiency of designs varied based on the specific effect under study. Fractal designs, however, exhibited superior performance, particularly when assessing the effect of a monotonic environmental gradient across space.

In conclusion, the study provides valuable insights into the potential benefits of incorporating fractal designs in biodiversity studies, offering a nuanced and efficient approach to estimate spatial autocorrelation. These findings contribute significantly to the ongoing scientific discourse in ecology, providing practical considerations for improving sampling designs in biodiversity assessments.

References

Bacaro, G., Altobelli, A., Cameletti, M., Ciccarelli, D., Martellos, S., Palmer, M.W., Ricotta, C., Rocchini, D., Scheiner, S.M., Tordoni, E., Chiarucci, A., 2016. Incorporating spatial autocorrelation in rarefaction methods: Implications for ecologists and conservation biologists. Ecological Indicators 69, 233-238. https://doi.org/10.1016/j.ecolind.2016.04.026

Chase, J.M., Jeliazkov, A., Ladouceur, E., Viana, D.S., 2020. Biodiversity conservation through the lens of metacommunity ecology. Annals of the New York Academy of Sciences 1469, 86-104. https://doi.org/10.1111/nyas.14378

Condit, R., Pitman, N., Leigh, E.G., Chave, J., Terborgh, J., Foster, R.B., Núñez, P., Aguilar, S., Valencia, R., Villa, G., Muller-Landau, H.C., Losos, E., Hubbell, S.P., 2002. Beta-Diversity in Tropical Forest Trees. Science 295, 666-669. https://doi.org/10.1126/science.1066854

Dale, M.R.T., Fortin, M.-J., 2002. Spatial autocorrelation and statistical tests in ecology. Écoscience 9, 162-167. https://doi.org/10.1080/11956860.2002.11682702

Delmelle, E.M., 2021. Spatial Sampling, in: Fischer, M.M., Nijkamp, P. (Eds.), Handbook of Regional Science. Springer Berlin Heidelberg, Berlin, Heidelberg, pp. 1829-1844.

Dormann, C.F., 2007. Effects of incorporating spatial autocorrelation into the analysis of species distribution data. Global Ecology & Biogeography 16, 129-128. https://doi.org/10.1111/j.1466-8238.2006.00279.x

Dormann, C.F., McPherson, J.M., Araújo, M.B., Bivand, R., Bolliger, J., Carl, G., Davies, R.G., Hirzel, A., Jetz, W., Kissling, W.D., Kühn, I., Ohlemüler, R., Peres-Neto, P.R., Reineking, B., Schröder, B., Schurr, F.M., Wilson, R., 2007. Methods to account for spatial autocorrelation in the analysis of species distributional data: a review. Ecography 33, 609-628. https://doi.org/10.1111/j.2007.0906-7590.05171.x

Fortin, M.-J., 1999a. Effects of quadrat size and data measurement on the detection of boundaries. Journal of Vegetation Science 10, 43-50. https://doi.org/10.2307/3237159

Fortin, M.-J., 1999b. Effects of sampling unit resolution on the estimation of spatial autocorrelation. Écoscience 6, 636-641. https://doi.org/10.1080/11956860.1999.11682547

Getis, A., 2010. Spatial Autocorrelation, in: Fischer, M.M., Getis, A. (Eds.), Handbook of Applied Spatial Analysis: Software Tools, Methods and Applications. Springer Berlin Heidelberg, Berlin, Heidelberg, pp. 255-278.

Laroche, F., 2023. Efficient sampling designs to assess biodiversity spatial autocorrelation: should we go fractal? bioRxiv, 2022.07.29.501974, ver. 4 peer-reviewed and recommended by Peer Community in Ecology. https://doi.org/10.1101/2022.07.29.501974

Legendre, P., 1993. Spatial Autocorrelation: Trouble or New Paradigm? Ecology 74, 1659-1673. https://doi.org/10.2307/1939924

Legendre, P., Dale, M.R.T., Fortin, M.-J., Gurevitch, J., Hohn, M., Myers, D., 2002. The consequences of spatial structure for the design and analysis of ecological field surveys. Ecography 25, 601-615. https://doi.org/10.1034/j.1600-0587.2002.250508.x

Legendre, P., Fortin, M.J., 1989. Spatial pattern and ecological analysis. Vegetatio 80, 107-138. https://doi.org/10.1007/BF00048036

Legendre, P., Legendre, L., 2012. Numerical Ecology, Third Edition ed. Elsevier, The Netherlands.

McDonald, T., 2012. Spatial sampling designs for long-term ecological monitoring, in: Cooper, A.B., Gitzen, R.A., Licht, D.S., Millspaugh, J.J. (Eds.), Design and Analysis of Long-term Ecological Monitoring Studies. Cambridge University Press, Cambridge, pp. 101-125.

McGill, B.J., 2010. Matters of Scale. Science 328, 575-576. https://doi.org/10.1126/science.1188528

McGill, B.J., 2011. Linking biodiversity patterns by autocorrelated random sampling. American Journal of Botany 98, 481-502. https://doi.org/10.3732/ajb.1000509

Rahbek, C., 2005. The role of spatial scale and the perception of large-scale species-richness patterns. Ecology Letters 8, 224-239. https://doi.org/10.1111/j.1461-0248.2004.00701.x

Steel, E.A., Kennedy, M.C., Cunningham, P.G., Stanovick, J.S., 2013. Applied statistics in ecology: common pitfalls and simple solutions. Ecosphere 4, art115. https://doi.org/10.1890/ES13-00160.1

Tobler, W.R., 1970. A Computer Movie Simulating Urban Growth in the Detroit Region. Economic Geography 46, 234-240. https://doi.org/10.2307/143141

Wang, S., Lamy, T., Hallett, L.M., Loreau, M., 2019. Stability and synchrony across ecological hierarchies in heterogeneous metacommunities: linking theory to data. Ecography 42, 1200-1211. https://doi.org/10.1111/ecog.04290

Wiens, J.A., 1989. The ecology of bird communities. Cambridge University Press.

PDF recommendation

Conflict of interest:
The recommender in charge of the evaluation of the article and the reviewers declared that they have no conflict of interest (as defined in the code of conduct of PCI) with the authors or with the content of the article. The authors declared that they comply with the PCI rule of having no financial conflicts of interest in relation to the content of the article.

Funding:
Agence Nationale de la Recherche (ANR), grant n° 19-CE32-0002-01

Reviews

Evaluation round #2

DOI or URL of the preprint: https://doi.org/10.1101/2022.07.29.501974

Version of the preprint: 3

Author's Reply, 01 Dec 2023

Download author's reply https://doi.org/10.24072/pci.ecology.100536.ar2

Decision by Eric Goberville, posted 05 Nov 2023, validated 06 Nov 2023

Dear Dr. Laroche,

Thank you for your submission entitled "Efficient sampling designs to assess biodiversity spatial autocorrelation: should we go fractal?" to PCIEcology. We have received feedback from the two referees, and I would like to express my appreciation for their comprehensive and insightful reviews.

Both reviewers commend your meticulous work in addressing their previous comments and recommendations. They highlight significant improvements, including the transformation of complex code into an informative Rmarkdown document and the innovative approach to defining 'rugged' environmental variables. However, there are some minor suggestions related to code structure, conditional chunk evaluation, and the need for clearer figure annotations.

Furthermore, they note limitations in the example visualizations, which could be expanded to cover a broader range of 'as' values. The importance of specifying the grid mesh size is also emphasized. The 'Spanning path length' analysis, while valuable, is seen as somewhat of an afterthought and should be included in the methods section for better contextual understanding.

One reviewer underscores the absence of essential information in the methods section, particularly regarding repetitions and averaging, urging greater clarity in this regard. They also recommend distinguishing between regular grid and random data points in hybrid models and using subscript in plot titles for figures 4 and 6. Figure 10 is seen as somewhat unclear. The suggestion is made to define the ratio of sampling area to the number of sampling points for better interpretation. Additionally, a few minor points are addressed.

In conclusion, the requested revisions represent a relatively modest effort compared to the substantial improvements you have already made to your study. I believe that these adjustments will contribute to the final acceptance of your article in PCI in Ecology.

Sincerely,
Eric Goberville

https://doi.org/10.24072/pci.ecology.100536.d2

Reviewed by Nigel Yoccoz, 01 Nov 2023

The author has carefully revised the paper, adding much relevant information and additional simulations. I look forward to applications of some of the ideas developed in the paper, and the scripts provide the necessary tools.

https://doi.org/10.24072/pci.ecology.100536.rev21

Reviewed by Charles J Marsh, 04 Nov 2023

Review for Laroche - Efficient sampling designs to assess biodiversity spatial autocorrelation: should we go fractal?

This is a follow up review. Again, someone better than me will need to evaluate the mathematical approaches, especially with regards the SI, and I have focussed my review on the other aspects. The author has put in considerable work to address comments reviewers had outlined last time, and I think those changes have been implemented really well. Changes include more realistic environmental covariates, better visualisations of the covariates and the results, an examination of trade-offs between sampling for multiple variables, a discussion of sampling effort required for different schemes, and a tidying up of the R code.

First, the previously uninterpretable code has been converted into a lovely Rmarkdown doc that outlines various steps and provides the code. Really a lot of work has gone into this, and I really commend the author on the effort, it is well worth it, and will be very useful for readers who might want to apply similar methods.

Second, the methods for defining the ‘rugged’ environmental variables is really nice, using sine waves and shifting the longitudinal position of the centre - an approach I think will be very useful applying to other simulation studies.

My remaining comments are pretty low-level suggestions or tweaks:

The Rmarkdown doc and code –

I got it working after a bit of trouble-shooting. I can confirm the code provides the same results as the manuscript. I also tried running it with different seeds for the randomness in the hybrid designs and the results were fairly robust, with only minor differences (one risk of true randomness of course is that you can end up with a terrible distribution of points). It’s still a bit opaque though in terms of annotations and formatting, which could be something to think about in the future to make the work more useful for others.

Otherwise, three small comments:

1) set the folder structure up in the code itself rather than in the markdown arguments to make it easier for users to repeat. Also, better to use file.path rather than paste for users with non-linux systems.

2) A lot of the code chunks are set to not evaluate, so knitting will fail. I understand that some are time-consuming and you prefer to read in the data files once generated, but maybe think about some simple if statements to run the loops only if the rdata files haven’t been created yet (and you could always provide the data files for download).

3) I don’t know what the figure ‘Global performance across autocorrelation values’ is showing – please annotate.

The example as visualised in the examples –
The as values span from 0.01 – 100, but the examples of as in figs 4 and 6 are really limited showing only 0.09 – 0.33 (ranks 7-12 out of 28 total). This also doesn’t include the grid mesh size which is the ‘switching point’ of behaviour (which I believe should be 0.38). When I have created the figures with a wider range of values it produces some interesting patterns not apparent from the current figures, and also makes much more sense of figure 8.

The ‘Spanning path length’ analysis –

This addition seems to be a bit of an afterthought – it is not described at all in the methods section and comes out of the blue in the results, with no indication as to the purpose of the analysis. Introduce the rationale for it and the methodology in the methods section, as it is important for the interpretation of the final figure.

Missing info in the methods –

As well as the distance analysis missing from the methods, there seems to be other key info not outlined in the methods and only apparent when going though the code. For example it is not clear from the methods whether there were any repeats, how many etc, and how they were averaged (working through the code I can see it is 30 reps but this info should be clearly stated in the methods). Please check that everything needed to replicate the study is outlined in the methods without referral to the markdown doc.

Fig. 8 legend –

All a bit messy. 1) What is the grid mesh size? What does centred mean if we don’t know what the extremes are? You may as well give the actual values (0.038 – 3.8, centre = 0.38 I think?).

2) Remove all the abbreviations (I’m not sure what ‘resp.’ means).

Fig. 4 and 6, maybe also other figures –

As well as indicating the most disordered value with a triangle, for the hybrid models you might consider having two symbols – one for the regular grid and one for random. Most people will really be only interested in the regular grid vs fractals, and so it is really worthwhile in every figure making it obvious where that regular grid lies. More minor, tweak the plot titles so that a[s] is in subscript

Figure 10 –

I’m not sure about the representation of the sampling area-budget arrows on the left and how easy it is to interpret. I see it is as more of a sampling area:no. sampling points ratio (more useful than sampling ‘budget’ I think, which depends on lots of other things). This then determines the FGM, which ultimately interacts with as. Perhaps that ratio can be defined by L as you have in the results, or maybe it is not generalisable in that fashion? Also, you could think about breaking it down for whether you are interested in estimating the autocorrelation mean or the range (or both).

Other trivial things:

Lines 333-337 – you could always use different coloured boxes to help indicate these

Sampling design figs 1, 2, 3 – v minor niggle but set asp=1 to keep aspect ratio of coordinates even

Line 50 – estimated ranges of what exactly, the autocorrelation range or distributional range?

Line 189 – ‘… accurately estimate …’

Line 220 – accurately what?

Line 445 - ‘… autocorrelation range was smaller …’

One final important point. I’m not Rob Ewers (I don’t have the beard for one), so make sure to remove his name from the acknowledgements.

https://doi.org/10.24072/pci.ecology.100536.rev22

Evaluation round #1

DOI or URL of the preprint: https://doi.org/10.1101/2022.07.29.501974

Version of the preprint: 2

Author's Reply, 09 Oct 2023

Download author's reply https://doi.org/10.24072/pci.ecology.100536.ar1

Decision by Eric Goberville, posted 15 Jul 2023, validated 17 Jul 2023

Dear Dr. Laroche,

Thank you for submitting your article titled "Efficient sampling designs to assess biodiversity spatial autocorrelation: should we go fractal?" to PCIEcology. We have now received feedback from two referees, and I would like to express my gratitude to both referees for their thorough and insightful review of your manuscript. I share their opinion regarding the relevance of your study and its significant contribution to the existing literature, as well as the importance of providing access to the code for ensuring reproducibility of the analyses. However, before your study can be considered for publication, some revisions and clarifications are necessary.

The first referee noted that your article addresses a pertinent subject but highlighted a lack of references to recent research in the field. It is strongly recommended to include credible sources to support your arguments and enhance the credibility of your article. Additionally, the referee encourages you to delve deeper into specific aspects of your analysis by providing concrete examples or case studies to substantiate your viewpoints.

The second referee acknowledges the commendable aspects of the article, including the methodology of defining hybrid and fractal patterns and the use of the Pareto front method to examine trade-offs. Overall, the referee provides valuable feedback regarding the need to consider multiple variables/species, the practicality of sampling designs, and the choice of environmental variables. Further exploration of these aspects is encouraged to improve the practical applicability of the study.

We kindly request that you take these comments and suggestions into consideration during the revision of your article. Please submit a revised version of your manuscript, incorporating the referees' remarks. Additionally, we would appreciate a detailed response letter addressing how you have addressed the referees' comments.

Thank you for your valuable contribution to our scientific journal, and we look forward to receiving your response.

Best regards,
Eric Goberville

https://doi.org/10.24072/pci.ecology.100536.d1

Reviewed by Nigel Yoccoz, 27 Jun 2023

Despite the recognized importance of sampling design, at least for researchers with an interest in statistical questions, it is remarkable that so few empirical studies in ecology are in fact designed according to well-defined objectives and some forms of random or systematic sampling. If one takes the example of species distributions, most studies use “available” data which are most often derived from opportunistic sampling or some form of hybrid designs (e.g. random design initially but with some nonrandom selection of final units linked for example to accessibility or observer availability). Many approaches have then been developed to account for this lack of design, but their robustness is often unclear. Clearly it would be preferable to start with a good sampling design.

This paper investigates different designs – random, grid, fractal (multiple scales) – and their efficiency when autocorrelation can be seen either as a “nuisance” and a parameter of interest. It is based on extensive simulations, and using a model-based approach for estimation. The conclusions are that fractal designs are seldom efficient. The scripts for running the simulations are available, but I did not run the simulations to check the results.

This is an interesting contribution for researchers working on sampling design, as it explicitly addresses different objectives (i.e. not “just” estimating population size, or an environmental effect). I could add that a specific difficulty with autocorrelation from a statistical point of view is that it may be hard to distinguish between a “real” autocorrelation due for example to intrinsic processes such as dispersal and the effect of a spatial covariate having an autocorrelation with the same range (i.e. it is not just an issue of bias but also of identifiability). As one often does not know what are the effects and range of environmental covariates, it is not obvious how sampling should be done. This paper addresses some of the issues associated with autocorrelation and estimating effects of covariates, and perhaps the author should emphasize the importance of making simulations to assess different designs depending on study objectives. Simulations are useful not just for assessing different design as is well done in this paper, but also because it forces the researchers to specify objectives, both in terms of ecological questions and in terms if what can be realistically expected in terms of precision/bias.

https://doi.org/10.24072/pci.ecology.100536.rev11

Reviewed by Charles J Marsh, 13 Jul 2023

Download the review https://doi.org/10.24072/pci.ecology.100536.rev12

User comments

No user comments yet

or Register
Submit a preprint