Reassessment of French breeding bird population sizes: from citizen science observations to nationwide estimates

Nigel Yoccoz

doi:10.24072/pci.ecology.100683

Nigel Yoccoz based on reviews by 2 anonymous reviewers

A recommendation of:

Reassessment of French breeding bird population sizes using citizen science and accounting for species detectability

Jean Nabias, Luc Barbaro, Benoit Fontaine, Jérémy Dupuy, Laurent Couzi, Clément Vallé, Romain Lorrillière (2024), HAL, ver.2, peer-reviewed and recommended by PCI Ecology https://hal.science/hal-04478371

Abstract

Reassessment of French breeding bird population sizes using citizen science and accounting for species detectability

Higher efficiency in large-scale and long-term biodiversity monitoring can be obtained through the use of Essential Biodiversity Variables, among which species population sizes provide key data for conservation programs. Relevant estimations and assessment of actual population sizes are critical for species conservation, especially in the current context of global biodiversity erosion. However, knowledge on population size varies greatly, depending on species conservation status and ranges.

While the most threatened or restricted-range species generally benefit from exhaustive counts and surveys, monitoring the population size of common and widespread species tends to be neglected or is simply more challenging to achieve. In this context, citizen science (CS) is a powerful tool for the long-term monitoring of common species: by engaging many volunteers, it permits data acquisition over the long term and over large spatial scales. Despite this substantially increased sampling effort, detectability issues imply that even common species may remain unnoticed at suitable sites. Structured CS schemes that include repeated visits make it possible to model the detection process and thus to draw reliable inferences about population size.

Here, we relied on a large French structured CS scheme (EPOC-ODF) comprising 27 156 complete checklists over 3 873 sites collected during the 2021-2023 breeding seasons to estimate the population size of 63 common bird species using Hierarchical Distance Sampling (HDS). These population size estimates were compared to the previous expert-based French breeding bird atlas estimations, which did not account for detectability issues.

We found that population size estimates from the former French breeding bird atlas were lower than those estimated using HDS for 65% of species. Such a prevalence of lower estimations is likely due to more conservative estimates inferred from semi-quantitative expert-based assessments used for the previous atlas. We also found that species with long-range songs such as the Common Cuckoo (Cuculus canorus), Eurasian Hoopoe (Upupa epops) or the Eurasian Blackbird (Turdus merula) had, in contrast, higher estimated population sizes in the previous atlas than in our HDS models.

Our study highlights the need to rely on sound statistical methodology to ensure reliable ecological inferences with adequate uncertainty estimation and advocates for a higher reliance on structured CS in support of long-term biodiversity monitoring.

Keywords: Bird atlases; Biogeography; Breeding Bird Surveys; Citizen Science; Detectability; Hierarchical Distance Sampling

Submission: posted 26 February 2024, validated 27 February 2024
Recommendation: posted 28 June 2024, validated 29 June 2024

Cite this recommendation as:
Yoccoz, N. (2024) Reassessment of French breeding bird population sizes: from citizen science observations to nationwide estimates. Peer Community in Ecology, 100683. https://doi.org/10.24072/pci.ecology.100683

Recommendation

Estimating the population size of widespread, common species in a relatively large and heterogeneous country like France is difficult for several reasons. The sample must cover the diverse ecological gradients well, and the analysis must account for detectability: the apparent absence of a species may be a false negative, the species being present but not detected. Bird communities have been the focus of a very large number of studies, and some countries, such as the UK, have long traditions of monitoring both common and rare species. Nabias et al. (2024) use a large, structured citizen science project to provide new population size estimates for common bird species, accounting for detectability and using habitat and climate covariates to extrapolate abundance to non-sampled areas. For about two thirds of the species, the new estimates were higher than expected from a previous attempt at estimating population size, which was based in part on expert knowledge and was projected, using trend estimates, to the period covered by the citizen science sampling. Some species showed large differences between the two estimates, which could be explained in part by accounting for detectability.
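To illustrate why accounting for detectability can shift population estimates, here is a minimal sketch of the classical point-count correction with a half-normal detection function. It is not the hierarchical model fitted by Nabias et al.; the function names, the 200 m truncation radius, the raw count and the two values of sigma are hypothetical and chosen only for illustration.

```python
import math


def halfnormal_point_detection(sigma: float, w: float) -> float:
    """Average detection probability inside a point-count circle of radius w.

    Assumes a half-normal detection function g(r) = exp(-r^2 / (2 sigma^2))
    and birds spread uniformly over the circle, so that
    p = (2 / w^2) * integral_0^w r g(r) dr, which has the closed form below.
    """
    return (2.0 * sigma**2 / w**2) * (1.0 - math.exp(-(w**2) / (2.0 * sigma**2)))


def corrected_density(count: float, sigma: float, w: float) -> float:
    """Birds per square metre after dividing the raw count by p and the area."""
    p = halfnormal_point_detection(sigma, w)
    area = math.pi * w**2
    return count / (p * area)


if __name__ == "__main__":
    raw_count = 12   # hypothetical number of birds detected at one point
    radius = 200.0   # hypothetical truncation distance, in metres
    # Hypothetical scale parameters: a quiet species vs. a far-carrying song.
    for label, sigma in (("short-range song", 40.0), ("long-range song", 150.0)):
        p = halfnormal_point_detection(sigma, radius)
        density_per_km2 = corrected_density(raw_count, sigma, radius) * 1e6
        print(f"{label}: p = {p:.3f}, density = {density_per_km2:.1f} birds/km2")
```

Under these assumptions, the same raw count translates into a much larger corrected density for the species that is hard to detect at a distance (small sigma) than for the species whose song carries far (large sigma). This is one plausible reason why estimates that ignore detectability can differ in either direction from detection-corrected ones.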

This paper uses what is called model-based inference, both for detectability and for habitat suitability (as opposed to design-based inference, which uses the sampling design to make inferences about the whole population; Buckland et al. 2000). The estimates obtained depend on how well the model components approximate the underlying processes, which is not easy to assess in a complex dataset like this one. But the paper clearly shows that detectability may have substantial implications for population size estimates. This is of course not new, but it has rarely been done at this scale and with a large sample covering many species. Interesting further work could test the robustness of the model-based approach, for example by sampling new plots and comparing the expected values to the observed values. Such sampling could be stratified to maximize the discrimination between expected low and high abundances, at least for species whose estimates might be considered uncertain or for which estimating population size is deemed important.
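The validation idea above could be sketched as follows. This is only one possible reading of it, under simple assumptions: candidate plots are split at the median of their model-predicted abundance into a "low" and a "high" stratum, the same number of new plots is drawn from each, and the mean difference between observed and predicted values is computed per stratum. The function names and the data layout (dictionaries keyed by plot identifier) are hypothetical.

```python
import random
from statistics import mean


def stratified_validation_sample(predicted, n_per_stratum, seed=0):
    """Draw new plots from a low and a high predicted-abundance stratum.

    predicted: dict mapping plot id -> model-predicted abundance.
    The median split is an illustrative choice, not a recommendation.
    """
    rng = random.Random(seed)
    ranked = sorted(predicted, key=predicted.get)
    half = len(ranked) // 2
    strata = {"low": ranked[:half], "high": ranked[half:]}
    return {
        name: rng.sample(ids, min(n_per_stratum, len(ids)))
        for name, ids in strata.items()
    }


def mean_prediction_error(predicted, observed, plot_ids):
    """Mean of (observed - predicted) over the given plots.

    Observed values are assumed to be on the same scale as the predictions,
    i.e. already corrected for detectability if the predictions are.
    """
    return mean(observed[i] - predicted[i] for i in plot_ids)


if __name__ == "__main__":
    # Hypothetical predictions for ten candidate plots.
    predicted = {f"plot{i}": float(i) for i in range(10)}
    sample = stratified_validation_sample(predicted, n_per_stratum=3)
    # Hypothetical field results: every plot holds one more bird than predicted.
    observed = {plot: value + 1.0 for plot, value in predicted.items()}
    for stratum, plots in sample.items():
        error = mean_prediction_error(predicted, observed, plots)
        print(stratum, sorted(plots), error)
```

A systematic error that differs between the two strata would suggest that the habitat or climate part of the model extrapolates poorly, whereas a similar error in both would point more towards the detection component.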

References

Buckland, S. T., Goudie, I. B. J., & Borchers, D. L. (2000). Wildlife Population Assessment: Past Developments and Future Directions. Biometrics, 56(1), 1-12. https://doi.org/10.1111/j.0006-341X.2000.00001.x

Nabias, J., Barbaro, L., Fontaine, B., Dupuy, J., Couzi, L., et al. (2024). Reassessment of French breeding bird population sizes using citizen science and accounting for species detectability. HAL, ver. 2, peer-reviewed and recommended by Peer Community in Ecology. https://hal.science/hal-04478371

Conflict of interest:
The recommender in charge of the evaluation of the article and the reviewers declared that they have no conflict of interest (as defined in the code of conduct of PCI) with the authors or with the content of the article. The authors declared that they comply with the PCI rule of having no financial conflicts of interest in relation to the content of the article.

Funding:
ANRT CIFRE 2021/0305

Reviews

Evaluation round #1

DOI or URL of the preprint: https://hal.science/hal-04478371

Version of the preprint: 1

Author's Reply, 04 Jun 2024

We would like to warmly thank Prof. Yoccoz for his interest in our study and his editorial work, and the two anonymous reviewers, whose invaluable insights and comments greatly improved our initial manuscript. We have responded to all their general remarks and addressed their comments in the following document.

https://doi.org/10.24072/pci.ecology.100683.ar1

Decision by Nigel Yoccoz, posted 07 May 2024, validated 11 May 2024

Mr. Nabias

Two reviewers have carefully evaluated your paper, and the reviews are positive. They have provided many constructive comments that will help you revise the paper. I concur with the reviewers that your analyses represent a nice case of hierarchical modelling.

Your paper is also a useful reminder that there can be large uncertainty in population size estimates of common species in a country that is large and relatively heterogeneous. As reviewer #2 wrote, the difference between the two estimates cannot simply be interpreted as a measure of bias. Your estimates based on citizen science data do not represent the truth: they are likely to be biased too, even if less so than the original method, so the real bias of the first estimates might be lower or higher than the one you imply when writing "over/under-estimation". It is enough to mention the large differences in estimates between the two methods.

I was also wondering if there is any information available from other countries with bird monitoring schemes that could inform the estimates you discuss, particularly when there is a large discrepancy. I understand that it is hard to compare France to e.g. Germany, Spain or the UK, but it might be worthwhile for at least a few species to compare your estimates to those available elsewhere.

Best regards

Nigel Yoccoz

https://doi.org/10.24072/pci.ecology.100683.d1

Reviewed by anonymous reviewer 1, 02 May 2024

This paper is at the interface between methodological development and applied ecology. While threatened or restricted-range species often benefit from exhaustive counts and surveys, monitoring common and widespread species is often neglected or poses significant challenges. In this context, citizen science (CS) emerges as a powerful tool for long-term monitoring, engaging volunteers to collect data across vast spatial scales. Despite the substantial increase in sampling efforts facilitated by CS, issues of detectability persist, potentially leading to the oversight of even common species in suitable habitats. This study draws on data from a large French CS program, EPOC-ODF, which amassed over 27,000 complete checklists across nearly 4,000 sites during the 2021-2023 breeding seasons. Using Hierarchical Distance Sampling (HDS), population size estimates were derived for 63 common bird species. Comparing these population size estimates to those from a previous expert-based atlas reveals significant underestimations in the atlas, likely due to conservative estimates. Some species with long-range songs were overestimated. The findings stress the importance of employing robust statistical methodologies to ensure unbiased ecological inferences and advocate for increased use of structured CS for biodiversity monitoring.

The manuscript is well-written, and the authors have managed to make it flow smoothly despite the multitude of in-depth analyses presented. The research question is quite clearly presented, and the different components of the article well linked to this question. The study's context is well-defined, elucidating the novelty the authors aim to underscore and the associated challenges. Note that I’m not an expert on the methods used in this paper nor citizen data analysis. Hence while some of my comments may be a bit naive, I believe they can be useful as other readers may share similar misunderstandings.

Although the methods used are quite innovative and present several advantages, the manuscript could better highlight a number of points concerning the assumptions underlying the models and the associated methodological choices, which seem crucial and are not, in my opinion, covered sufficiently at this stage. The authors could also spell out in greater detail the limitations of their approach, which might lead them to be more cautious about their overall conclusions. With a view to improving our knowledge of abundances, which as the authors explain is a non-trivial issue, it seems important to make the reader aware that the approach can be further improved, and of the potential consequences.

General comments:

The Results and Discussion focus heavily on the comparison with the old method, and perhaps too little on the current method. In particular, the probability of detection, which seems to me to be at the heart of this new approach, is not sufficiently discussed. Among other things, the influence of methodological choices and selected covariates is only very briefly addressed. I suggest expanding the bibliography on the subject. This is one example, but others could be used: Schmidt, J. H., McIntyre, C. L., & MacCluskie, M. C. (2013). Accounting for incomplete detection: What are we estimating and how might it affect long-term passerine monitoring programs? Biological Conservation, 160, 130-139. ISSN 0006-320.

Besides, I didn't quite understand how the different intra-annual replicates were incorporated into the model. I think I understood that they were linked to the probability of detection via the date and time of the survey, but I think this could be explained more clearly. If that's the case, why didn't the results section deal with the phenology of detection over the course of the season and the day, in order to suggest potential improvements to the protocol in the future, for example?

The authors go back and forth between the concepts of abundances and trends (and sometimes distribution) throughout the manuscript, leaving the reader unsure which specific question is addressed by which part. Among other things, this led me to wonder about the differences between the EPOC-ODF program and the FBBS (which is briefly mentioned in the Methods because it is used to extrapolate current abundances from ArGeom). I think that a sentence explaining this might help the reader understand why this approach is not directly compared with the FBBS results in the article.

Then, I was a bit confused as to why the use of covariates is only mentioned from line 156 onwards. They seem important in the approach considered, and perhaps their use and what it implies should be mentioned earlier, particularly in terms of the precision they bring, or not, to detection modeling. In addition, their influence may deserve to be addressed in the Discussion: to what extent do these choices influence the estimates?

Finally, I believe the results could be better organized and would benefit from subsections. There are a lot of models and methods, and it was difficult for me to know which results were linked to which method. The Results section is short; some results about detection probabilities, and notably their relationships with the different covariates, could be added to enrich the discussion. I was also wondering why the CI for detection probability is so narrow in Figure 5 and what it means.

Specific comments:

I suggest rephrasing the subtitles to make them more explicit and easier to read (L156, 184, 220, 248, 276).

L67-69: Consider rephrasing the sentence, which as it stands is too vague. The second half is not entirely clear. Additionally, the link between "agricultural and planning policies" and bird abundances/trends has not been explained before.

L 106: What concrete criterion does "Medium" refer to?

L 125: quality of which aspect of inferences?

L 124: The objectives could be rephrased and further detailed in separate sentences. I wonder if it might be helpful to flip the sentence, starting with the objective "we propose an estimation method..." and then coming to the methods.

L 312: Breeding bird populations abundances and/or trends? I propose to clarify this point throughout the whole manuscript. I feel that the authors go back and forth between these notions, which can sometimes be confusing.

L142: « encountered » visually and/or singing?

L145: What is the surface of the square of the grid? Does it correspond to the point counts or is it used to set a round buffer?

L145: Perhaps add some information on why this choice of 5min, in light of the literature on the subject (5min sufficient for all species?).

L162: At this stage, we don’t know what the covariates were chosen for. Is it to model p or N? I feel this whole section is a bit confusing.

L 166: I didn’t understand this sentence when first reading the manuscript, only later when reading the part on modelling. I wonder if it could be rephrased somehow.

L168: Why choose to group water bodies and mineral surfaces? What ecological meaning justifies this choice?

L172 and S3: It seems to me that Axis 1 is probably highly correlated with elevation, and maybe moisture as well. More generally, I feel that covariate choices are not discussed enough. Could you discuss to what extent these choices influence results? What are the ecological hypotheses behind these choices? Perhaps you could discuss whether some bioclimatic covariates (wind?) could also be used to model detection?

Fig S3.1: This figure is extremely useful. Consider adding it to the main text?

L 174: Why 500m buffer radii? More generally, scales and sampled surfaces are quite confusing. Maybe a figure illustrating/summarizing this could help?

L 180-182: I'm not entirely sure what is done here and how it is linked with previous paragraphs.

L189: Why 5 bin classes?

L191: I'm not sure if Julian date and hour are the effort covariates. I'm not quite sure which parameters they are incorporated into. Is it detection probability or also abundances?

L200-204: It might be helpful to consider cutting this sentence. Perhaps we could have one with the general case and then a second for the exceptions.

L213-218: Could you please clarify what these data from 2022 are and why they were not included in the whole analysis? It might be helpful to rephrase this section to make it more explicit.

L 272: Could you explain how you accounted for it?

L 278-288: This section is quite complex to follow. It would be helpful to have more clarity on how you implemented this in your models.

L291: Could you please clarify what 14.84 refers to?

Figure 4: Point (2) is not very clear. What does the extrapolation sign refer to?

L 356-358: Perhaps it would be helpful to be more explicit about your hypothesis regarding where these differences come from, if not from species detection probabilities?

L365: Did the previous method also use covariates?

L362-368: This is an interesting point, but it might be clearer if it were rephrased slightly. In particular, it would be helpful to have more information about the effect of time and date on detection probabilities in the results section. It's not clear to me to what extent you're modelling the phenology of detection during spring.

L368-376: This paragraph could be in the introduction instead, because it presents the « old » method?

L381: I'm not quite sure what this means, but it sounds interesting. Perhaps you could try rephrasing it?

L 398: « Actually, community … » instead?

L421: This seems like an important point. It's not entirely clear how many species are concerned by this in your study. Could you please elaborate on how this affects your results?

https://doi.org/10.24072/pci.ecology.100683.rev11

Reviewed by anonymous reviewer 2, 30 Apr 2024

The paper compares estimates of abundance of common birds across France using two different atlas data sets. The first data set is a survey from 2012 for which abundance estimates were derived without statistical modelling. These estimates are then compared to estimates from a new atlas scheme for which the authors suggest hierarchical distance sampling (HDS) models to estimate abundance.

Considerable effort and thought have been put into the modelling process for the more recent data, with seemingly well considered choices regarding which covariates enter the different components of the HDS model.

The new atlas survey and how the hierarchical distance sampling model is used to estimate abundance is described in detail (although descriptions could sometimes be clearer, see below). However, given that the comparison between estimates from the previous survey and the new survey is the main focus, I’m missing sufficient detail about the previous scheme, especially as the main reference provided for it is in French. What was the statistical design of the previous survey, how were counts conducted, estimates derived etc? Estimates from the old survey are also described as expert based, but as currently described in the text it is just a quantitative computation from “measured abundance”, with no expert knowledge used in the process.

My main concern is otherwise that the authors claim without evidence that their estimates from the new scheme are better than those of the old one. For example, if estimates from the old survey are lower than those from the new survey, they are referred to as underestimating abundance (and vice versa). The assumption is that the modelling provides more accurate inference than the previous ad-hoc approach. This may seem reasonable, especially since the modelling is largely based on sound reasoning. But the fact is that, since we don’t know the true abundances, we do not know which estimates are closest to the truth. A more nuanced discussion of the differences, not taking for granted that the HDS modelling will automatically provide better estimates, is therefore necessary.

One concern, for instance, is that the N-mixture model used as one component of the HDS is not a very robust approach because essential information to estimate detection (availability in the HDS model) is missing (Barker et al. 2018). The N-mixture model can underestimate or overestimate abundance; it is not necessarily unbiased (e.g. Duarte et al. 2018).
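As a minimal sketch of the general concern, and not a reproduction of the authors' HDS model (which uses repeated visits and distance data), consider the simplest case: abundance N is Poisson with mean lam, and a single count is Binomial(N, p). The function names and the two scenarios below are hypothetical and chosen only for illustration. Under these assumptions the counts follow a Poisson distribution with mean lam * p, so "many birds, low detection" and "few birds, high detection" produce exactly the same count distribution, and the counts alone cannot separate abundance from detection.

```python
from math import comb, exp, lgamma, log


def poisson_pmf(k, rate):
    """Poisson probability of observing k, computed on the log scale."""
    return exp(-rate + k * log(rate) - lgamma(k + 1))


def single_visit_count_pmf(y, lam, p, n_max=150):
    """P(count = y) when N ~ Poisson(lam) and count | N ~ Binomial(N, p).

    The infinite sum over the unknown abundance N is truncated at n_max.
    """
    return sum(
        poisson_pmf(n, lam) * comb(n, y) * p**y * (1 - p) ** (n - y)
        for n in range(y, n_max + 1)
    )


# Two hypothetical scenarios sharing the same product lam * p = 2
many_birds_low_detection = [
    single_visit_count_pmf(y, lam=10.0, p=0.2) for y in range(8)
]
few_birds_high_detection = [
    single_visit_count_pmf(y, lam=4.0, p=0.5) for y in range(8)
]
# Poisson distribution with mean lam * p, for comparison
thinned_poisson = [poisson_pmf(y, 2.0) for y in range(8)]

for y in range(8):
    print(
        y,
        round(many_birds_low_detection[y], 6),
        round(few_birds_high_detection[y], 6),
        round(thinned_poisson[y], 6),
    )
```

The three printed columns coincide for every count value. In practice, repeated visits and distance information add the data needed to estimate detection, but how well they do so depends on the model assumptions holding, which is the point raised by the cited references.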

In addition to the above, I would suggest that the authors take another careful pass through the text. There is missing text in some places, new paragraphs where they are not needed, etc. The Methods section could be improved for better clarity, and the Discussion better structured and more focused on the central questions.

Barker, R. J., Schofield, M. R., Link, W. A., & Sauer, J. R. (2018). On the reliability of N-mixture models for count data. Biometrics, 74(1), 369–377. https://doi.org/10.1111/biom.12734

Duarte, A., Adams, M. J., & Peterson, J. T. (2018). Fitting N-mixture models to count data with unmodeled heterogeneity: Bias, diagnostics, and alternative approaches. Ecological Modelling, 374, 51–59. https://doi.org/10.1016/j.ecolmodel.2018.02.007

Detailed comments:

Line 49-52. This is an example of a sentence that needs to be more carefully worded. You have not shown that your estimates are unbiased.

L99. In Europe there are monitoring schemes specifically targeting common species though, it would be a bit of a stretch to say that they are neglected or overlooked.

L112-115. Should be the other way around? Geometric means are smaller than arithmetic means.

L130-130. There is in fact no test of whether the new data set provides estimates closer to the truth.

L174-175. Revise wording.

L187-188. Not quite clear what is meant here. Do you mean that you truncated distances above the 95% quantile to the 95% quantile?

L185-204. I would not be able to repeat the modelling process or model selection strategy from these explanations. Please try to revise the method description to improve clarity.

L185- Was the year of survey included in the model somehow? Why, or why not?

L205-208. Not clear how C-hat was defined or calculated.

L213. Clarify that you are assessing robustness of estimates to exclusion of one year of data (rather than general robustness).

L216-218. How do you draw the conclusion that estimates are robust to exclusion of one year when confidence intervals for 9 out of 30 species don’t overlap?

L233-234. Define “coefficient of variation of the range uncertainty between pre- and post-treatment estimates”.

L233-234. What about NT2 extrapolation?

L234. The pre- and post-treatment labels do not accurately convey what is done. Something like ‘outlier-trimmed’ and ‘untrimmed’ seems more appropriate.

L239. Which “comparison analysis”?

L249. → “comparable estimates between the old and the new survey, we restricted…”

L251-256. Remind the reader here that ArGeom estimates the number of breeding pairs.

L255-256. This could lead to errors though, since lack of sexual dimorphism does not imply that males and females are equally likely to be detected.

L260. As the conclusion is that ArGeom provides lower estimates, it might be of interest to compare estimates using the upper bound in addition to the midpoint (just a suggestion).

L268. The notation “delta_methods” is somewhat unfortunate as the delta-method is a standard statistical approach not related to the use here.

L272-274. Not clear in what way you “took account of ArGeom uncertainty”.

L284-285. Revise wording in “using weighted means final candidate sets models in regards to AICc scores”.

L283-288. This analysis does not account for uncertainty in delta_methods, i.e. error in the estimate of delta is not accounted for.

L297. I suggest providing estimates of average estimated availability and detection probability in an appendix. This would be useful for understanding to what extent the N-mixture part inflates abundance, for example.

L308. “sits” → “its”.

L317 and elsewhere. Avoid qualifiers like “under” and “overestimation” and use something neutral like “estimated lower compared to HDS”.

L318. Why is habitat specialist/generalist a relevant variable for the difference between the two approaches? Was this based on a formal analysis?

L350-352. Here you are assuming that the HDS estimates are correctly representing truth.

L352. “presumed known uncertainties ranges” ?

L359-362. You found no association between detection probabilities and delta_methods, but still draw the conclusion that detection causes the difference? This needs further elaboration.

L369. “deviating from expert opinion reliance” - do you mean “derived from expert opinion”?

L370-372. Difficult sentence.

L377-384. I had a hard time following the argument in this paragraph. Consider rephrasing.

L407-414. If conservation status was an important question, why is it not mentioned in the methods/results?

L421. What does “inferences of clustered individuals” mean?

Fig 5B-C. Explain what the figure shows. “Marginal” can mean many different things in a statistical context. I think the figure shows the predicted response across different values of detection probabilities while the other covariate is held constant (perhaps at its mean?)?
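Returning to the comment on L112-115 above: a minimal sketch, using made-up counts that are not taken from the manuscript, of why a geometric mean cannot exceed the arithmetic mean of the same positive values (the two are equal only when all values are identical).

```python
from statistics import geometric_mean, mean

# Hypothetical counts of one species at five survey sites (not real data)
counts = [1, 2, 4, 8, 50]

arithmetic = mean(counts)           # 65 / 5 = 13
geometric = geometric_mean(counts)  # fifth root of 3200, about 5.02

print(f"arithmetic mean: {arithmetic:.2f}")
print(f"geometric mean:  {geometric:.2f}")
```

With skewed counts such as these, a single large value pulls the arithmetic mean well above the geometric mean, so an estimate built on geometric means would be expected to sit below one built on arithmetic means, not above it.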

https://doi.org/10.24072/pci.ecology.100683.rev12