Close printable page

Recommendation

A deep learning model to unlock secrets of animal movement and behaviour

Cédric Sueur based on reviews by Jacob Davidson and 1 anonymous reviewer

A recommendation of:

MoveFormer: a Transformer-based model for step-selection animal movement modelling

Ondřej Cífka, Simon Chamaillé-Jammes, Antoine Liutkus (2023), bioRxiv, ver.4, peer-reviewed and recommended by PCI Ecology https://doi.org/10.1101/2023.03.05.531080

Read preprint in preprint server

Data used for results

Codes used in this study

Scripts used to obtain or analyze results

Abstract

EN

AR

ES

FR

HI

JA

PT

RU

ZH-CN

MoveFormer: a Transformer-based model for step-selection animal movement modelling

The movement of animals is a central component of their behavioural strategies. Statistical tools for movement data analysis, however, have long been limited, and in particular, unable to account for past movement information except in a very simplified way. In this work, we propose MoveFormer, a new step-based model of movement capable of learning directly from full animal trajectories. While inspired by the classical step-selection framework and previous work on the quantification of uncertainty in movement predictions, MoveFormer also builds upon recent developments in deep learning, such as the Transformer architecture, allowing it to incorporate long temporal contexts. The model predicts an animal’s next movement step given its past movement history, including not only purely positional and temporal information, but also any available environmental covariates such as land cover or temperature. We apply our model to a diverse dataset made up of over 1550 trajectories from over 100 studies, and show how it can be used to gain insights about the importance of the provided context features, including the extent of past movement history. Our software, along with the trained model weights, is released as open source.

animal movement, habitat selection, deep learning, method, spatial memory

This is an automatically generated version. The authors and PCI decline all responsibility concerning its content

MoveFormer: نموذج قائم على المحولات لنمذجة حركة الحيوان بالاختيار التدريجي

تعد حركة الحيوانات عنصرًا أساسيًا في استراتيجياتها السلوكية. ومع ذلك، ظلت الأدوات الإحصائية لتحليل بيانات الحركة محدودة منذ فترة طويلة، وعلى وجه الخصوص، غير قادرة على حساب معلومات الحركة السابقة إلا بطريقة مبسطة للغاية. في هذا العمل، نقترح MoveFormer، وهو نموذج جديد للحركة قائم على الخطوات قادر على التعلم مباشرة من مسارات الحيوانات الكاملة. في حين أن MoveFormer مستوحى من إطار الاختيار التدريجي الكلاسيكي والعمل السابق على القياس الكمي لعدم اليقين في تنبؤات الحركة، فإنه يعتمد أيضًا على التطورات الأخيرة في التعلم العميق، مثل بنية Transformer، مما يسمح له بدمج سياقات زمنية طويلة. يتنبأ النموذج بالخطوة التالية لحركة الحيوان بالنظر إلى تاريخ حركته السابقة، بما في ذلك ليس فقط المعلومات الموضعية والزمنية البحتة، ولكن أيضًا أي متغيرات بيئية متاحة مثل الغطاء الأرضي أو درجة الحرارة. نحن نطبق نموذجنا على مجموعة بيانات متنوعة تتكون من أكثر من 1550 مسارًا من أكثر من 100 دراسة، ونوضح كيف يمكن استخدامه للحصول على رؤى حول أهمية ميزات السياق المتوفرة، بما في ذلك مدى تاريخ الحركة السابقة. يتم إصدار برنامجنا، بالإضافة إلى أوزان النماذج المدربة، كمصدر مفتوح.

حركة الحيوان، اختيار الموطن، التعلم العميق، الطريقة، الذاكرة المكانية

This is an automatically generated version. The authors and PCI decline all responsibility concerning its content

MoveFormer: un modelo basado en Transformer para modelar el movimiento de animales con selección de pasos

El movimiento de los animales es un componente central de sus estrategias de comportamiento. Sin embargo, las herramientas estadísticas para el análisis de datos de movimientos han sido limitadas durante mucho tiempo y, en particular, no han podido dar cuenta de la información de movimientos pasados excepto de una manera muy simplificada. En este trabajo, proponemos MoveFormer, un nuevo modelo de movimiento basado en pasos capaz de aprender directamente de trayectorias animales completas. Si bien se inspira en el marco clásico de selección de pasos y en trabajos previos sobre la cuantificación de la incertidumbre en las predicciones de movimiento, MoveFormer también se basa en desarrollos recientes en aprendizaje profundo, como la arquitectura Transformer, lo que le permite incorporar contextos temporales prolongados. El modelo predice el siguiente paso de movimiento de un animal teniendo en cuenta su historial de movimientos pasado, incluyendo no sólo información puramente posicional y temporal, sino también cualquier covariable ambiental disponible, como la cobertura del suelo o la temperatura. Aplicamos nuestro modelo a un conjunto de datos diverso compuesto por más de 1550 trayectorias de más de 100 estudios y mostramos cómo se puede utilizar para obtener información sobre la importancia de las características del contexto proporcionadas, incluido el alcance de la historia de los movimientos pasados. Nuestro software, junto con los pesos del modelo entrenado, se publica como código abierto.

movimiento animal, selección de hábitat, aprendizaje profundo, método, memoria espacial

This is an automatically generated version. The authors and PCI decline all responsibility concerning its content

MoveFormer : un modèle basé sur Transformer pour la modélisation des mouvements d'animaux par sélection par étapes

Le mouvement des animaux est un élément central de leurs stratégies comportementales. Cependant, les outils statistiques d’analyse des données sur les mouvements ont longtemps été limités et, en particulier, incapables de rendre compte des informations sur les mouvements passés, sauf de manière très simplifiée. Dans ce travail, nous proposons MoveFormer, un nouveau modèle de mouvement basé sur des étapes, capable d'apprendre directement à partir de trajectoires complètes d'animaux. Bien qu'inspiré par le cadre classique de sélection par étapes et par des travaux antérieurs sur la quantification de l'incertitude dans les prédictions de mouvement, MoveFormer s'appuie également sur les développements récents en matière d'apprentissage profond, tels que l'architecture Transformer, lui permettant d'incorporer des contextes temporels longs. Le modèle prédit la prochaine étape de mouvement d’un animal en fonction de son historique de mouvements passés, y compris non seulement des informations purement positionnelles et temporelles, mais également toutes les covariables environnementales disponibles telles que la couverture terrestre ou la température. Nous appliquons notre modèle à un ensemble de données diversifié composé de plus de 1 550 trajectoires issues de plus de 100 études, et montrons comment il peut être utilisé pour mieux comprendre l'importance des caractéristiques contextuelles fournies, y compris l'étendue de l'historique des mouvements passés. Notre logiciel, ainsi que les poids des modèles entraînés, sont publiés en open source.

mouvement animal, sélection d'habitat, apprentissage profond, méthode, mémoire spatiale

This is an automatically generated version. The authors and PCI decline all responsibility concerning its content

मूवफॉर्मर: चरण-चयन पशु आंदोलन मॉडलिंग के लिए एक ट्रांसफार्मर-आधारित मॉडल

जानवरों की आवाजाही उनकी व्यवहारिक रणनीतियों का एक केंद्रीय घटक है। हालाँकि, आंदोलन डेटा विश्लेषण के लिए सांख्यिकीय उपकरण लंबे समय से सीमित हैं, और विशेष रूप से, बहुत सरल तरीके को छोड़कर पिछले आंदोलन की जानकारी का हिसाब देने में असमर्थ हैं। इस कार्य में, हम मूवफॉर्मर का प्रस्ताव करते हैं, जो आंदोलन का एक नया चरण-आधारित मॉडल है जो पूर्ण पशु प्रक्षेप पथ से सीधे सीखने में सक्षम है। जबकि शास्त्रीय चरण-चयन ढांचे और आंदोलन की भविष्यवाणियों में अनिश्चितता की मात्रा का ठहराव पर पिछले काम से प्रेरित होकर, मूवफॉर्मर गहन शिक्षा में हाल के विकास, जैसे ट्रांसफार्मर वास्तुकला, पर भी निर्माण करता है, जो इसे लंबे समय के संदर्भों को शामिल करने की अनुमति देता है। मॉडल किसी जानवर के पिछले आंदोलन के इतिहास को देखते हुए उसके अगले आंदोलन चरण की भविष्यवाणी करता है, जिसमें न केवल विशुद्ध रूप से स्थितीय और अस्थायी जानकारी शामिल है, बल्कि भूमि कवर या तापमान जैसे कोई भी उपलब्ध पर्यावरणीय सहसंयोजक भी शामिल है। हम अपने मॉडल को 100 से अधिक अध्ययनों के 1550 से अधिक प्रक्षेप पथों से बने विविध डेटासेट पर लागू करते हैं, और दिखाते हैं कि इसका उपयोग पिछले आंदोलन के इतिहास की सीमा सहित प्रदान की गई संदर्भ सुविधाओं के महत्व के बारे में अंतर्दृष्टि प्राप्त करने के लिए कैसे किया जा सकता है। हमारा सॉफ्टवेयर, प्रशिक्षित मॉडल वेट के साथ, ओपन सोर्स के रूप में जारी किया गया है।

पशु आंदोलन, आवास चयन, गहन शिक्षा, विधि, स्थानिक स्मृति

This is an automatically generated version. The authors and PCI decline all responsibility concerning its content

MoveFormer: ステップ選択による動物の動きモデリング用の Transformer ベースのモデル

動物の動きは、動物の行動戦略の中心的な要素です。しかし、運動データ分析のための統計ツールは長い間制限されており、特に、非常に単純化された方法以外では過去の運動情報を説明することができませんでした。この研究では、動物の完全な軌跡から直接学習できる新しいステップベースの動作モデルである MoveFormer を提案します。 MoveFormer は、古典的なステップ選択フレームワークと動き予測の不確実性の定量化に関する以前の研究からインスピレーションを受けていますが、Transformer アーキテクチャなどの深層学習の最近の開発にも基づいて構築されており、長い時間的コンテキストを組み込むことができます。このモデルは、純粋な位置情報や時間情報だけでなく、土地被覆や温度などの利用可能な環境共変量も含む過去の移動履歴を考慮して、動物の次の移動ステップを予測します。私たちは、100 以上の研究からの 1,550 以上の軌跡で構成される多様なデータセットにモデルを適用し、過去の移動履歴の範囲など、提供されたコンテキスト特徴の重要性についての洞察を得るためにモデルを使用する方法を示します。私たちのソフトウェアは、トレーニングされたモデルの重みとともにオープンソースとしてリリースされています。

動物の運動、生息地の選択、ディープラーニング、メソッド、空間記憶

This is an automatically generated version. The authors and PCI decline all responsibility concerning its content

MoveFormer: um modelo baseado em Transformer para modelagem de movimento animal com seleção de etapas

O movimento dos animais é um componente central de suas estratégias comportamentais. No entanto, as ferramentas estatísticas para análise de dados de movimentos têm sido limitadas e, em particular, incapazes de contabilizar informações de movimentos anteriores, exceto de uma forma muito simplificada. Neste trabalho, propomos o MoveFormer, um novo modelo de movimento baseado em passos, capaz de aprender diretamente a partir de trajetórias completas de animais. Embora inspirado na estrutura clássica de seleção de etapas e em trabalhos anteriores sobre a quantificação da incerteza nas previsões de movimento, o MoveFormer também se baseia em desenvolvimentos recentes em aprendizagem profunda, como a arquitetura Transformer, permitindo-lhe incorporar contextos temporais longos. O modelo prevê o próximo passo de movimento de um animal, dado o seu histórico de movimentos passados, incluindo não apenas informações puramente posicionais e temporais, mas também quaisquer covariáveis ambientais disponíveis, como cobertura do solo ou temperatura. Aplicamos nosso modelo a um conjunto de dados diversificado composto por mais de 1.550 trajetórias de mais de 100 estudos e mostramos como ele pode ser usado para obter insights sobre a importância das características de contexto fornecidas, incluindo a extensão do histórico de movimentos anteriores. Nosso software, junto com os pesos do modelo treinado, é lançado como código aberto.

movimento animal, seleção de habitat, aprendizagem profunda, método, memória espacial

This is an automatically generated version. The authors and PCI decline all responsibility concerning its content

MoveFormer: модель на основе Transformer для моделирования движений животных с пошаговым выбором.

Движение животных является центральным компонентом их поведенческих стратегий. Однако статистические инструменты для анализа данных о перемещении уже давно ограничены и, в частности, не способны учитывать информацию о прошлых перемещениях, кроме как в очень упрощенном виде. В этой работе мы предлагаем MoveFormer, новую пошаговую модель движения, способную обучаться непосредственно на полных траекториях животных. Вдохновленный классической структурой выбора шагов и предыдущими работами по количественной оценке неопределенности в прогнозах движения, MoveFormer также опирается на последние разработки в области глубокого обучения, такие как архитектура Transformer, что позволяет ему включать длинные временные контексты. Модель предсказывает следующий шаг движения животного, учитывая его прошлую историю перемещений, включая не только чисто позиционную и временную информацию, но и любые доступные ковариаты окружающей среды, такие как растительный покров или температура. Мы применяем нашу модель к разнообразному набору данных, состоящему из более чем 1550 траекторий из более чем 100 исследований, и показываем, как ее можно использовать для получения информации о важности предоставленных особенностей контекста, включая масштабы истории прошлых движений. Наше программное обеспечение вместе с обученными весами моделей распространяется с открытым исходным кодом.

движение животных, выбор среды обитания, глубокое обучение, метод, пространственная память

This is an automatically generated version. The authors and PCI decline all responsibility concerning its content

MoveFormer：基于 Transformer 的模型，用于步骤选择动物运动建模 c6cfb464eb4405aa1673106a4077a12 动物运动、栖息地选择、深度学习、方法、空间记忆

动物的运动是其行为策略的核心组成部分。然而，用于运动数据分析的统计工具长期以来一直受到限制，特别是除非以非常简单的方式，否则无法解释过去的运动信息。在这项工作中，我们提出了 MoveFormer，一种新的基于步骤的运动模型，能够直接从完整的动物轨迹中学习。虽然受到经典的步骤选择框架和之前关于运动预测不确定性量化的工作的启发，MoveFormer 还建立在深度学习的最新发展基础上，例如 Transformer 架构，使其能够纳入长时间上下文。该模型根据动物过去的运动历史来预测其下一步的运动，不仅包括纯粹的位置和时间信息，还包括任何可用的环境协变量，例如土地覆盖或温度。我们将我们的模型应用于由来自 100 多项研究的 1550 多个轨迹组成的多样化数据集，并展示了如何使用它来深入了解所提供的上下文特征的重要性，包括过去运动历史的范围。我们的软件以及经过训练的模型权重作为开源发布。

Submission: posted 22 March 2023, validated 22 March 2023
Recommendation: posted 29 September 2023, validated 29 September 2023

Cite this recommendation as:
Sueur, C. (2023) A deep learning model to unlock secrets of animal movement and behaviour. Peer Community in Ecology, 100531. https://doi.org/10.24072/pci.ecology.100531

Recommendation

The study of animal movement is essential for understanding their behaviour and how ecological or global changes impact their routines [1]. Recent technological advancements have improved the collection of movement data [2], but limited statistical tools have hindered the analysis of such data [3–5]. Animal movement is influenced not only by environmental factors but also by internal knowledge and memory, which are challenging to observe directly [6,7]. Routine movement behaviours and the incorporation of memory into models remain understudied.

Researchers have developed ‘MoveFormer’ [8], a deep learning-based model that predicts future movements based on past context, addressing these challenges and offering insights into the importance of different context lengths and information types. The model has been applied to a dataset of over 1,550 trajectories from various species, and the authors have made the MoveFormer source code available for further research.

Inspired by the step-selection framework and efforts to quantify uncertainty in movement predictions, MoveFormer leverages deep learning, specifically the Transformer architecture, to encode trajectories and understand how past movements influence current and future ones – a critical question in movement ecology. The results indicate that integrating information from a few days to two or three weeks before the movement enhances predictions. The model also accounts for environmental predictors and offers insights into the factors influencing animal movements.

Its potential impact extends to conservation, comparative analyses, and the generalisation of uncertainty-handling methods beyond ecology, with open-source code fostering collaboration and innovation in various scientific domains. Indeed, this method could be applied to analyse other kinds of movements, such as arm movements during tool use [9], pen movements, or eye movements during drawing [10], to better understand anticipation in actions and their intentionality.

References

1. Méndez, V.; Campos, D.; Bartumeus, F. Stochastic Foundations in Movement Ecology: Anomalous Diffusion, Front Propagation and Random Searches; Springer Series in Synergetics; Springer: Berlin, Heidelberg, 2014; ISBN 978-3-642-39009-8.
https://doi.org/10.1007/978-3-642-39010-4

2. Fehlmann, G.; King, A.J. Bio-Logging. Curr. Biol. 2016, 26, R830-R831.
https://doi.org/10.1016/j.cub.2016.05.033

3. Jacoby, D.M.; Freeman, R. Emerging Network-Based Tools in Movement Ecology. Trends Ecol. Evol. 2016, 31, 301-314.
https://doi.org/10.1016/j.tree.2016.01.011

4. Michelot, T.; Langrock, R.; Patterson, T.A. moveHMM: An R Package for the Statistical Modelling of Animal Movement Data Using Hidden Markov Models. Methods Ecol. Evol. 2016, 7, 1308-1315.
https://doi.org/10.1111/2041-210X.12578

5. Wang, G. Machine Learning for Inferring Animal Behavior from Location and Movement Data. Ecol. Inform. 2019, 49, 69-76.
https://doi.org/10.1016/j.ecoinf.2018.12.002

6. Noser, R.; Byrne, R.W. Change Point Analysis of Travel Routes Reveals Novel Insights into Foraging Strategies and Cognitive Maps of Wild Baboons. Am. J. Primatol. 2014, 76, 399-409.
https://doi.org/10.1002/ajp.22181

7. Fagan, W.F.; Lewis, M.A.; Auger‐Méthé, M.; Avgar, T.; Benhamou, S.; Breed, G.; LaDage, L.; Schlägel, U.E.; Tang, W.; Papastamatiou, Y.P. Spatial Memory and Animal Movement. Ecol. Lett. 2013, 16, 1316-1329.
https://doi.org/10.1111/ele.12165

8. Cífka, O.; Chamaillé-Jammes, S.; Liutkus, A. MoveFormer: A Transformer-Based Model for Step-Selection Animal Movement Modelling. bioRxiv 2023, ver. 4 peer-reviewed and recommended by Peer Community in Ecology.
https://doi.org/10.1101/2023.03.05.531080

9. Ardoin, T.; Sueur, C. Automatic Identification of Stone-Handling Behaviour in Japanese Macaques Using LabGym Artificial Intelligence. 2023, https://doi.org/10.13140/RG.2.2.30465.02402

10. Martinet, L.; Pelé, M. Drawing in Nonhuman Primates: What We Know and What Remains to Be Investigated. J. Comp. Psychol. Wash. DC 1983 2021, 135, 176-184, doi:10.1037/com0000251.
https://doi.org/10.1037/com0000251

PDF recommendation

Conflict of interest:
The recommender in charge of the evaluation of the article and the reviewers declared that they have no conflict of interest (as defined in the code of conduct of PCI) with the authors or with the content of the article. The authors declared that they comply with the PCI rule of having no financial conflicts of interest in relation to the content of the article.

Funding:
This work was supported by the LabEx NUMEV (ANR-10-LABX-0020) and the REPOS project, both funded by the I-Site MUSE (ANR-16-IDEX-0006). Computations were performed using HPC/AI resources from GENCI-IDRIS (Grant AD011012019R1).

Reviews

Evaluation round #1

DOI or URL of the preprint: https://doi.org/10.1101/2023.03.05.531080

Version of the preprint: 2

Author's Reply, 27 Sep 2023

Download author's reply Download tracked changes file

Dear recommender,

please see our response to reviews in the pdf. We also provide a version with track-change so you can more immediately see the revisions made. The document with these changes accepted is the lastest version online on bioRxiv.

We hope this revision will match your expectations.

Best regards,

Simon Chamaillé-Jammes, on behalf of the authors.

https://doi.org/10.24072/pci.ecology.100531.ar1

Decision by Cédric Sueur, posted 18 May 2023, validated 21 May 2023

Dear authors,

We received the comments of two reviewers. Please prepare a response. We will be pleased to review another version of your manuscript.

All the best,

Cédric Sueur

https://doi.org/10.24072/pci.ecology.100531.d1

Reviewed by Jacob Davidson, 17 May 2023

The authors develop a machine learning approach for predicting animal trajectories that uses the Transformer network architecture in order to incorporate past information of multiple features into predictions. By fitting to data from many different species using openly available MoveBank data, the authors compare predictive ability for different features and as a function of how much time of previous data is included.

I think the approach is interesting and makes a contribution that others working with movement data can use and build on. I have one main concern about the fitting procedure though: mainly, can the conclusions comparing the species be made, when the model is fit to all data together and the number of data points varies so large between species? The authors also note this (line 468). Does this discrepancy of data affect the conclusions? I could imagine an alternative fitting procedure, where each species is weighted equally, instead of each trajectory point. I feel that this comparison, or else further description justifying why the species comparison is driven by behavioral differences instead of simply different amounts of data, is needed.

Minor comments:

Legend text on Fig 1 is too small, and lacks units

Fig 2 shows PCA results for comparing the species, but does not show the PCA vectors. I'm not familiar with the Wikipedia2Vec data, but for PCA the vector components are normally shown, so that one can see what the embeddings represent. If this is not relevant for showing the Wikipedia2Vec embeddings, then it should at least be mentioned.

https://doi.org/10.24072/pci.ecology.100531.rev11

Reviewed by anonymous reviewer 1, 12 May 2023

# General comments

In this paper, the authors propose a deep learning (neural network) model for analysing animal movement trajectories, called MoveFormer. The model is step-based, predicting an animal’s next step based on the environmental context (as in a step-selection function). However, the model learns the entire trajectory before that step, thus incorporating (potentially long) temporal context to make predictions. Being a deep learning approach, we expect that the model is capable of learning complex relationships and having high predictive power. I believe that a similar deep learning approach for analysing trajectories (sequences) is Long Short Term Memory, but the paper uses recent developments such as the Transformer architecture. In my reading, I have not seen a similar approach applied to trajectory data. In general, I think this is a useful contribution to movement ecology: I feel we will (and should) see more approaches like this, which leverage the potential of deep learning methods for incorporating sequence (temporal) information when analysing animal trajectories. Specifically, the incorporation of previous movements (history) is an important advantage of the approach, and the estimation of a ‘context length’ (time window) that is most important for being able to learn the trajectory is a key contribution.

I have some familiarity with simpler machine learning methods, but not much expertise in deep learning. I cannot comment on many technical aspects of the work, particularly the specific architecture and implementation of this model. Nonetheless, I offer below a few general comments, and a small number of specific comments.

Clearly the model has impressive predictive capability – I wondered whether it’s possible to forecast more than one step ahead, or to predict a whole sequence of the same output length as the input? I understand that this is no longer exactly step selection, then, but I think this would be the kind of application many movement ecologists would be interested in. If we can only predict one step ahead, then a simpler approach may be better (next point)?

I was curious about inference from the model. If the model cannot (or should not) be used to predict more than one step ahead, and inference is limited due to the black box nature of the model, is it better to sticks with more traditional step selection functions if inference is the goal? I guess this is particularly relevant given how much data are clearly required for the present model. Regarding inference, is it possible to look at ‘selection’ of environmental conditions, along with the importance currently shown in the paper?

I appreciated that some features of the model – I’m thinking particularly of the different time-scale periods in the model – were kept general to maximise the wider (future) application of the model.

Regarding the stated contributions number 2 and 3 -- “Second, the proposed approach is flexible enough to allow each step in the context to be defined not only by the locations of the start and end points, but also by any kind of features that could be relevant, in particular environmental variables. Third, we show how the model can be used to gain insights about the importance of the provided context, both in terms of the extent of the past that it is useful to know, and in terms of what kinds of information are most ecologically relevant to predict an animal’s movement” – it would be really interesting (in future work!) to see how the model responds to variation in the spatial scale and resolution of the environmental context variables.

I found the evaluation of the relevant context length very interesting. In future work it would be interesting to further examine patterns of context length among species, beyond what is presented here, and in different ecological settings.

The present study uses GPS data – I would be interested to hear the authors’ thoughts on how to deal with lower accuracy tracking data such as Argos.

An important positive element of the work is the open-source release of the software, although I have not had the opportunity to try it.

Overall, the manuscript is clearly written and neatly presented.

# A few specific comments

L70-73: I don't agree that step selection functions are *the* approach to analyse animal trajectories. My own feeling is that other methods such as Hidden Markov Models or regression of trajectory parameters against environmental covariates are *at least* equally common, if not more so. To avoid this statement (which I think is debatable), consider: "Step-selection function (SSF) models, which compare actual movement steps with realistic candidate ones, are routinely used to infer and quantify the effect of environmental variables, such as land cover or temperature, on animal trajectories".

L105-107: I assume the time-window is arbitrary, which was one of your criticisms of current methods for incorporating previous context ("familiarity") in step selection functions.

Table 1: Add column name abbreviations to the table caption. Describe 'Section' part of the table (training, validation, test) in the caption.

L150: '408 observations'. Also, translate this approximately to a real duration in days?

L150-153: How is the split assigned (what proportions)?

L156-158: What is the reason for doing this?

L159: I think the taxon vectors will need some further explanation. Is the vector approximating the taxonomic relationships? Okay, I see this information down on L171-172. I suggest moving this information (or something like it) up to the beginning of the paragraph, to immediately give readers the context. Although, I still wonder if the vector embedding is capturing the actual taxonomic relationship, or only the ‘semantic similarity’ [L164-165] (in other words, how similar given Wikipedia entries are). The PCA figure, for example, shows that the embedding is okay at class level, but not very good at order level (no clear clustering of orders within classes). Also, the Spearman correlation (= 0.68, L 170) is not great. Could there be a different way to embed taxonomy? In understand that there may not be, and this is a minor point.

L184: ‘resample’ rather than ‘sample’?

L189: From this dataset (and Figure 1), I gather that no marine trajectories are included. Were these explicitly excluded using a filter at some point?

L312. Perhaps start a new paragraph here.

Figure 7: The labels ‘bioclim’ are not informative.

L502-505: I’m not sure I completely understood this point, but it seems to be quite relevant give our wish to make inference from the model. So, currently you present no inference for features that were relevant in the learned trajectory, only for the next predicted step? How are the two related to one another?

L537-540: I wonder, also, whether using higher temporal resolution data would reveal a second peak in context length, indicative of nested scale-patterns in the trajectories (e.g., fine temporal context nested within a longer context).

https://doi.org/10.24072/pci.ecology.100531.rev12