Chicken-or-egg: Which came first, the question or the model?

The questions we ask

The quest for generality is at the core of ecology. Once we observe some relationship between X and Y, we usually want to know how general that relationship is, and how much the relationship can vary in different conditions – whether it’s ecosystems, species, individuals, sites, or other groupings. In other words, we are interested in separating the variation we observe into (1) a general relationship between X and Y that explains what we see most of the time, and (2) conditions under which this general relationship may differ. These conditions can be ecological (i.e. maybe species differ in the way they respond to X), but they can also be due to the process of observing the relationships (i.e. maybe X responds to Y in site A as expected and B but not C and D, for reasons we haven’t measured).

As an example, we know that plants generally grow faster when they are exposed to more light. If we go out and measure plant growth rates in sites A, B, C, and D, we may see that this general relationship holds. But, some species may actually grow better in low light, and their relationship with light will differ. 

When we monitor biodiversity change, we are often asking a similar question. How are species’ abundances generally responding to global change: are they declining? stable? Growing? Once we know this general response, we also need to know how individual species are doing: which species are declining the fastest, and need immediate conservation interventions? Which ones are stable but not growing, but should continue being monitored in case this changes? Which ones are growing, and may be doing better after conservation or may not need any conservation action for the time being?

So, our question is generally framed in this way: what is the general relationship, and when and how much can this relationship differ between species?

How we answer these questions

To answer these questions, we build models, collect data, and evaluate how well our model explains the data. For simplicity, let’s explore an example where we want to understand Y ~ X for many species. One approach is to model Y ~ X for each species and then try to put all these models together to get some kind of average or agreement between them. Maybe we could look at how the slope of our models vary across these species, to make inferences about how the strength of the relationship between X and Y differs between species.  

Another approach is to build one model that disentangles the Y ~ X relationship into a general component, which summarises the general trajectory for all species, and a species-level component, which estimates how the relationship of Y ~ X is conditional on species identity. This is, essentially, a hierarchical model: we are estimating variation (1) across species, and (2) between species in one model.

Which comes first? The chicken or the egg?

Over the last decade, we’ve seen a burst in modeling capacity in ecology. We have big datasets, more powerful computers, a wide catalogue of statistical modeling tools and literature to back it up, and importantly – a ton of questions we want to answer.

These questions should, of course, always be based on our previous understanding of relationships we’ve studied between Y and X from the literature and our observations of nature. Our questions are supposed to shape the model we build – but, we have to admit that sometimes the model shapes the question first. Maybe our labs use certain models that we’ve adopted ourselves, and we find ways to apply this model to new datasets. Maybe our field loves using a certain approach (like community ecologists love a good PCA – no shame here, I love a PCA too), so we reach for it first. Maybe we think a newly published modeling approach is fancy and cool, and want to find a way to use it in our research. These are all fine, honestly. 

But we are wondering: Are our research questions sometimes limited by our model “comfort zone”, and if so, how do we push past this?

In other words, like the chicken and the egg: Which came first, the question or the model?

Are there questions that we aren’t yet asking because we don’t know how to answer them? Are some of these questions actually possible to answer now that we have an abundance of models, types and sizes of datasets, and powerful computers?

Join a community call!

If you’re interested in talking more about these models, please join our community calls during March and April 2025! We welcome anyone interested in GAMs, computational ecology, or eager to learn more about HGAMs to participate in the following sessions:

These community calls are intended to help us face this question. Each discussion will focus on the outstanding ecological questions that we could answer with HGAMs, highlighting a wide array of potential applications for specific types of ecological and evolutionary data. Join us in thinking about how we could use HGAMs to push ecological research forward! You do not need any background with hierarchical modeling or generalized additive models to join these discussions.

Each discussion will follow this structure:

ActivityDuration
Welcome & Scope10 mins
Individual reflection before the small group discussion5 mins
Small group discussion: What question do you usually ask? What question would you like to ask next?What model(s) do you use to answer your questions? Are some of your questions (or questions you’d like to ask) something you could ask with a hierarchical model, or hierarchical GAM?15 mins
Whole group discussion (discuss the small group findings)25 min
Wrap up & next steps5 mins
Close

What is this all for?

Our intention is to collaboratively write a Perspective paper to highlight the outstanding questions in ecology that could be explored with hierarchical GAMs. 

Please fill this form to let us know how you would like to participate (or not) in the next steps. 

All participants of this call who contributed to the notes and discussions will be credited in the acknowledgements of the paper unless otherwise communicated to us. 

Authored by Camille Lévesque and Katherine Hébert

HGAMs working group community calls

We invite you to join our series of discussions on the applications of Hierarchical Generalized Additive Models (HGAMs) in ecology. These discussions are part of a broader initiative led by BIOS2’s HGAMs working group, aimed at promoting the understanding and application of these models. 

We welcome anyone interested in GAMs, computational ecology, or eager to learn more about HGAMs to participate in the following sessions:

Each discussion will focus on the outstanding ecological questions that we could answer with HGAMs, highlighting a wide array of potential applications for specific types of ecological and evolutionary data. Join us in thinking about how we could use HGAMs to push ecological research forward!

Hierarchical Generalized Additive Models

On March 3rd, 2025, BIOS² will host a new training on hierarchical generalized additive models (HGAMs) by fellows Camille Lévesque and Katherine Hébert.

This course is designed to demystify hierarchical modelling as powerful tools to model population dynamics, spatial distributions, and any non-linear relationships in your ecological data. The training will be divided into two blocks. First, we will cover hierarchies in biology, data, and in models to understand what hierarchical models are, some of the forms they can take, and the fundamentals of how they work. Second, we will introduce latent variable modelling as a way to explain even more of the variation in our response variables, to better disentangle the hierarchies of variation in our data. Both blocks will include a theoretical presentation followed by hands-on coding exercises to implement and interpret hierarchical GAMs.

This training will be given in English, and the coding exercises will be done in R. We recommend installing R and RStudio prior to the workshop, and will send more detailed instructions about packages to install and data to download in the days before the workshop.

We recommend previous experience with GAMs before taking this training. If you would like to follow an introduction to GAMs before this workshop, please have a look at Eric Pedersen’s Introduction to GAMs (https://bios2.usherbrooke.ca/2021/10/20/workshop-gams-2021/), the Québec Centre for Biodiversity Science’s Workshop 8: GAMs (http://r.qcbs.ca/workshop08/book-en/) or take this BIOS²+QCBS hybrid training that will happen on February 28th, 2025.

What: Short course on hierarchical generalized additive models (3h)
When: March 3, 1pm-4pm EST (includes 15 minute break)
Where: Online on Zoom
Registration: https://us02web.zoom.us/meeting/register/_JZMQEXhRaqxVvCKUqM4jA

Student-Led research group call 2024/25

The concept and idea of a “working group” varies immensely in academia, but in the industry, it has a very clear definition: it is a group of people working on a very specific problem within a timeframe. Although the time to digest and rethink deeply complex ecological problems is instrumental for scientific progress, very often scientists are called to act, to propose a specific solution for a specific problem (even more in the rapidly changing world we’re living in).

The Student-Led Working group aims to promote leadership opportunities for BIOS2 fellows while fostering diversity, inclusion and accessibility in research environments. This call simulates a project development pipeline, with four phases: initiation, planning, execution and closing. Proponents of a working group would submit a project where the organization and leadership are well-defined in the project, as well as expectations, budget and timelines. BIOS2 fellows are invited to submit a project proposal with a schedule for up to seven months, including at least the execution and closing phases of a research project.

A budget of up to CAD$ 5000 is available for travel, accommodation, material and publication fees. In-person event coordination and/or mentoring for organizers can be provided by the BIOS2 coordination team.

Project submission

Each project would clearly define a roadmap including the four phases of project development cited above.

The project should not last more than one academic year and can be very short to include only the final phase of the working group. For example, if a student have already gone through the ideation phase, but needs help planning and executing their projects, they should include the ideation phase in the project and describe the outcomes that would be the input of the planning phase.

Additionally, the group members should be defined by the proponent. A very important aspect of a working group is the choice of the people who will work together. It requires planning and leadership skills to identify the people with the technical baggage, interests and availability to work on a project. It’s up to the proponent how they will find the right people. If needed, the applicant group can open a call for applications to invite more participants, and BIOS2 can help publicizing this call, but the final selection should be made entirely by the proposal’s leadership group. BIOS2 will consider the diversity of the group and justifications for the group composition when deciding if the project will be funded.

The project may or may not include in-person meetings, but should include at least one sprint period. Sprints are short, time-boxed periods when a team works to complete a very well-defined set amount of work. The amount of work defined by the leaders must be compatible with the length of the sprint: the shorter the period, the smaller the number of tasks to be completed. An imbalance in this ratio will certainly lead to burnout and frustration. The sustainability and inclusivity of in-person meetings must be considered during the selection process.

The application package will consist of a written project (that can be supported by other media, such as videos, audio, images, websites, etc.), a lean canvas proposal, a budget spreadsheet and a timeline of ideation (when applicable), planning, execution and closing phases. This timeline must comprise any period between December 1st, 2024 and August 31st, 2025, including the closing phase.

The written project and supporting media should expand on the bullet points added to the lean canvas, and there, proponents must define:

  • One clear problem to be solved and why it’s important to solve it. Applicants should include their problem in a good theoretical context that can be understood by specialists and non-specialists.
  • Description of the human resources of the group, i.e., names and contributions of each participant. Applicants must not attach any CVs – instead, they should explain why they decided to invite these people to help them solve the problem defined before.
  • Expected outcomes: explicit what and how many products they expect to deliver by the end of the timeline, why they are important, and how they can be used as a measure of success.
  • The budget document must be filled in in detail. Applicants should include itemized expenses instead of broad categories (i.e., instead of “support for participants”, they should try to describe which kind of support they are talking about – child care? transportation? translation? masks?).
  • Applicants must also include a description of regular activities, depending on the number of phases they include in the project (frequency of virtual and in-person meetings, visits, training, etc.).

The templates for the lean canvas and budget are available below:

Eligibility

Applicants must be currently registered in a study program in ecology, environmental science, evolution or a related discipline in a Canadian university and be a BIOS2 fellow. The other members of the group don’t need to fulfill these requirements, but only students can receive funds from BIOS2. Non-students in the group should be funded by other sources.

Selection process and deadlines

Applications must be submitted by November 8th, 2024. The selection committee will be composed of BIOS2 and QCBS staff members and alumni. Results are to be expected in the last two weeks of November, 2024.

Application packages should be submitted as one single zip file through this form: https://tally.so/r/31vypW

Apply to participate in a Hydro Quebec + BIOS² working group

The NSERC-CREATE training program in Computational Biology BIOS2, in collaboration with Hydro Quebec, is now accepting applications to participate in a working group that will be held from March 18 to 22, 2024, in the Montreal Area, QC. Applications are due on January 15, 2024. To apply, please fill out this form: https://forms.office.com/r/mGL9thRSSJ

Description of the activity

Working groups consist of a small group of researchers who meet in person and work intensively and collaboratively on a research question, to solve a specific problem, or to develop a specific project. The working group’s goal is to prototype a chapter of an environmental impact assessment report focused on biodiversity measures, using empirical data from real Hydro Quebec projects. Students will work to implement a chapter that should help with decision-making that is efficient, applicable and with content that is easy to communicate.

Eligibility

Applicants must be currently registered in a study program in ecology, environmental science, evolution or a related discipline in a Canadian university. The call is primarily for graduate students, but a few places will be open for undergraduate students and for postdocs whose research aligns with the theme of the working group.

Ideally, candidates will be proficient in R with some experience of working with large ecological datasets, in a collaborative environment, and communicating their analysis and results. Candidates should also be able to participate in person on all 5 days of the activities. Given the goal of the working group, the following skills will be prioritized in the selection process:

Communication

  • Data visualization
  • Map design
  • Technical writing in English
  • Technical writing in French
  • Environmental policy
  • Decision making

Computational Ecology

  • Multivariate analysis
  • Geographical statistics
  • Data management
  • Climate change
  • Simulations
  • Ecological niche and species distribution modeling
  • Landscape ecology
  • Environmental impact assessment
  • Community ecology

Please notice you don’t need to check all the boxes in the list of skills above to apply. Any combination of skills is welcome, just make sure to emphasize them in the application. It is important, however, to keep in mind that there will be mandatory readings in French.

The highest priority will go to BIOS2 Fellows. Participation fees (travel, accommodation and meals during the event) will be covered for BIOS2 program members, as well as for students registered at a BIOS2-affiliated university. Students from other universities are welcome to apply but must cover their cost of participation.

Candidates with caring responsibilities are welcome and encouraged to apply and we will accommodate their needs, as much as possible. The common language used in this working group will be English, but support in French, Portuguese and Spanish will be provided.

Presentation of the project

Prototype of a biodiversity chapter for environmental impact assessment reports

Leaders: Alexandre Beauchemin, Biologiste, M.Sc. Env. (Hydro Québec), Amélie Drolet Biologiste, M.Sc. (Hydro Quebec), and Dominique Gravel, PhD (BIOS2, Réseau d’observation de la biodiversité du Québec, Université de Sherbrooke).
Facilitator: Gracielle Higino, PhD (BIOS2)

Date of the working group: March 18 to 22, 2024.
Location: In person, in the Montreal Area (place TBD)
Application deadline: January 15, 2024
Application form: https://forms.office.com/r/mGL9thRSSJ

If you have any questions, do not hesitate to contact us at pgm_bios2@usherbrooke.ca.

Apply to participate in a BIOS² + LDP Student-led Working Group

The NSERC-CREATE training program in Computational Biology BIOS2, in collaboration with the NSERC-CREATE Living Data Project from the Canadian Institute of Ecology and Evolution, are now accepting applications to participate in a student-led working group that will be held from January 23 to 27, 2023, in the Montreal Area, QC.

The theme of the working group is: Assessing the potential for climate-driven range shifts through multiple landscapes features across the Canada-US border.