1 Introduction
Population dynamics studies aim at understanding variation in the size and structure of populations. This variation results from flows of individuals, which, together with flows of matter and energy and flows of genes are one of the three dominant dynamics of the living world. In population dynamics, the effects of births, deaths and dispersal, affected by various sources of variation [1], accumulate over time in a multiplicative process. As a consequence, population dynamics is often counter-intuitive [2] and mathematical models have been early and widely used [3,4].
With the development of computers, population dynamics modelling has developed explosively for the last 40 years. Vertebrate population studies have played a major role in this development, for at least three reasons: a wide diversity of life cycles (generation time ranges from a few weeks in small rodents to decades in whales or elephants), the fact that vertebrates can be counted, observed and marked relatively easily, and concerns on the impact of human activities on vertebrate populations [5,6]. Moreover, as discussed later, because of their sensitivity to demographic changes, vertebrates are good indicators of the impact of human activities on ecological systems.
I review here progress in population dynamics models, and how it has improved our understanding of vertebrate population ecology. I consider statistical and dynamical models, with an emphasis on empirical approaches. Mixing modelling and biological ideas, this text is intended for a wide audience, while keeping references to a minimum and not pretending to exhaustiveness in such a rapidly evolving field. In particular, little mention will be made of population dynamics studies of marine resources [7], despite strikingly parallel developments.
Studies of vertebrate populations have paid attention for a long time to survival [8], fecundity [9] and dispersal [10], broadly called ‘vital rates’, as life history traits of particular interest. In the 1950s and 1960s, density dependence in vital rates, acting as a negative feedback on population growth [11], was opposed to density-independent variation [11,12] in a long controversy [13]: the impossibility of long-term exponential growth requires density dependence, while density-independent environmental variation in vital rates is often prominent in empirical studies.
I will review progress since this period in dynamical models (Section 2) and in statistical ones (Section 3), which changed markedly our knowledge of vital rates and in turn our understanding of vertebrate population dynamics (Section 4), to conclude by a general discussion (Section 5).
2 The development of discrete time models
Dynamical population models consider changes in population size and structure over time. At one extreme, population size can be reduced to a scalar, neglecting any structure. At the other, population size can be viewed as a function of several real variables, such as age, body weight, or spatial coordinates. The time scale can be continuous or discrete. A wide and convenient type of models considers a discrete time scale and a discrete structure based on classes of individuals (age or size classes, sites or ‘patches’). Population at time t is then represented as a non-negative vector . Age-structured models with age class length equal to the time step are particularly straightforward [14], although developmental stages, sites or general states can be also considered and possibly combined with age [15–17].
A deterministic discrete time model consists then of a recurrence equation , where is a square matrix. Matrix models sensu lato, excellently reviewed by Caswell [15], have several advantages:
- – they are computationally practical [18], easily studied and communicated to biologists;
- – they are easily generalized (Table 1), similar generalizations for other types of models [19] being generally more involved;
- – they are biologically relevant for a seasonal environment, as common for vertebrates, and easily built from life cycle graphs, with vital rates as natural parameters (Figs. 1 and 2, about the White Stork C. ciconia [20]).
Various types of discrete time population models
Feature | Recurrence equation | Type of model | Mathematical tools | Recent key reference |
Constant parameters | Matrix models stricto sensu | Linear algebra | [15] | |
Density dependence | Density-dependent matrix models, Discrete time logistic growth | Nonlinear dynamics | [15] | |
Random environment | Random environment models | Products of random matrices | [29] | |
Demographic stochasticity | Branching processes | Applied probability | [31] |
The asymptotic behaviour of constant parameter matrix models is well known [15]: under mild assumptions, the matrix M has a real positive dominant eigenvalue associated to a positive right eigenvector, which give the asymptotic growth rate and structure of the population, respectively. The left eigenvector, also positive, consists of ‘reproductive values’, interpretable as relative age or stage specific contributions to growth. Constant parameter matrix models, albeit restrictive, are extremely useful for population ‘projections’, i.e., predictions valid under the current parameter values [15]. The asymptotic growth rate is a nonlinear and non-explicit function of demographic parameters: sensitivity analysis helps understanding its variation with respect to the parameters. Generation time (the mean age of mothers at birth in the asymptotic population structure [21]) plays there a central role: for instance, the relative sensitivity (or ‘elasticity’) of to a change in all fecundity parameters is the inverse of generation time [22,23]. In all long lived species, survival is thus the most sensitive parameter (White Stork: Table 2).
Sensitivity analysis of population growth rate as a function of demographic parameters for the Alsace White Stork (model and parameters in Fig. 2) (after [20] modified). Baseline scenario is , R=0.800, a=3.000, , , leading to asymptotic population growth rate λ=1.0195 and generation time . Numerical results (last but one column) illustrate formal links between elasticity and generation time (last column)
Parameter θ | Asymptotic growth rate λ | Elasticity, numerical | Elasticity, formal |
Based on a 2% change in parameter | |||
Proportion of successful nests R | 1.0228 | 0.1596 | |
Number of young per successful pair a | 1.0228 | 0.1596 | |
First year survival | 1.0228 | 0.1596 | |
After first year survival | 1.0366 | 0.8386 | |
Overall survival, and | 1.0399 | 1.0000 | 1.0000 |
One can distinguish between individuals just appeared in the population, by birth or immigration, as , and other individuals, resulting from survival or dispersal within the population, as [15]. The resulting model equation considers growth and turnover, closely linking matrix models with the stable population theory [3,24] and renewal theory [16]. One may easily show that the asymptotic proportion of new individuals, if counted in reproductive value, is the inverse of generation time. Reproductive value is also the relevant currency to compare flows of individuals in dispersal [25] or optimal harvesting [26] models. Generation time and reproductive value appear thus besides growth rate and population structure as key concepts and outputs of demographic models.
In density-dependent models , the growth rate generally decreases when population increases (see, however, [27]). The behaviour of nonlinear dynamic systems has received a lot of mathematical attention. Discrete time models exhibit successive bifurcations in asymptotic behaviour when the maximum growth rate increases [28], often culminating in chaotic behaviour. However, most realistic applications to vertebrate populations lead to a stable equilibrium.
In random environment models, the demographic parameters, and hence the matrix change randomly over time t, as a stochastic process. The long-term growth rate of the population differs from the growth rate of expected population size (Fig. 3). It can be derived from ergodic theorems on products of random matrices, organised in a coherent theory of random environment models by Tuljapurkar [29]. Remarkably, there is little theory for the simultaneous presence and potentially complex interplay of density dependence and environmental stochasticity (see, however, [30]).
Another relevant generalisation concerns demographic stochasticity, which addresses the discrete and probabilistic nature of demographic processes. Mortality is then, for instance, a binomial process. Branching processes (BP) [31], in which integer population size varies according to a Markov chain, are a canonical tool. A particularly striking feature of BP is that, when population size is prevented to diverge to infinity by density dependence, extinction is certain: stabilization can then only occur conditional on non-extinction [31], as a ‘quasi-stationary’ distribution of population size. Time to extinction follows then a geometric distribution, as commonly observed in ad hoc extinction models used in ‘Population Viability Analysis’ [32].
Discrete time models can be studied numerically, with program ULM [18] standing out as a powerful and user-friendly tool. Their empirical use directly depends on demographic parameter estimation.
3 Statistical models: modern capture–recapture methodology
Among demographic parameters, fecundity is relatively easily estimated in vertebrates. Estimating survival probabilities is a challenge that has received considerable attention over the last 20 years. In animal populations, individuals are rarely, if ever, followed exhaustively long enough to efficiently estimate survival (e.g., radio-tagging [33]). Even in plants, dormancy can prevent from detecting individuals [34] and using models based on exhaustive follow-up. While early survival studies based on marked individuals [8] were derived from life table approaches, the entire topic of demographic parameter estimation in natural populations has been recast in the comprehensive framework of capture-mark-recapture (CMR) methodology. Marking individuals, at birth or later, and sampling them through time and space is indeed the closest counterpart that one may reach of the exhaustive registration of births and deaths in human populations.
CMR methods were first developed for closed populations, i.e., with no death or recruitment of individuals, recruitment in a generic sense being the entry in the part of the population amenable to capture. The assumption of closure often appeared untenable: mortality and recruitment were first viewed as nuisance parameters, needed to remain able to estimate population size. Early developments culminated in a fully stochastic model [35,36]. However, realizing that survival probabilities could be estimated based on successive sampling of marked individuals, without reference to their dilution among the unmarked ones [37], induced a move in emphasis from the estimation of population size to that of survival probabilities [38]. The resulting Cormack–Jolly–Seber (CJS) survival model generalises binomial chains for follow-up data to CMR data. In its original version, the survival and detection probabilities were time dependent, to be able to obtain explicit estimators. Soon, generalisations took into account variation in parameters with age [39]. Optimal goodness-of-fit contingency table tests partitioned into interpretable components were developed [40,41].
In parallel, the 1970s saw the development of generalized linear models (GLM), in which covariates or factors (as in Analysis of Variance) are considered to affect counts, proportions... [42]. GLM ideas spread progressively in CMR models, with constant parameter models, models with covariates, etc. [43–45]. These developments were gathered, with an emphasis on flexibility in answering biological questions, in a comprehensive framework [46] with powerful and flexible software [47,48]. As an example, Table 3 shows how survival of the Alsace White Stork was related to drought in the Sahel wintering grounds [49]. Flexibility was enhanced by the development of model selection ideas [50], parameter identifiability diagnostics [51], and models alleviating the deleterious effect of heterogeneity of capture [52,53].
An example of relation between survival and an environmental covariate: Alsace White Stork adult survival and rain in Segou, in the Sahel wintering zone (after [49], simplified and completed). In the CMR survival models presented the probability of recapture is constant; ‘Experience’ corresponds to newly seen and already seen breeders; Rain corresponds to a logistic-linear relationship between survival and rain in Segou, by Experience class (2) or additively to Experience (3). The minimal Akaike Information Criterion [50] points to model (3) in which survival decreases with Sahel drought spells. Based on sensitivity analysis (Table 2), the induced variation in survival is sufficient to explain the strong decrease in the Alsace White Stork population
Survival model | Number of parameters | Relative deviance | Akaike information criterion |
(1) Experience. Time | 40 | 342.56 | 422.56 |
(2) Experience. Rain | 5 | 375.50 | 385.50 |
(3) Experience + Rain | 4 | 376.29 | 384.29 |
(4) Experience | 3 | 385.75 | 391.75 |
Recent and promising developments concern random effects. For instance, survival probabilities over time can be modelled as , with a single variance parameter . Random effects are also useful to assess the effect of a covariate in the presence of unexplained environmental variability, as . Bayesian methods provide suitable algorithms such as Monte-Carlo Markov Chains to estimate parameters [54,55].
Multistate capture–recapture (MSCR) models are a recent and powerful generalization: individuals move among several sites or states, still with incomplete detection over discrete occasions [56–58]. MSCR models encompass survival models based on live recapture (CJS), dead recoveries of marked individuals [59], and mixtures of such pieces of information [60]. They cover also models to analyse and estimate recruitment [61,62], natal and breeding dispersal [63,64]. Multievent models, a most promising generalization of MSCR models, consider uncertainty in state determination [65]. MSCR and multievent models are CMR counterparts of longitudinal models used in human behaviour and health studies [66].
In empirical population assessment, demographic parameter estimates, denoted in a generic fashion , are often fed into a matrix model . For instance, under constant parameters, the resulting dominant eigenvalue is an estimate of the asymptotic population growth rate. Estimates of its sampling uncertainty can be derived from sensitivity analysis [67]. When parameters vary over time, the model trajectory and population counts can be graphically compared, in an ad hoc fashion. Recent promising work [55,68,69] has proposed to formally link dynamical and statistical models using state-space models. The information from population counts is then combined with that from CMR data to estimate demographic parameters and population numbers and structure over time, based on Kalman filtering of Bayesian techniques. Such integrated modelling also offers possibilities of forecasting, of special interest for population management [70].
4 Consequences on the understanding of vertebrate population dynamics
Progress in capture–recapture methodology, including software, has been rapidly transferred to biologists, in a fashion similar to the transfer of progress in survival models [71] in biomedical research. It induced a “revolution, [...] still ongoing, and literally induced hundreds of papers” [72]. Ad hoc and irrelevant techniques (return rates, life tables) were nearly abandoned, which resulted in less biased estimates, often validated by using dynamical models. Since most sources of bias tended to underestimate survival, progress thus often resulted in an upward trend in survival estimates for a same species over time [23,73].
The linear model philosophy in CMR induced a strong focus on biological questions, as in the White Stork example. Evidence was obtained of the effects of wide-scale climatic variables and climate change on animal demography [74]. Accession to reproduction in animal populations was proven to be progressive [63,75]. Unambiguous evidence for a cost of reproduction on survival or future reproduction was obtained in observational [76] and experimental studies [77], demonstrating some trade-offs predicted by evolutionary theory. Similarly, senescence in survival [78,79] and life-long differences in demographic performance between cohorts [80] were shown to be common in ungulates.
Dispersal studies greatly benefited from the development of multistate models, providing information complementary to molecular marker estimates of gene flow [81]. The greater site fidelity of adult birds compared to immatures was largely confirmed, and dispersal was shown to vary with the environment [64], disturbance [63], and social behaviour [82]. In parallel, reflections on subdivided populations, or ‘metapopulations’ sensu lato [83] and dispersal studies helped renewing the concept of density dependence, emphasizing local density dependence [84,85], according to which dispersal, fecundity and survival may vary with local density through access to resource. Strong results were obtained in mammals [78,86]. Surprisingly, density dependence has been the subject of few manipulative experiments since the 1950s [87], although bird species using nest boxes seem an ideal material.
Changes in numbers are now currently interpreted in terms of turnover of individuals and of influential parameters, in a striking shift from pattern to process, whether at the single species level [88] or in a comparative perspective [80]. The upward revisions of survival, and the downward revision of recruitment, breeding propensity and reproductive output resulted in larger estimates of generation time, a integrative index of the position of a species or population on the ‘slow–fast’ gradient [22]. This gradient ranks species from slow, low reproduction, low maximum growth rate (K) species to fast, high-reproduction, low-survival, high-growth rate (r) species [89]. Generation time being inversely related to maximum growth rate [90], maximum growth rates in Vertebrates are thus lower than previously thought, a conclusion critical for conservation biology. For instance seabirds, with, commonly, generation times >20 years, stand out as species with a low maximum growth rate ( per year). They are thus extremely sensitive to man-induced mortality, e.g., incidental fishery by-catch [90,91]. Longevity and body size, strongly correlated with longevity [92,93] are indeed good predictors of extinction risk. Whether one speaks of Albatross, Whales, California Condors, sea turtles or Sequoias, there is thus a genuine “malediction of long-lived species”. Progress in vertebrate population modelling has played a major role in drawing the attention to this problem. These concerns come close to those on the overexploitation of marine resources [7], for which progress in quantitative approaches has also been decisive.
5 Discussion
The joint development of matrix models and capture-mark-recapture methods has led to a comprehensive and powerful theory and methodology for vertebrate population dynamic studies [94]. These studies have both benefited from and been at the source of much of this methodological development, in a fascinating pluridisciplinary story. The data from many long-term programs are being efficiently used to address a variety of biological questions, from evolutionary theory [95] to conservation ecology [96]. This revolution has definitely moved the emphasis in population dynamics from pattern to process.
Based on foreseeable developments, one can clearly predict population modelling in the broad sense considered here will still develop and diversify, and go on intimately influencing population biology as a whole. Three most promising directions are mixed models (random effects), integrated modelling and multievent (‘uncertain-state’) models. Mixed models allow partitioning variation in demographic rates, and their spread will mostly depend on the availability of user-friendly software. Integrated modelling is a natural way of analyzing ‘integrated monitoring’ combining population counts with CMR data. Continuing progress in individual marks and electronic devices providing access to biological characteristics [97] will have several consequences. First, it will require further developments of multistate and multi-event models. Secondly, population modelling will emphasize process even more strongly by bridges with individual-based and behaviour-based models [98]. These models focus on optimal individual choice rules (for dispersal, reproduction...), at the expense of parameter estimation procedures, while the demographic methods reviewed here focus on estimation and population-level mechanisms.
On the biological side, experiments on density dependence are needed to progress on this still challenging question. Continuing progress in the reliability of demographic parameter estimates will also induce broad revaluations in comparative demography, with strong consequences on conservation biology and our understanding of the diversification of life histories. Population modelling has become an integral part of population biology.