- Department of Organismal Biology, Uppsala University, Uppsala, Sweden
- Adaptation, Evolutionary Applications, Genotype-Phenotype, Hybridization / Introgression, Population Genetics / Genomics
A young age of subspecific divergence in the desert locust Schistocerca gregaria, inferred by ABC Random Forest
Estimating recent divergence history: making the most of microsatellite data and Approximate Bayesian Computation approaches
The present-day distribution of extant species is the result of the interplay between their past population demography (e.g., expansion, contraction, isolation, and migration) and adaptation to the environment. Shedding light on the timing and magnitude of key demographic events helps identify the potential drivers of such events, such as life history traits and past episodes of environmental shifts, and the interactions among those drivers.
Understanding the key factors that drive species evolution gives important insights into how a species may respond to changing conditions, which can be particularly relevant for the management of harmful species, such as agricultural pests (e.g. [1]). Meaningful demographic inference presents major challenges. These include formulating evolutionary scenarios that fit the species' biology and eco-geographical context, choosing informative molecular markers, and choosing accurate quantitative approaches to statistically compare multiple demographic scenarios and estimate the parameters of interest. A further issue comes with the interpretation of results. Accurately dating the inferred events is far from straightforward, since reliable calibration points are necessary to translate molecular estimates of evolutionary time into absolute time units (i.e. years). This can be attempted in different ways, such as by using fossil and archaeological records, heterochronous samples (e.g. ancient DNA), and/or mutation rates estimated from independent data (e.g. [2], [3] for review). Nonetheless, experimental systems rarely meet these conditions, which hinders the comprehensive interpretation of results.
The contribution of Chapuis et al. [4] addresses these issues to investigate the recent history of the African insect pest Schistocerca gregaria (desert locust). They apply Approximate Bayesian Computation-Random Forest (ABC-RF) approaches to microsatellite markers. Owing to their fast mutation rate, microsatellite markers offer at least two advantages: i) suitability for analyzing recently diverged populations, and ii) the possibility of directly estimating the germline mutation rate in pedigree samples. The work of Chapuis et al. [4] benefits from both advantages, since the authors have estimates of mutation rate and allele size constraints derived from germline mutations observed in the species [5].
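Why allele size constraints matter for dating can be illustrated with a small simulation. The sketch below is a toy, not the mutation model used by the authors: it assumes a strict stepwise mutation model (each mutation adds or removes one repeat), hypothetical hard bounds of 5 to 15 repeats, and a fixed number of mutation events per lineage in place of a rate multiplied by time. The function names `mutate` and `mean_sq_distance` are made up for this illustration.

```python
import random

def mutate(size, n_mutations, rng, lo=None, hi=None):
    """Apply n single-repeat mutations (+1 or -1) to an allele size.

    If both bounds are given, a step that would leave [lo, hi] is
    rejected and the allele keeps its current size (assumed constraint).
    """
    for _ in range(n_mutations):
        new = size + rng.choice((-1, 1))
        if lo is not None and hi is not None and not (lo <= new <= hi):
            continue
        size = new
    return size

def mean_sq_distance(n_mutations, n_loci, rng, lo=None, hi=None, start=10):
    """Mean squared allele-size difference between two lineages that
    start from the same ancestral allele and mutate independently."""
    total = 0
    for _ in range(n_loci):
        a = mutate(start, n_mutations, rng, lo, hi)
        b = mutate(start, n_mutations, rng, lo, hi)
        total += (a - b) ** 2
    return total / n_loci

rng = random.Random(1)
for k in (5, 50, 500):  # mutation events per lineage, a proxy for time
    free = mean_sq_distance(k, 400, rng)
    bounded = mean_sq_distance(k, 400, rng, lo=5, hi=15)
    print(k, round(free, 1), round(bounded, 1))
```

Without bounds, the squared distance keeps growing with the number of mutations, so it remains informative about time. With bounds, it levels off once alleles have wandered across the allowed range, and older divergence times become indistinguishable from one another.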
The main aim of the study is to infer the history of divergence of the two subspecies of the desert locust, which have spatially disjoint distributions corresponding to the dry regions of North and South-West Africa. The authors first use paleo-vegetation maps to formulate hypotheses about changes in species range since the last glacial maximum. Based on these hypotheses, they generate 12 divergence models. For the selection of the demographic model and parameter estimation, they apply the recently developed ABC-RF approach, a powerful inferential tool that, among other advantages, optimizes the use of the information content of summary statistics [6]. Some methodological novelties are also introduced in this work, such as the computation of the error associated with the posterior parameter estimates under the best scenario. The accuracy of the timing estimates is supported in two ways: i) by the use of microsatellite markers with known evolutionary dynamics, as underlined above, and ii) by assessing the divergence time threshold above which posterior estimates are likely to be biased by size homoplasy and limits in allele size range [7]. The best-supported model suggests a recent divergence of the subspecies of S. gregaria (around 2.6 kya) and a reduction of population size in the subspecies that colonized the southern distribution area (S. g. flaviventris). As such, the results do not support the hypothesis that the southward colonization was driven by the expansion of African dry environments associated with the last glacial maximum, as has been postulated for other arid-adapted species with similar disjoint African distributions [8]. The estimated time of divergence points to a much more recent origin for the two subspecies, during the late Holocene, in a period of fairly stable arid conditions similar to current ones [9,10].
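The general logic of simulation-based model choice can be sketched in a few lines. The example below is a toy under stated assumptions and not the authors' pipeline: it uses plain rejection ABC (keeping the simulations closest to the observed summaries) rather than a random forest, which in ABC-RF is instead trained as a classifier on the simulated reference table. It compares only two hypothetical scenarios (`no_bottleneck` and `bottleneck`) instead of the 12 models of the study, and its simulator is a crude founder-sampling scheme with made-up parameter values, not a coalescent model.

```python
import math
import random
from collections import Counter

MODELS = ("no_bottleneck", "bottleneck")  # hypothetical toy scenarios

def simulate(model, rng, n_loci=20, n_sample=20):
    """Toy simulator returning two summary statistics: the mean number
    of distinct alleles per locus in population 1 and in population 2."""
    # Assumption: population 2 is founded by many or by very few gene copies.
    founders = 50 if model == "no_bottleneck" else rng.randint(2, 6)
    s1 = s2 = 0
    for _ in range(n_loci):
        ancestral = [rng.randint(5, 15) for _ in range(50)]  # ancestral gene pool
        pool2 = rng.choices(ancestral, k=founders)           # founding event
        s1 += len(set(rng.choices(ancestral, k=n_sample)))
        s2 += len(set(rng.choices(pool2, k=n_sample)))
    return (s1 / n_loci, s2 / n_loci)

def abc_model_choice(observed, rng, n_sims=600, n_keep=50):
    """Rejection ABC: approximate posterior model probabilities as model
    frequencies among the n_keep simulations closest to the observed data."""
    # Reference table: the model index is drawn from a uniform prior.
    table = []
    for _ in range(n_sims):
        model = rng.choice(MODELS)
        table.append((model, simulate(model, rng)))
    # Rank simulations by Euclidean distance between summary statistics.
    table.sort(key=lambda row: math.dist(row[1], observed))
    counts = Counter(model for model, _ in table[:n_keep])
    return {m: counts[m] / n_keep for m in MODELS}

rng = random.Random(7)
pseudo_observed = simulate("bottleneck", rng)  # stands in for real data
print(abc_model_choice(pseudo_observed, rng))
```

Here the two summary statistics share the same scale, so no standardization is applied. With many heterogeneous statistics, choosing and weighting them becomes the hard part, which is one motivation for letting a random forest do that work.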
Although the authors cannot exclude that their microsatellite data bear limited information on colonization events older than the last one, they bring arguments in favour of alternative explanations. The hypothesis they favour does not involve climatic drivers, but rather the particularly efficient dispersal behaviour of the species, whose individuals are able to fly over long distances (up to thousands of kilometers) under favourable wind conditions. A single long-distance dispersal event by a few individuals would explain the genetic signature of the bottleneck. A growing number of phylogeographic studies focus on arid regions of the Southern hemisphere, but the impact of past climate changes on species distributions in this region remains understudied relative to the Northern hemisphere [11,12].
The study presented by Chapuis et al. [4] offers several important insights into the demographic changes and evolutionary history of an agriculturally important pest species in Africa, which could also mirror the history of other organisms on the continent. As the authors point out, there are necessarily some uncertainties associated with models of past ecosystems and climate, especially for Africa. Interestingly, the authors argue that information on paleo-vegetation turnover was more informative than climatic niche modeling for the purpose of their study, since it led them to consider a wider range of bio-geographical changes and, in turn, a wider range of evolutionary scenarios (see discussion in Supplementary Material). Microsatellite markers have been a useful tool in population genetics and phylogeography for decades, but they are perhaps being overtaken in popularity by single nucleotide polymorphism (SNP) genotyping and whole-genome sequencing (WGS): according to PubMed, the number of publications mentioning “microsatellite” peaked in 2012.
This study reaffirms the usefulness of these classic molecular markers for estimating past demographic events, especially when species- and locus-specific microsatellite mutation features are available and a powerful inferential approach is adopted. Nonetheless, there are still hurdles to overcome, such as the limitations in scenario choice imposed by the simulation software used (e.g. continuous gene flow could not be modelled in this particular case). This calls for further improvement of simulation tools to allow more flexible modeling of demographic events and mutation patterns. In sum, this work not only contributes to our understanding of the makeup of African biodiversity but also offers a useful statistical framework that can be applied to a wide array of species and molecular markers (microsatellites, SNPs, and WGS).
[1] Lehmann, P. et al. (2018). Complex responses of global insect pests to climate change. bioRxiv, 425488. doi: https://dx.doi.org/10.1101/425488
[2] Donoghue, P. C., & Benton, M. J. (2007). Rocks and clocks: calibrating the Tree of Life using fossils and molecules. Trends in Ecology & Evolution, 22(8), 424-431. doi: https://dx.doi.org/10.1016/j.tree.2007.05.005
[3] Ho, S. Y., Lanfear, R., Bromham, L., Phillips, M. J., Soubrier, J., Rodrigo, A. G., & Cooper, A. (2011). Time‐dependent rates of molecular evolution. Molecular Ecology, 20(15), 3087-3101. doi: https://dx.doi.org/10.1111/j.1365-294X.2011.05178.x
[4] Chapuis, M.-P., Raynal, L., Plantamp, C., Meynard, C. N., Blondin, L., Marin, J.-M., & Estoup, A. (2020). A young age of subspecific divergence in the desert locust Schistocerca gregaria, inferred by ABC Random Forest. bioRxiv, 671867, ver. 4 peer-reviewed and recommended by PCI Evolutionary Biology. doi: https://dx.doi.org/10.1101/671867
[5] Chapuis, M.-P., Plantamp, C., Streiff, R., Blondin, L., & Piou, C. (2015). Microsatellite evolutionary rate and pattern in Schistocerca gregaria inferred from direct observation of germline mutations. Molecular Ecology, 24(24), 6107-6119. doi: https://dx.doi.org/10.1111/mec.13465
[6] Raynal, L., Marin, J. M., Pudlo, P., Ribatet, M., Robert, C. P., & Estoup, A. (2018). ABC random forests for Bayesian parameter inference. Bioinformatics, 35(10), 1720-1728. doi: https://dx.doi.org/10.1093/bioinformatics/bty867
[7] Estoup, A., Jarne, P., & Cornuet, J. M. (2002). Homoplasy and mutation model at microsatellite loci and their consequences for population genetics analysis. Molecular Ecology, 11(9), 1591-1604. doi: https://dx.doi.org/10.1046/j.1365-294X.2002.01576.x
[8] Moodley, Y. et al. (2018). Contrasting evolutionary history, anthropogenic declines and genetic contact in the northern and southern white rhinoceros (Ceratotherium simum). Proceedings of the Royal Society B, 285(1890), 20181567. doi: https://dx.doi.org/10.1098/rspb.2018.1567
[9] Kröpelin, S. et al. (2008). Climate-driven ecosystem succession in the Sahara: the past 6000 years. Science, 320(5877), 765-768. doi: https://dx.doi.org/10.1126/science.1154913
[10] Maley, J. et al. (2018). Late Holocene forest contraction and fragmentation in central Africa. Quaternary Research, 89(1), 43-59. doi: https://dx.doi.org/10.1017/qua.2017.97
[11] Beheregaray, L. B. (2008). Twenty years of phylogeography: the state of the field and the challenges for the Southern Hemisphere. Molecular Ecology, 17(17), 3754-3774. doi: https://dx.doi.org/10.1111/j.1365-294X.2008.03857.x
[12] Dubey, S., & Shine, R. (2012). Are reptile and amphibian species younger in the Northern Hemisphere than in the Southern Hemisphere? Journal of Evolutionary Biology, 25(1), 220-226. doi: https://dx.doi.org/10.1111/j.1420-9101.2011.02417.x