Submit a preprint

Latest recommendations

IdTitle * Authors * Abstract * Picture * Thematic fields * RecommenderReviewersSubmission date
24 Mar 2025
article picture

On the potential for GWAS with phenotypic population means and allele-frequency data (popGWAS)

popGWAS: Data-efficient trait mapping in natural populations for biodiversity research

Recommended by ORCID_LOGO based on reviews by Petri Kemppainen and 1 anonymous reviewer

The study by Pfenninger (2025) addresses the critical need to understand the genomic basis of ecologically important traits to better predict and respond to the impacts of global change on biodiversity (Gienapp et al. 2017). It introduces the popGWAS, a novel GWAS approach, which utilizes phenotypic population means and genome-wide allele frequency data, obtainable through methods like Pool-sequencing (Pool-Seq), to identify the genetic loci underlying quantitative polygenic traits in natural populations and predict their mean. The core idea is that trait-increasing alleles should exhibit higher frequencies in populations with higher mean trait values. popGWAS then maps mean allele frequencies across populations to their trait means. Working with as many allele frequency values as populations sampled, popGWAS potentially has more power to find significant associations at genomic loci than individual-based GWAS working with three genotypes at a locus. This new method addresses some of the problems faced by traditional genome-wide association studies (GWAS), which require extensive resources and large sample sizes, posing challenges for biodiversity research on non-model species in natural populations. 

To evaluate the effectiveness of popGWAS, Pfenninger (2025) conducted extensive population genetic forward simulations, examining scenarios with varying numbers of populations, ranging from 12 to 60. The results indicated that popGWAS performance improved with increasing sample size, showing a diminishing return above 36 populations. In a direct comparison across all simulation scenarios, popGWAS consistently outperformed individual-based GWAS (iGWAS). On average, popGWAS identified more true positive loci than iGWAS. In addition, when combined with minimum entropy feature selection (MEFS), popGWAS achieved large predictive accuracy of population means of 0.8 or better in over 97% of simulations with 36 or more populations, regardless of other parameters. In contrast, iGWAS failed to generate valid phenotypic predictions in over 70% of the simulations. Also, unlike iGWAS, popGWAS did not suffer from p-value inflation. Yet, population structure or varying levels of relatedness among individuals were not fully accounted for in the simulations. The extent to which popGWAS would be sensitive to such individual covariates remains to be shown. Finally, popGWAS was relatively insensitive to low trait heritability because random individual variation gets averaged out when calculating the population mean trait value. 

The study demonstrates that popGWAS is a promising approach, particularly for oligogenic and moderately polygenic traits. The method performs more poorly for polygenic traits with large genetic redundancy, where different alleles contribute to the same trait mean in different populations. The method thus performs better when large-effect loci contribute to genetic differentiation in parallel across populations, as expected when gene flow is moderate to high (Yeaman & Whitlock 2011). Low genomic predictability is reached when drift dominates or when genetic architectures are highly polygenic.

The popGWAS method proved effective with a moderate number of sampled populations and, when combined with machine learning for genomic prediction, exhibited strong performance in predicting population means, even for low-heritability traits. Notably, popGWAS consistently outperformed iGWAS in terms of identifying true positive loci and prediction accuracy. This suggests that popGWAS can make GWAS studies more accessible for biodiversity genomics research, providing a valuable tool for dissecting the genetic basis of complex traits in natural populations. A key aspect contributing to the efficiency of popGWAS is its compatibility with pooled sequencing (Pool-Seq). Pool-Seq provides estimates of allele frequencies within a population by sequencing a mixed DNA sample representing multiple individuals from that population (Futschik & Schlötterer 2010). This approach is significantly more cost-effective than sequencing each individual separately, allowing researchers to obtain genome-wide allele frequency data across multiple populations with a substantially reduced budget. This data efficiency makes GWAS more accessible to a wider range of researchers, particularly those working in biodiversity genomics where financial resources may be limited. Furthermore, popGWAS can be coupled with bulk phenotyping methods, such as automatic video recording, remote sensing, metabolomics/transcriptomics, etc., to efficiently obtain population-level phenotypic data, further streamlining the research process. Ultimately, popGWAS represents a valuable addition to the geneticist's toolkit, offering a complementary approach to iGWAS that can be particularly advantageous in specific research contexts where predicting trait mean is more important than resolving the precise genetic basis of a trait.

 

References

Futschik, A. and Schlötterer, C. 2010. The Next Generation of Molecular Markers From Massively Parallel Sequencing of Pooled DNA Samples. Genetics 186(1): 207-218. https://doi.org/10.1534/genetics.110.114397

Gienapp, P., Fior, S., Guillaume, F., Lasky, J. R., Sork, V. L. and Csilléry, K. 2017. Genomic Quantitative Genetics to Study Evolution in the Wild. Trends Ecol. Evol. 32(12): 897-908. https://doi.org/10.1016/j.tree.2017.09.004

Markus Pfenninger (2025) On the potential for GWAS with phenotypic population means and allele-frequency data (popGWAS). bioRxiv, ver.3 peer-reviewed and recommended by PCI Evol Biol https://doi.org/10.1101/2024.06.12.598621

Yeaman, S. and Whitlock, M. C. 2011. The genetic architecture of adaptation under migration-selection balance. Evolution 65(7): 1897-1911. https://doi.org/10.1111/j.1558-5646.2011.01269.x

 

On the potential for GWAS with phenotypic population means and allele-frequency data (popGWAS)Markus Pfenninger<p>It is vital to understand the genomic basis of differences in ecologically important traits if we are to understand the impact of global change on biodiversity and enhance our ability for targeted intervention. This study explores the potential...Bioinformatics & Computational Biology, Evolutionary Applications, Genotype-Phenotype, Population Genetics / GenomicsFrédéric Guillaume2024-06-15 08:41:14 View
05 Apr 2024
article picture

Does the seed fall far from the tree? Weak fine scale genetic structure in a continuous Scots pine population

Weak spatial genetic structure in a large continuous Scots pine population – implications for conservation and breeding

Recommended by ORCID_LOGO based on reviews by Joachim Mergeay, Jean-Baptiste Ledoux and Roberta Loh

Spatial genetic structure, i.e. the non-random spatial distribution of genotypes, arises in populations because of different processes including spatially limited dispersal and selection. Knowledge on the spatial genetic structure of plant populations is important to assess biological parameters such as gene dispersal distances and the potential for local adaptations, as well as for applications in conservation management and breeding. In their work, Niskanen and colleagues demonstrate a multifaceted approach to characterise the spatial genetic structure in two replicate sites of a continuously distributed Scots pine population in South-Eastern Finland. They mapped and assessed the ages of 469 naturally regenerated adults and genotyped them using a SNP array which resulted in 157 325 filtered polymorphic SNPs. Their dataset is remarkably powerful because of the large numbers of both individuals and SNPs genotyped. This made it possible to characterise precisely the decay of genetic relatedness between individuals with spatial distance despite the extensive dispersal capacity of Scots pine through pollen, and ensuing expectations of an almost panmictic population.

The authors’ data analysis was particularly thorough. They demonstrated that two metrics of pairwise relatedness, the genomic relationship matrix (GRM, Yang et al. 2011) and the kinship coefficient (Loiselle et al. 1995) were strongly correlated and produced very similar inference of family relationships: >99% of pairs of individuals were unrelated, and the remainder exhibited 2nd (e.g., half-siblings) to 4th degree relatedness. Pairwise relatedness decayed with spatial distance which resulted in extremely weak but statistically significant spatial genetic structure in both sites, quantified as Sp=0.0005 and Sp=0.0008. These estimates are at least an order of magnitude lower than estimates in the literature obtained in more fragmented populations of the same species or in other conifers. Estimates of the neighbourhood size, the effective number of potentially mating individuals belonging to a within-population neighbourhood (Wright 1946), were relatively large with Nb=1680-3210 despite relatively short gene dispersal distances, σg = 36.5–71.3m, which illustrates the high effective density of the population. 

The authors showed the implications of their findings for selection. The capacity for local adaptation depends on dispersal distances and the strength of the selection coefficient. In the study population, the authors inferred that local adaptation can only occur if environmental heterogeneity occurs over a distance larger than approximately one kilometre (or larger, if considering long-distance dispersal). Interestingly, in Scots pine, no local adaptation has been described on similar geographic scales, in contrast to some other European or Mediterranean conifers (Scotti et al. 2023).

The authors’ results are relevant for the management of conservation and breeding. They showed that related individuals occurred within sites only and that they shared a higher number of rare alleles than unrelated ones. Since rare alleles are enriched in new and recessive deleterious variants, selecting related individuals could have negative consequences in breeding programmes. The authors also showed, in their response to reviewers, that their powerful dataset was not suitable to obtain a robust estimate of effective population size, Ne, based on the linkage disequilibrium method (Do et al. 2014). This illustrated that the estimation of Ne used for genetic indicators supported in international conservation policy (Hoban et al. 2020, CBD 2022) remains challenging in large and continuous populations (see also Santo-del-Blanco et al. 2023, Gargiulo et al. 2024).

References

CBD (2022) Kunming-Montreal Global Biodiversity Framework. https://www.cbd.int/doc/decisions/cop-15/cop-15-dec-04-en.pdf

Do C, Waples RS, Peel D, Macbeth GM, Tillett BJ, Ovenden JR (2014). NeEstimator v2: re-implementation of software for the estimation of contemporary effective population size (Ne ) from genetic data. Molecular Ecology Resources 14: 209–214. https://doi.org/10.1111/1755-0998.12157

Gargiulo R, Decroocq V, González-Martínez SC, Paz-Vinas I, Aury JM, Kupin IL, Plomion C, Schmitt S, Scotti I, Heuertz M (2024) Estimation of contemporary effective population size in plant populations: limitations of genomic datasets. Evolutionary Applications, in press, https://doi.org/10.1101/2023.07.18.549323

Hoban S, Bruford M, D’Urban Jackson J, Lopes-Fernandes M, Heuertz M, Hohenlohe PA, Paz-Vinas I, et al. (2020) Genetic diversity targets and indicators in the CBD post-2020 Global Biodiversity Framework must be improved. Biological Conservation 248: 108654. https://doi.org/10.1016/j.biocon.2020.108654

Loiselle BA, Sork VL, Nason J & Graham C (1995) Spatial genetic structure of a tropical understorey shrub, Psychotria officinalis (Rubiaceae). American Journal of Botany 82: 1420–1425. https://doi.org/10.1002/j.1537-2197.1995.tb12679.x

Santos-del-Blanco L, Olsson S, Budde KB, Grivet D, González-Martínez SC, Alía R, Robledo-Arnuncio JJ (2022). On the feasibility of estimating contemporary effective population size (Ne) for genetic conservation and monitoring of forest trees. Biological Conservation 273: 109704. https://doi.org/10.1016/j.biocon.2022.109704

Scotti I, Lalagüe H, Oddou-Muratorio S, Scotti-Saintagne C, Ruiz Daniels R, Grivet D, et al. (2023) Common microgeographical selection patterns revealed in four European conifers. Molecular Ecology 32: 393-411. https://doi.org/10.1111/mec.16750

Wright S (1946) Isolation by distance under diverse systems of mating. Genetics 31: 39–59. https://doi.org/10.1093/genetics/31.1.39

Yang J, Lee SH, Goddard ME & Visscher PM (2011) GCTA: a tool for genome-wide complex trait analysis. The American Journal of Human Genetics 88: 76–82. https://www.cell.com/ajhg/pdf/S0002-9297(10)00598-7.pdf

Does the seed fall far from the tree? Weak fine scale genetic structure in a continuous Scots pine populationAlina K. Niskanen, Sonja T. Kujala, Katri Kärkkäinen, Outi Savolainen, Tanja Pyhäjärvi<p>Knowledge of fine-scale spatial genetic structure, i.e., the distribution of genetic diversity at short distances, is important in evolutionary research and in practical applications such as conservation and breeding programs. In trees, related...Adaptation, Evolutionary Applications, Population Genetics / GenomicsMyriam Heuertz Joachim Mergeay2023-06-27 21:57:28 View
08 Oct 2019
article picture

Strong habitat and weak genetic effects shape the lifetime reproductive success in a wild clownfish population

Habitat variation of wild clownfish population shapes selfrecruitment more than genetic effects

Recommended by Philip Munday ? based on reviews by Juan Diego Gaitan-Espitia and Loeske Kruuk

Estimating the genetic and environmental components of variation in reproductive success is crucial to understanding the adaptive potential of populations to environmental change. To date, the heritability of lifetime reproductive success (fitness) has been estimated in a handful of wild animal population, mostly in mammals and birds, but has never been estimated for a marine species. The primary reason that such estimates are lacking in marine species is that most marine organisms have a dispersive larval phase, making it extraordinarily difficult to track the fate of offspring from one generation to the next.
In this study, Salles et al. [1] use an unprecedented 10 year data set for a wild population of orange clownfish (Amphiprion percula) to estimate the environmental, maternal and additive genetic components of life time reproductive success for the self-recruiting portion of the local population. Previous studies show that over 50% of juvenile clownfish recruiting to the population of clownfish at Kimbe Island (Kimbe Bay, PNG) are natal to the population. In other words, >50% of the juveniles recruiting to the population at Kimbe Island are offspring of parents from Kimbe Island. The identity and location of every adult clownfish in the Kimbe Island population was tracked over 10 years. At the same time newly recruiting juveniles were collected at regular intervals (biennially) and their parentage assigned with high confidence by 22 polymorphic microsatellite loci. Salles et al. then used a pedigree comprising 1735 individuals from up to 5 generations of clownfish at Kimbe Island to assess the contribution of every breeding pair of clownfish to self-recruitment within the local population. Because clownfish are site attached and live in close association with a host sea anemone, it was also possible to examine the contribution of reef location and host anemones species (either Heteractis magnifica or Stichodactyla gigantea) to reproductive success within the local population.
The study found that breeders from the eastern side of Kimbe Island, and mostly inhabiting S. gigantea sea anemones, produced more juveniles that recruited to the local population than breeders from other location around the island, or inhabiting H. magnifica. In fact, host anemone species and geographic location explained about 97% of the variance in reproductive success within the local population (i.e. excluding successful recruitment to other populations). By contrast, maternal and additive genetic effects explained only 1.9% and 1.3% of the variance, respectively. In other words, reef location and the species of host anemone inhabited had an overwhelming influence on the long-term contribution of breeding pairs of clownfish to replenishment of the local population. This overwhelming effect of the local habitat on reproductive success means that the population is potentially susceptible to rapid environmental changes - for example if S. giganta sea anemones are disproportionately susceptible to global warming, or reef habitats on the eastern side of the island are more susceptible to disturbance. By contrast, the small component of additive genetic variance in local reproductive success translated into low heritability and evolvability of lifetime reproductive success within the local population, as predicted by theory [2] and observed in some terrestrial species. Consequently, fitness would evolve slowly to environmental change.
Establishing the components of variation in fitness in a wild population of marine fishes is an astonishing achievement, made possible by the unprecedented long-term individual-level monitoring of the entire population of clownfish at Kimbe Island. A next step in this research would be to include other clownfish populations that are demographically and genetically connected to the Kimbe Island population through larval dispersal. It would be intriguing to establish the environmental, maternal and additive genetic components of reproductive success in the dispersing part of the Kimbe Island population, to see if this potentially differs among breeders who contribute more or less to replenishment within the local population.

References

[1] Salles, O. C., Almany, G. R., Berumen, M.L., Jones, G. P., Saenz-Agudelo, P., Srinivasan, M., Thorrold, S. R., Pujol, B., Planes, S. (2019). Strong habitat and weak genetic effects shape the lifetime reproductive success in a wild clownfish population. Zenodo, 3476529, ver. 3 peer-reviewed and recommended by Peer Community In Evolutionary Biology. doi: 10.5281/zenodo.3476529
[2] Fisher, R.A. (1930). The genetical theory of natural selection. Clarendon Press, Oxford, U.K.

Strong habitat and weak genetic effects shape the lifetime reproductive success in a wild clownfish populationOcéane C. Salles, Glenn R. Almany, Michael L. Berumen, Geoffrey P. Jones, Pablo Saenz-Agudelo, Maya Srinivasan, Simon Thorrold, Benoit Pujol, Serge Planes<p>Lifetime reproductive success (LRS), the number of offspring an individual contributes to the next generation, is of fundamental importance in ecology and evolutionary biology. LRS may be influenced by environmental, maternal and additive genet...Adaptation, Evolutionary Ecology, Life History, Quantitative GeneticsPhilip Munday 2018-10-01 09:00:53 View
20 Dec 2016
article picture
POSTPRINT

Experimental Evolution of Gene Expression and Plasticity in Alternative Selective Regimes

Genetic adaptation counters phenotypic plasticity in experimental evolution

Recommended by and

How do phenotypic plasticity and adaptive evolution interact in a novel or changing environment? Does evolution by natural selection generally reinforce initially plastic phenotypic responses, or does it instead oppose them? And to what extent does evolution of a trait involve evolution of its plasticity? These questions have lied at the heart of research on phenotypic evolution in heterogeneous environments ever since it was realized that the environment is likely to affect the expression of many (perhaps most) characters of an individual. Importantly, this broad definition of phenotypic plasticity as change in the average phenotype of a given genotype in response to its environment of development (or expression) does not involve any statement about the adaptiveness of the plastic changes. Theory on the evolution of plasticity has devoted much effort to understanding how reaction norm should evolve under different regimes of environmental change in space and time, and depending on genetic constraints on reaction norm shapes. However on an empirically ground, the questions above have mostly been addressed for individual traits, often chosen a priori for their likeliness to exhibit adaptive plasticity, and we still lack more systematic answers. These can be provided by so-called ‘phenomic’ approaches, where a large number of traits are tracked without prior information on their biological or ecological function. A problem is that the number of phenotypic characters that can be measured in an organism is virtually infinite (and to some extent arbitrary), and that scaling issues makes it difficult to compare different sets of traits. Gene-expression levels offer a partial solution to this dilemma, as they can be considered as a very large number of traits (one per typed gene) that can be measured easily and uniformly (fold change in the number of reads in RNAseq). As for any traits, expression levels of different genes may be genetically correlated, to an extent that depends on their regulation mechanism: cis-regulatory sequences that only affect expression of neighboring genes are likely to cause independent gene expression, while more systematic modifiers of expression (e.g. trans-regulators such as transcription factors) may cause correlated genetic responses of the expression of many genes. Huang and Agrawal [1] have studied plasticity and evolution of gene expression level in young larvae of populations of Drosophila melanogaster that have evolved for about 130 generations under either a constant environment (salt or cadmium), or an environment that is heterogeneous in time or space (combining salt and cadmium). They report a wealth of results, of which we summarize the most striking here. First, among genes that (i) were initially highly plastic and (ii) evolved significant divergence in expression levels between constant environment treatments, the evolved divergence is predominantly in the opposite direction to the initial plastic response. This suggests that either plasticity was initially maladaptive, or the selective pressure changed during the evolutionary process (see below). This somewhat unexpected result strikingly mirrors that from a study published last year in Nature [2], where the same pattern was found for responses of guppies to the presence of predators. However, Huang and Agrawal [1] went beyond this study by deciphering the underlying mechanisms in several interesting ways. First, they showed that change in gene expression often occurred at genes close to SNPs with differentiated frequencies across treatments (but not at genes with differentiated SNPs in their coding sequences), suggesting that cis-regulatory sequences are involved. This is also suggested by the fact that changes in gene expression are mostly caused by the increased expression of only one allele at polymorphic loci, and is a first step towards investigating the genetic underpinnings of (co)variation in gene expression levels. Another interesting set of findings concerns evolution of plasticity in treatments with variable environments. To compare the gene-expression plasticity that evolved in these treatments to an expectation, the authors considered that the expression levels in populations maintained for a long time under constant salt or cadmium had reached an optimum. The differences between these expression levels were thus assumed to predict the level of plasticity that should evolve in a heterogeneous environment (with both cadmium and salt) under perfect environmental predictability. The authors showed that plasticity did evolve more in the expected direction in heterogeneous than in constant environments, resulting in better adapted final expression levels across environments. Taken collectively, these results provide an unprecedented set of patterns that are greatly informative on how plasticity and evolution interact in constant versus changing environments. But of course, interpretations in terms of adaptive versus maladaptive plasticity are more challenging, as the authors themselves admit. Even though environmentally determined gene expression is the basic mechanism underlying the phenotypic plasticity of most traits, it is extremely difficult to relate to more integrated phenotypes for which we can understand the selection pressures, especially in multicellular organisms. The authors have recently investigated evolutionary change of quantitative traits in these selected lines, so it might be possible to establish links between reaction norms for macroscopic traits to those for gene expression levels. Such an approach would also involve tracking gene expression throughout life, rather than only in young larvae as done here, thus putting phenotypic complexity back in the picture also for expression levels. Another difficulty is that a plastic response that was originally adaptive may be replaced by an opposite evolutionary response in the long run, without having to invoke initially maladaptive plasticity. For instance, the authors mention the possibility that a generic stress response is initially triggered by cadmium, but is eventually unnecessary and costly after evolution of genetic mechanisms for cadmium detoxification (a case of so-called genetic accommodation). In any case, this study by Huang and Agrawal [1], together with the one by Ghalambor et al. last year [2], reports novel and unexpected results, which are likely to stimulate researchers interested in plasticity and evolution in heterogeneous environments for the years to come.

References

[1] Huang Y, Agrawal AF. 2016. Experimental Evolution of Gene Expression and Plasticity in Alternative Selective Regimes. PLoS Genetics 12:e1006336. doi: 10.1371/journal.pgen.1006336

[2] Ghalambor CK, Hoke KL, Ruell EW, Fischer EK, Reznick DN, Hughes KA. 2015. Non-adaptive plasticity potentiates rapid adaptive evolution of gene expression in nature. Nature 525: 372-375. doi: 10.1038/nature15256

Experimental Evolution of Gene Expression and Plasticity in Alternative Selective RegimesHuang Y, Agrawal AF<p>Little is known of how gene expression and its plasticity evolves as populations adapt to different environmental regimes. Expression is expected to evolve adaptively in all populations but only those populations experiencing environmental hete...Adaptation, Experimental Evolution, Expression Studies, Phenotypic PlasticityLuis-Miguel Chevin2016-12-20 09:04:15 View
23 Jan 2023
article picture

The genetic architecture of local adaptation in a cline

Environmental and fitness landscapes matter for the genetic basis of local adaptation

Recommended by ORCID_LOGO based on reviews by 2 anonymous reviewers

Natural landscapes are often composite, with spatial variation in environmental factors being the norm rather than exception. Adaptation to such variation is a major driver of diversity at all levels of biological organization, from genes to phenotypes, species and ultimately ecosystems. While natural selection favours traits that show a better fit to local conditions, the genomic response to such selection is not necessarily straightforward. This is because many quantitative traits are complex and the product of many loci, each with a small to moderate phenotypic contribution. Adapting to environmental challenges that occur in narrow ranges may thus prove difficult as each individual locus is easily swamped by alleles favoured across the rest of the population range. 

To better understand whether and how evolution overcomes such a hurdle, Laroche and Lenormand [1]  combine quantitative genetics and population genetic modelling to track genomic changes that underpin a trait whose fitness optimum differs between a certain spatial range, referred to as a “pocket”, and the rest of the habitat. As it turns out from their analysis, one critical and probably underappreciated factor in determining the type of genetic architecture that evolves is how fitness declines away from phenotypic optima. One classical and popular model of fitness landscape that relates trait value to reproductive success is Gaussian, whereby small trait variations away from the optimum result in even smaller variations in fitness. This facilitates local adaptation via the invasion of alleles of small effects as carriers inside the pocket show a better fit while those outside the pocket only suffer a weak fitness cost. By contrast, when the fitness landscape is more peaked around the optimum, for instance where the decline is linear, adaptation through weak effect alleles is less likely, requiring larger pockets that are less easily swamped by alleles selected in the rest of the range.  

In addition to mathematically investigating the initial emergence of local adaptation, Laroche and Lenormand use computer simulations to look at its long-term maintenance. In principle, selection should favour a genetic architecture that consolidates the phenotype and increases its heritability, for instance by grouping several alleles of large effects close to one another on a chromosome to avoid being broken down by meiotic recombination. Whether or not this occurs also depends on the fitness landscape. When the landscape is Gaussian, the genetic architecture of the trait eventually consists of tightly linked alleles of large effects. The replacement of small effects by large effects loci is here again promoted by the slow fitness decline around the optimum. This is because any shift in architecture in an adapted population requires initially crossing a fitness valley. With a Gaussian landscape, this valley is shallow enough to be crossed, facilitated by a bit of genetic drift. By contrast, when fitness declines linearly around the optimum, genetic architecture is much less evolutionarily labile as any architecture change initially entails a fitness cost that is too high to bear.     

Overall, Laroche and Lenormand provide a careful and thought-provoking analysis of a classical problem in population genetics. In addition to questioning some longstanding modelling assumptions, their results may help understand why differentiated populations are sometimes characterized by “genomic islands” of divergence, and sometimes not. 

References

[1] Laroche F, Lenormand T (2022) The genetic architecture of local adaptation in a cline. bioRxiv, 2022.06.30.498280, ver. 4 peer-reviewed and recommended by Peer Community in Evolutionary Biology. https://doi.org/10.1101/2022.06.30.498280

The genetic architecture of local adaptation in a clineFabien Laroche, Thomas Lenormand<p>Local adaptation is pervasive. It occurs whenever selection favors different phenotypes in different environments, provided that there is genetic variation for the corresponding traits and that the effect of selection is greater than the effect...Adaptation, Evolutionary Theory, Genome Evolution, Molecular Evolution, Population Genetics / Genomics, Quantitative GeneticsCharles Mullon2022-07-07 08:46:47 View
15 Feb 2019
article picture

Architectural traits constrain the evolution of unisexual flowers and sexual segregation within inflorescences: an interspecific approach

Sometimes, sex is in the head

Recommended by ORCID_LOGO based on reviews by 3 anonymous reviewers

Plants display an amazing diversity of reproductive strategies with and without sex. This diversity is particularly remarkable in flowering plants, as highlighted by Charles Darwin, who wrote several botanical books scrutinizing plant reproduction. One particularly influential work concerned floral variation [1]. Darwin recognized that flowers may present different forms within a single population, with or without sex specialization. The number of species concerned is small, but they display recurrent patterns, which made it possible for Darwin to invoke natural and sexual selection to explain them. Most of early evolutionary theory on the evolution of reproductive strategies was developed in the first half of the 20th century and was based on animals. However, the pioneering work by David Lloyd from the 1970s onwards excited interest in the diversity of plant sexual strategies as models for testing adaptive hypotheses and predicting reproductive outcomes [2]. The sex specialization of individual flowers and plants has since become one of the favorite topics of evolutionary biologists. However, attention has focused mostly on cases related to sex differentiation (dioecy and associated conditions [3]). Separate unisexual flower types on the same plant (monoecy and related cases, rendering the plant functionally hermaphroditic) have been much less studied, apart from their possible role in the evolution of dioecy [4] or their association with particular modes of pollination [5].
Two specific non-mutually exclusive hypotheses on the evolution of separate sexes in flowers (dicliny) have been proposed, both anchored in Lloyd’s views and Darwin’s legacy, with selfing avoidance and optimal limited resource allocation. Intermediate sex separation, in which sex morphs have different combinations of unisexual and hermaphrodite flowers, has been crucial for testing these hypotheses through comparative analyses of optimal conditions in suggested transitions. Again, cases in which floral unisexuality does not lead to sex separation have been studied much less than dioecious plants, at both the microevolutionary and macroevolutionary levels. It is surprising that the increasing availability of plant phylogenies and powerful methods for testing evolutionary transitions and correlations have not led to more studies, even though the frequency of monoecy is probably highest among diclinous species (those with unisexual flowers in any distribution among plants within a population [6]).
The study by Torices et al. [7] aims to fill this gap, offering a different perspective to that provided by Diggle & Miller [8] on the evolution of monoecious conditions. The authors use heads of a number of species of the sunflower family (Asteraceae) to test specifically the effect of resource limitation on the expression of sexual morphs within the head. They make use of the very particular and constant architecture of inflorescences in these species (the flower head or “capitulum”) and the diversity of sexual conditions (hermaphrodite, gynomonoecious, monoecious) and their spatial pattern within the flower head in this plant family to develop an elegant means of testing this hypothesis. Their results are consistent with their expectations on the effect of resource limitation on the head, as determined by patterns of fruit size within the head, assuming that female fecundity is more strongly limited by resource availability than male function.
The authors took on a huge challenge in choosing to study the largest plant family (about 25 thousand species). Their sample was limited to only about a hundred species, but species selection was very careful, to ensure that the range of sex conditions and the available phylogenetic information were adequately represented. The analytical methods are robust and cast no doubt on the reported results. However, I can’t help but wonder what would happen if the antiselfing hypothesis was tested simultaneously. This would require self-incompatibility (SI) data for the species sample, as the presence of SI is usually invoked as a powerful antiselfing mechanism, rendering the unisexuality of flowers unnecessary. However, SI is variable and frequently lost in the sunflower family [9]. I also wonder to what extent the very specific architecture of flower heads imposes an idiosyncratic resource distribution that may have fixed these sexual systems in species and lineages of the family. Although not approached in this study, intraspecific variation seems to be low. It would be very interesting to use similar approaches in other plant groups in which inflorescence architecture is lax and resource distribution may differ. A whole-plant approach might be required, rather than investigations of single inflorescences as in this study. This study has no flaws, but instead paves the way for further testing of a long-standing dual hypothesis, probably with different outcomes in different ecological and evolutionary settings. In the end, sex is not only in the head.

References

[1] Darwin, C. (1877). The different forms of flowers on plants of the same species. John Murray.
[2] Barrett, S. C. H., and Harder, L. D. (2006). David G. Lloyd and the evolution of floral biology: from natural history to strategic analysis. In L.D. Harder, L. D., and Barrett, S. C. H. (eds) Ecology and Evolution of Flowers. OUP, Oxford. Pp 1-21.
[3] Geber, M. A., Dawson, T. E., and Delph, L. F. (eds) (1999). Gender and sexual dimorphism in flowering plants. Springer, Berlin.
[4] Charlesworth, D. (1999). Theories of the evolution of dioecy. In Geber, M. A., Dawson T. E. and Delph L. F. (eds) (1999). Gender and sexual dimorphism in flowering plants. Springer, Berlin. Pp. 33-60.
[5] Friedman, J., and Barrett, S. C. (2008). A phylogenetic analysis of the evolution of wind pollination in the angiosperms. International Journal of Plant Sciences, 169(1), 49-58. doi: 10.1086/523365
[6] Renner, S. S. (2014). The relative and absolute frequencies of angiosperm sexual systems: dioecy, monoecy, gynodioecy, and an updated online database. American Journal of botany, 101(10), 1588-1596. doi: 10.3732/ajb.1400196
[7] Torices, R., Afonso, A., Anderberg, A. A., Gómez, J. M., and Méndez, M. (2019). Architectural traits constrain the evolution of unisexual flowers and sexual segregation within inflorescences: an interspecific approach. bioRxiv, 356147, ver. 3 peer-reviewed and recommended by PCI Evol Biol. doi: 10.1101/356147
[8] Diggle, P. K., and Miller, J. S. (2013). Developmental plasticity, genetic assimilation, and the evolutionary diversification of sexual expression in Solanum. American journal of botany, 100(6), 1050-1060. doi: 10.3732/ajb.1200647
[9] Ferrer, M. M., and Good‐Avila, S. V. (2007). Macrophylogenetic analyses of the gain and loss of self‐incompatibility in the Asteraceae. New Phytologist, 173(2), 401-414. doi: 10.1111/j.1469-8137.2006.01905.x

Architectural traits constrain the evolution of unisexual flowers and sexual segregation within inflorescences: an interspecific approachRubén Torices, Ana Afonso, Arne A. Anderberg, José M. Gómez and Marcos Méndez<p>Male and female unisexual flowers have repeatedly evolved from the ancestral bisexual flowers in different lineages of flowering plants. This sex specialization in different flowers often occurs within inflorescences. We hypothesize that inflor...Evolutionary Ecology, Morphological Evolution, Phenotypic Plasticity, Reproduction and Sex, Sexual SelectionJuan Arroyo Jana Vamosi, Marcial Escudero, Anonymous2018-06-27 10:49:52 View
06 Sep 2022
article picture

Masculinization of the X-chromosome in aphid soma and gonads

Sex-biased gene expression is not tissue-specific in Pea Aphids

Recommended by and based on reviews by Ann Kathrin Huylmans and 1 anonymous reviewer

Sexual antagonism (SA), wherein the fitness interests of the sexes do not align, is inherent to organisms with two (or more) sexes.  SA leads to intra-locus sexual conflict, where an allele that confers higher fitness in one sex reduces fitness in the other [1, 2].  This situation leads to what has been referred to as "gender load", resulting from the segregation of SA alleles in the population.  Gender load can be reduced by the evolution of sex-specific (or sex-biased) gene expression.  A specific prediction is that gene-duplication can lead to sub- or neo-functionalization, in which case the two duplicates partition the function in the different sexes.  The conditions for invasion by a SA allele differ between sex-chromosomes and autosomes, leading to the prediction that (in XY or XO systems) the X should accumulate recessive male-favored alleles and dominant female-favored alleles; similar considerations apply in ZW systems ([3, but see 4].

Aphids present an interesting special case, for several reasons: they have XO sex-determination, and three distinct reproductive morphs (sexual females, parthenogenetic females, and males).  Previous theoretical work by the lead author predict that the X should be optimized for male function, which was borne out by whole-animal transcriptome analysis [5].  

Here [6], the authors extend that work to investigate “tissue”-specific (heads, legs and gonads), sex-specific gene expression.  They argue that, if intra-locus SA is the primary driver of sex-biased gene expression, it should be generally true in all tissues.  They set up as an alternative the possibility that sex-biased gene expression could also be driven by dosage compensation.  They cite references supporting their argument that "dosage compensation (could be) stronger in the brain", although the underlying motivation for that argument appears to be based on empirical evidence rather than theoretical predictions.      

At any rate, the results are clear: all tissues investigated show masculinization of the X.  Further, X-linked copies of gene duplicates were more frequently male-biased than duplicated autosomal genes or X-linked single-copy genes.

To sum up, this is a nice empirical study with clearly interpretable (and interpreted) results, the most obvious of which is the greater sex-biased expression in sexually-dimorphic tissues.  Unfortunately, as the authors emphasize, there is no general theory by which SA, variable dosage-compensation, and meiotic sex chromosome inactivation can be integrated in a predictive framework.  It is to be hoped that empirical studies such as this one will motivate deeper and more general theoretical investigations.

References

[1] Rice WR, Chippindale AK (2001) Intersexual ontogenetic conflict. Journal of Evolutionary Biology 14: 685-693. https://doi.org/10.1046/j.1420-9101.2001.00319.x

[2] Bonduriansky R, Chenoweth SF (2009) Intralocus sexual conflict. Trends Ecol Evol 24: 280-288. https://doi.org/10.1016/j.tree.2008.12.005

[3] Rice WR. (1984) Sex chromosomes and the evolution of sexual dimorphism. Evolution 38: 735-742. https://doi.org/10.1086/595754

[4] Fry JD (2010) The genomic location of sexually antagonistic variation: some cautionary comments. Evolution 64: 1510-1516. https://doi.org/10.1111%2Fj.1558-5646.2009.00898.x

[5] Jaquiéry J, Rispe C, Roze D, Legeai F, Le Trionnaire G, Stoeckel S, et al. (2013) Masculinization of the X Chromosome in the Pea Aphid. PLoS Genetics 9. https://doi.org/10.1371/journal.pgen.1003690

[6] Jaquiéry J, Simon J-C, Robin S, Richard G, Peccoud J, Boulain H, Legeai F, Tanguy S, Prunier-Leterme N, Le Trionnaire G (2022) Masculinization of the X-chromosome in aphid soma and gonads. bioRxiv, 2021.08.13.453080, ver. 4 peer-reviewed and recommended by Peer Community in Evolutionary Biology. https://doi.org/10.1101/2021.08.13.453080 

Masculinization of the X-chromosome in aphid soma and gonadsJulie Jaquiery, Jean-Christophe Simon, Stephanie Robin, Gautier Richard, Jean Peccoud, Helene Boulain, Fabrice Legeai, Sylvie Tanguy, Nathalie Prunier-Leterme, Gael Letrionnaire<p>Males and females share essentially the same genome but differ in their optimal values for many phenotypic traits, which can result in intra-locus conflict between the sexes. Aphids display XX/X0 sex chromosomes and combine unusual X chromosome...Genetic conflicts, Genome Evolution, Reproduction and SexCharles Baer2021-08-16 08:56:08 View
13 Mar 2025
article picture

Unraveling genetic load dynamics during biological invasion: insights from two invasive insect species

The genetic load of invasive population: how little do we know ?

Recommended by ORCID_LOGO based on reviews by Sylvain Glémin and 2 anonymous reviewers

We live both in a worrying and fascinating time. Worrying because human-induced global change has dramatic consequences on biodiversity around the world. Fascinating because these changes enable us to witness evolutionary processes unfolding on relatively short time scales. One such process is biological invasion. An intriguing evolutionary question is to understand which factor facilitates the success of an invasive species. In particular, serial bottlenecks at the expanding front should reduce the effective population size and decrease genetic diversity. Theoretically, this will increase the fixation of deleterious mutations due to the effect of genetic drift and overall affect the evolutionary potential of the invading species. In the short term, reduced genetic diversity and inbreeding in small populations increases the number of recessive deleterious variants exposed in a homozygous state. This may generate a reduction in mean fitness of the population. However, in the long term and under specific demographic scenarios recessive deleterious alleles may be more efficiently removed by purifying selection. Such purging may explain the success of invasion by reducing inbreeding depression and minimizing loss of fitness. Here, Lombaert et al. estimate the genetic load in two invasive insect species, a predator species, the harlequin ladybird (Harmonia axyridis) and a crop pest species, the western corn rootworm (Diabrotica virgifera virgifera). 

The authors smartly took advantage of a pool-seq transcriptome-based exome capture method to estimate genetic load and assess the purge hypothesis using standard population genetic statistics, such as the ratio of nonsynonymous over synonymous expected heterozygosity, the frequency of derived alleles, and their excess or deficit.

The results revealed different patterns in the two species:

In the western corn rootworm, the authors find a clear signal of reduced genetic diversity in invasive populations. This was associated with a slightly reduced genetic load. However, there was only marginal evidence of purging regarding the most deleterious mutations, and in a single population, with moderately deleterious variants being weakly purged, as theoretically expected. 

In the harlequin ladybird, in contrast, the reduction of genetic diversity in invasive populations has been small, a result related to the mild severity of the bottlenecks. In this species, the authors found a tendency toward fixation of the genetic load and no signal of purging. 

Such results are intriguing, showing that different species seem to exhibit contrasted fate of genetic load. Differences in the invasion history and ecology of the species may explain these patterns. This is one of the first studies to use a population genomics approach to study the genetic load associated with biological invasion. Future studies based on whole genome data collected at the individual level across multiple species are needed to better understand the dynamics of genetic load during biological invasion and to draw more general conclusions. Advances in forward simulations may also be used to shed light on the evolution of the genetic load at different stages of the invasion process and under different strengths of bottlenecks.

 

References

Eric Lombaert, Aurelie Blin, Barbara Porro, Thomas Guillemaud, Julio S Bernal, Gary Chang, Natalia Kirichenko, Thomas W Sappington, Stefan Toepfer, Emeline Deleury (2025) Unraveling genetic load dynamics during biological invasion: insights from two invasive insect species. bioRxiv, ver.3 peer-reviewed and recommended by PCI Evol Biol https://doi.org/10.1101/2024.09.02.610743

Unraveling genetic load dynamics during biological invasion: insights from two invasive insect speciesEric Lombaert, Aurelie Blin, Barbara Porro, Thomas Guillemaud, Julio S Bernal, Gary Chang, Natalia Kirichenko, Thomas W Sappington, Stefan Toepfer, Emeline Deleury<p>Many invasive species undergo a significant reduction in genetic diversity, i.e. a genetic bottleneck, in the early stages of invasion. However, this reduction does not necessarily prevent them from achieving considerable ecological success and...Bioinformatics & Computational Biology, Population Genetics / GenomicsQuentin Rougemont2024-09-04 10:35:53 View
12 Nov 2020
article picture

Limits and Convergence properties of the Sequentially Markovian Coalescent

Review and Assessment of Performance of Genomic Inference Methods based on the Sequentially Markovian Coalescent

Recommended by ORCID_LOGO based on reviews by 3 anonymous reviewers

The human genome not only encodes for biological functions and for what makes us human, it also encodes the population history of our ancestors. Changes in past population sizes, for example, affect the distribution of times to the most recent common ancestor (tMRCA) of genomic segments, which in turn can be inferred by sophisticated modelling along the genome.
A key framework for such modelling of local tMRCA tracts along genomes is the Sequentially Markovian Coalescent (SMC) (McVean and Cardin 2005, Marjoram and Wall 2006) . The problem that the SMC solves is that the mosaic of local tMRCAs along the genome is unknown, both in their actual ages and in their positions along the genome. The SMC allows to effectively sum across all possibilities and handle the uncertainty probabilistically. Several important tools for inferring the demographic history of a population have been developed built on top of the SMC, including PSMC (Li and Durbin 2011), diCal (Sheehan et al 2013), MSMC (Schiffels and Durbin 2014), SMC++ (Terhorst et al 2017), eSMC (Sellinger et al. 2020) and others.
In this paper, Sellinger, Abu Awad and Tellier (2020) review these SMC-based methods and provide a coherent simulation design to comparatively assess their strengths and weaknesses in a variety of demographic scenarios (Sellinger, Abu Awad and Tellier 2020). In addition, they used these simulations to test how breaking various key assumptions in SMC methods affects estimates, such as constant recombination rates, or absence of false positive SNP calls.
As a result of this assessment, the authors not only provide practical guidance for researchers who want to use these methods, but also insights into how these methods work. For example, the paper carefully separates sources of error in these methods by observing what they call “Best-case convergence” of each method if the data behaves perfectly and separating that from how the method applies with actual data. This approach provides a deeper insight into the methods than what we could learn from application to genomic data alone.
In the age of genomics, computational tools and their development are key for researchers in this field. All the more important is it to provide the community with overviews, reviews and independent assessments of such tools. This is particularly important as sometimes the development of new methods lacks primary visibility due to relevant testing material being pushed to Supplementary Sections in papers due to space constraints. As SMC-based methods have become so widely used tools in genomics, I think the detailed assessment by Sellinger et al. (2020) is timely and relevant.
In conclusion, I recommend this paper because it bridges from a mere review of the different methods to an in-depth assessment of performance, thereby addressing both beginners in the field who just seek an initial overview, as well as experienced researchers who are interested in theoretical boundaries and assumptions of the different methods.

References

[1] Li, H., and Durbin, R. (2011). Inference of human population history from individual whole-genome sequences. Nature, 475(7357), 493-496. doi: https://doi.org/10.1038/nature10231
[2] Marjoram, P., and Wall, J. D. (2006). Fast"" coalescent"" simulation. BMC genetics, 7(1), 16. doi: https://doi.org/10.1186/1471-2156-7-16
[3] McVean, G. A., and Cardin, N. J. (2005). Approximating the coalescent with recombination. Philosophical Transactions of the Royal Society B: Biological Sciences, 360(1459), 1387-1393. doi: https://doi.org/10.1098/rstb.2005.1673
[4] Schiffels, S., and Durbin, R. (2014). Inferring human population size and separation history from multiple genome sequences. Nature genetics, 46(8), 919-925. doi: https://doi.org/10.1038/ng.3015
[5] Sellinger, T. P. P., Awad, D. A., Moest, M., and Tellier, A. (2020). Inference of past demography, dormancy and self-fertilization rates from whole genome sequence data. PLoS Genetics, 16(4), e1008698. doi: https://doi.org/10.1371/journal.pgen.1008698
[6] Sellinger, T. P. P., Awad, D. A. and Tellier, A. (2020) Limits and Convergence properties of the Sequentially Markovian Coalescent. bioRxiv, 2020.07.23.217091, ver. 3 peer-reviewed and recommended by PCI Evolutionary Biology. doi: https://doi.org/10.1101/2020.07.23.217091
[7] Sheehan, S., Harris, K., and Song, Y. S. (2013). Estimating variable effective population sizes from multiple genomes: a sequentially Markov conditional sampling distribution approach. Genetics, 194(3), 647-662. doi: https://doi.org/10.1534/genetics.112.149096
[8] Terhorst, J., Kamm, J. A., and Song, Y. S. (2017). Robust and scalable inference of population history from hundreds of unphased whole genomes. Nature genetics, 49(2), 303-309. doi: https://doi.org/10.1038/ng.3748

Limits and Convergence properties of the Sequentially Markovian CoalescentThibaut Sellinger, Diala Abu Awad, Aurélien Tellier<p>Many methods based on the Sequentially Markovian Coalescent (SMC) have been and are being developed. These methods make use of genome sequence data to uncover population demographic history. More recently, new methods have extended the original...Population Genetics / GenomicsStephan SchiffelsAnonymous2020-07-25 10:54:48 View
06 Oct 2017
article picture

Evolutionary analysis of candidate non-coding elements regulating neurodevelopmental genes in vertebrates

Combining molecular information on chromatin organisation with eQTLs and evolutionary conservation provides strong candidates for the evolution of gene regulation in mammalian brains

Recommended by ORCID_LOGO based on reviews by Marc Robinson-Rechavi and Charles Danko

In this manuscript [1], Francisco J. Novo proposes candidate non-coding genomic elements regulating neurodevelopmental genes.

What is very nice about this study is the way in which public molecular data, including physical interaction data, is used to leverage recent advances in our understanding to molecular mechanisms of gene regulation in an evolutionary context. More specifically, evolutionarily conserved non coding sequences are combined with enhancers from the FANTOM5 project, DNAse hypersensitive sites, chromatin segmentation, ChIP-seq of transcription factors and of p300, gene expression and eQTLs from GTEx, and physical interactions from several Hi-C datasets. The candidate regulatory regions thus identified are linked to candidate regulated genes, and the author shows their potential implication in brain development.

While the results are focused on a small number of genes, this allows to verify features of these candidates in great detail. This study shows how functional genomics is increasingly allowing us to fulfill the promises of Evo-Devo: understanding the molecular mechanisms of conservation and differences in morphology.

References

[1] Novo, FJ. 2017. Evolutionary analysis of candidate non-coding elements regulating neurodevelopmental genes in vertebrates. bioRxiv, 150482, ver. 4 of Sept 29th, 2017. doi: 10.1101/150482

Evolutionary analysis of candidate non-coding elements regulating neurodevelopmental genes in vertebratesFrancisco J. Novo<p>Many non-coding regulatory elements conserved in vertebrates regulate the expression of genes involved in development and play an important role in the evolution of morphology through the rewiring of developmental gene networks. Available biolo...Genome EvolutionMarc Robinson-Rechavi Marc Robinson-Rechavi, Charles Danko2017-06-29 08:55:41 View