
Latest recommendations

17 Nov 2017

ABC random forests for Bayesian parameter inference

Machine learning methods are useful for Approximate Bayesian Computation in evolution and ecology

Recommended by Michael Blum based on reviews by Dennis Prangle and Michael Blum

It is my pleasure to recommend the paper by Raynal et al. [1] about using random forest for parameter inference. There are two reviews about the paper, one review written by Dennis Prangle and another review written by myself. Both reviews were positive and included comments that have been addressed in the current version of the preprint.

The paper nicely shows that modern machine learning approaches are useful for Approximate Bayesian Computation (ABC) and more generally for simulation-driven parameter inference in ecology and evolution.

The authors propose to use the random forest approach that Meinshausen [2] introduced to perform quantile regression. The numerical implementation of ABC with random forest, available in the abcrf package, is based on the ranger R package, which provides a fast implementation of random forest for high-dimensional data.
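To make the quantile-regression idea concrete, here is a minimal sketch in Python of the final step of a quantile regression forest in the sense of Meinshausen [2]: once the forest has assigned a weight to each simulated parameter value, a posterior quantile is read off the weighted empirical distribution. The function name `weighted_quantile` and the toy weights are illustrative assumptions; this is not the abcrf or ranger API, and the step that derives the weights from the trees is not shown.

```python
def weighted_quantile(responses, weights, q):
    """Return the smallest response whose weighted empirical CDF reaches q.

    responses: simulated parameter values (one per reference-table row).
    weights:   non-negative weights, assumed here to come from a forest
               (for example, how often a row shares a leaf with the
               observed summary statistics); any positive weights work.
    q:         quantile level in [0, 1].
    """
    if not 0.0 <= q <= 1.0:
        raise ValueError("q must lie in [0, 1]")
    total = float(sum(weights))
    if total <= 0.0:
        raise ValueError("weights must have a positive sum")
    pairs = sorted(zip(responses, weights))
    cumulative = 0.0
    for value, weight in pairs:
        cumulative += weight / total
        if cumulative >= q:
            return value
    # Guard against floating-point round-off when q is very close to 1.
    return pairs[-1][0]


# Toy illustration with made-up weights (hypothetical values).
thetas = [4.0, 1.0, 3.0, 2.0]
forest_weights = [4, 1, 3, 2]
median = weighted_quantile(thetas, forest_weights, 0.5)
interval = (weighted_quantile(thetas, forest_weights, 0.025),
            weighted_quantile(thetas, forest_weights, 0.975))
```

The 2.5% and 97.5% weighted quantiles taken together give a 95% interval of the kind discussed in this recommendation.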

According to my reading of the manuscript, there are three main advantages to using random forest (RF) for parameter inference with ABC. The first advantage is that RF can handle many summary statistics, so dimension reduction is not needed.

The second advantage is very nicely displayed in Figure 5, which shows the main result of the paper. If correct, 95% posterior credibility intervals (C.I.) should contain 95% of the parameter values used in simulations. Figure 5 shows that posterior C.I. obtained with rejection are too large compared to other methods. By contrast, C.I. obtained with regression methods are narrower. However, the shrinkage can be excessive for the smallest tolerance rates, with coverage that can be 85% instead of the expected 95%. The attractive property of RF is that its C.I. are also narrower while their coverage is 100%, resulting in a conservative decision about parameter values.
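The coverage check described above can be sketched on a toy problem. The following Python sketch is an illustration under stated assumptions, not the analysis of Raynal et al. [1]. It uses a uniform prior on [0, 10], a Normal(theta, 1) model summarized by the sample mean, and plain rejection ABC. All function names, the tolerance and the table sizes are hypothetical choices.

```python
import random


def simulate_summary(theta, rng, n_obs=20):
    """Summary statistic: sample mean of n_obs draws from Normal(theta, 1)."""
    return sum(rng.gauss(theta, 1.0) for _ in range(n_obs)) / n_obs


def empirical_quantile(sorted_values, q):
    """Simple rank-based quantile of an already sorted list."""
    idx = min(len(sorted_values) - 1, max(0, int(q * len(sorted_values))))
    return sorted_values[idx]


def rejection_interval(s_obs, table, tolerance=0.05, level=0.95):
    """Keep the fraction `tolerance` of simulations closest to s_obs and
    return the central credible interval of the accepted parameters."""
    n_keep = max(2, int(tolerance * len(table)))
    accepted = sorted(table, key=lambda pair: abs(pair[1] - s_obs))[:n_keep]
    thetas = sorted(theta for theta, _ in accepted)
    alpha = (1.0 - level) / 2.0
    return (empirical_quantile(thetas, alpha),
            empirical_quantile(thetas, 1.0 - alpha))


def coverage(n_table=2000, n_test=200, tolerance=0.05, seed=1):
    """Fraction of test data sets whose 95% interval contains the true theta."""
    rng = random.Random(seed)
    # Reference table: (parameter, summary statistic) pairs drawn from the prior.
    table = []
    for _ in range(n_table):
        theta = rng.uniform(0.0, 10.0)
        table.append((theta, simulate_summary(theta, rng)))
    hits = 0
    for _ in range(n_test):
        theta_true = rng.uniform(0.0, 10.0)
        s_obs = simulate_summary(theta_true, rng)
        low, high = rejection_interval(s_obs, table, tolerance)
        hits += low <= theta_true <= high
    return hits / n_test


estimated_coverage = coverage()
```

An estimated coverage clearly above 0.95 indicates intervals that are wider than necessary, and one clearly below 0.95 indicates intervals that have shrunk too far. This is the comparison that Figure 5 makes across methods.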

The last advantage is that no hyperparameter needs to be chosen. It is a parameter-free approach, which is desirable because of the potential difficulty of choosing an appropriate acceptance rate.

The main drawback of the proposed approach concerns joint parameter inference. There are many settings where the joint parameter distribution is of interest, and the proposed RF approach cannot handle that. In population genetics, for example, the severity and the duration of a bottleneck should be estimated jointly because of identifiability issues. The challenge of performing joint parameter inference with RF might constitute a useful research perspective.
 

References
 

[1] Raynal L, Marin J-M, Pudlo P, Ribatet M, Robert CP, Estoup A. 2017. ABC random forests for Bayesian parameter inference. arXiv 1605.05537v4, https://arxiv.org/pdf/1605.05537
[2] Meinshausen N. 2006. Quantile regression forests. Journal of Machine Learning Research 7: 983-999. http://www.jmlr.org/papers/v7/meinshausen06a.html

ABC random forests for Bayesian parameter inference
Authors: Louis Raynal, Jean-Michel Marin, Pierre Pudlo, Mathieu Ribatet, Christian P. Robert, Arnaud Estoup
Abstract: This preprint has been reviewed and recommended by Peer Community In Evolutionary Biology (http://dx.doi.org/10.24072/pci.evolbiol.100036). Approximate Bayesian computation (ABC) has grown into a standard methodology that manages Bayesian infer...
Thematic fields: Bioinformatics & Computational Biology, Evolutionary Applications, Other, Population Genetics / Genomics
Recommender: Michael Blum
Submission date: 2017-07-06 07:42:00
13 Dec 2016
POSTPRINT

A supergene determines highly divergent male reproductive morphs in the ruff

Supergene Control of a Reproductive Polymorphism

Recommended by and

Two back-to-back papers published earlier this year in Nature Genetics provide compelling evidence for the control of a male reproductive polymorphism in a wading bird by a "supergene", a cluster of tightly linked genes [1-2]. The bird in question, the ruff (Philomachus pugnax), has a rather unusual reproductive system that consists of three distinct types of males ("reproductive morphs"): aggressive "independents" who represent the majority of males; a smaller fraction of non-territorial "satellites" who are submissive towards "independents"; and "faeders" who mimic females and are rare. Previous work has shown that the male morphs differ in major aspects of mating and aggression behavior, plumage coloration and body size, and that – intriguingly – this complex multi-trait polymorphism is apparently controlled by a single autosomal Mendelian locus with three alleles [3]. To uncover the genetic control of this polymorphism, two independent teams, led by Terry Burke [1] and Leif Andersson [2], set out to analyze the genomes of male ruffs. Using a combination of genomics and genetics, both groups managed to pin down the supergene locus and map it to a non-recombining, 4.5-Mb inversion that arose 3.8 million years ago. While "independents" are homozygous for the ancestral uninverted sequence, "satellites" and "faeders" carry evolutionarily divergent, dominant alternative haplotypes of the inversion. Thus, as in several other notable cases, for example the supergene control of disassortative mating, aggressiveness and plumage color in white-throated sparrows [4], of mimicry in Heliconius and Papilio butterflies [5-6], or of social structure in ants [7], an inversion – behaving as a single "locus" – underpins the mechanistic basis of the supergene.
More generally, and beyond inversions, a growing number of studies now show that selection can favor the evolution of suppressed recombination, thereby leading to the emergence of clusters of tightly linked loci which can then control – presumably due to polygenic gene action – a suite of complex phenotypes [8-10]. A largely unresolved question in this field concerns the identity of the causative alleles and loci within a given supergene. Recent progress on this question has been made, for example, in Papilio polytes butterflies, where a mimicry supergene has been found to involve – surprisingly – only a single but large gene: multiple mimicry alleles in the doublesex gene are maintained in strong linkage disequilibrium via an inversion [6]. It will clearly be of great interest to see future examples of such a fine-scale genetic dissection of supergenes. In conclusion, we were impressed by the data and analyses of Küpper et al. [1] and Lamichhaney et al. [2]: both papers beautifully illustrate how genomics and evolutionary ecology can be combined to make new, exciting discoveries. Both papers will appeal to readers with an interest in supergenes, inversions, the interplay of selection and recombination, or the genetic control of complex phenotypes.

References

[1] Küpper C, Stocks M, Risse JE, dos Remedios N, Farrell LL, McRae SB, Morgan TC, Karlionova N, Pinchuk P, Verkuil YI, et al. 2016. A supergene determines highly divergent male reproductive morphs in the ruff. Nature Genetics 48:79-83. doi: 10.1038/ng.3443

[2] Lamichhaney S, Fan G, Widemo F, Gunnarsson U, Thalmann DS, Hoeppner MP, Kerje S, Gustafson U, Shi C, Zhang H, et al. 2016. Structural genomic changes underlie alternative reproductive strategies in the ruff (Philomachus pugnax). Nature Genetics 48:84-88. doi: 10.1038/ng.3430

[3] Lank DB, Smith CM, Hanotte O, Burke T, Cooke F. 1995. Genetic polymorphism for alternative mating behaviour in lekking male ruff Philomachus pugnax. Nature 378:59-62. doi: 10.1038/378059a0

[4] Tuttle Elaina M, Bergland Alan O, Korody Marisa L, Brewer Michael S, Newhouse Daniel J, Minx P, Stager M, Betuel A, Cheviron Zachary A, Warren Wesley C, et al. 2016. Divergence and Functional Degradation of a Sex Chromosome-like Supergene. Current Biology 26:344-350. doi: 10.1016/j.cub.2015.11.069

[5] Joron M, Frezal L, Jones RT, Chamberlain NL, Lee SF, Haag CR, Whibley A, Becuwe M, Baxter SW, Ferguson L, et al. 2011. Chromosomal rearrangements maintain a polymorphic supergene controlling butterfly mimicry. Nature 477:203-206. doi: 10.1038/nature10341

[6] Kunte K, Zhang W, Tenger-Trolander A, Palmer DH, Martin A, Reed RD, Mullen SP, Kronforst MR. 2014. doublesex is a mimicry supergene. Nature 507:229-232. doi: 10.1038/nature13112

[7] Wang J, Wurm Y, Nipitwattanaphon M, Riba-Grognuz O, Huang Y-C, Shoemaker D, Keller L. 2013. A Y-like social chromosome causes alternative colony organization in fire ants. Nature 493:664-668. doi: 10.1038/nature11832

[8] Thompson MJ, Jiggins CD. 2014. Supergenes and their role in evolution. Heredity 113:1-8. doi: 10.1038/hdy.2014.20

[9] Schwander T, Libbrecht R, Keller L. 2014. Supergenes and Complex Phenotypes. Current Biology 24:R288-R294. doi: 10.1016/j.cub.2014.01.056

[10] Charlesworth D. 2015. The status of supergenes in the 21st century: recombination suppression in Batesian mimicry and sex chromosomes and other complex adaptations. Evolutionary Applications 9:74-90. doi: 10.1111/eva.12291

A supergene determines highly divergent male reproductive morphs in the ruff
Authors: Küpper C, Stocks M, Risse JE, dos Remedios N, Farrell LL, McRae SB, Morgan TC, Karlionova N, Pinchuk P, Verkuil YI, et al.
Abstract: Three strikingly different alternative male mating morphs (aggressive 'independents', semicooperative 'satellites' and female-mimic 'faeders') coexist as a balanced polymorphism in the ruff, *Philomachus pugnax*, a lek-breeding wading bird...
Thematic fields: Adaptation, Genotype-Phenotype, Life History, Population Genetics / Genomics, Reproduction and Sex
Recommender: Thomas Flatt
Submission date: 2016-12-13 17:28:13
22 May 2017

Can Ebola Virus evolve to be less virulent in humans?

A new hypothesis to explain Ebola's high virulence

Recommended by and based on reviews by Virginie Ravigné and François Blanquart

 

The tragic 2014-2016 Ebola outbreak, which resulted in more than 28,000 cases and 11,000 deaths in West Africa [1], came as a surprise to the scientific community. Before 2013, the Ebola virus (EBOV) was known to produce recurrent outbreaks in remote villages near tropical rainforests in Central Africa, never exceeding a few hundred cases but with very high virulence. Both EBOV’s ability to circulate for several months in large urban human populations and its high mutation rate suggest that EBOV’s virulence could evolve and to some extent adapt to human hosts [2]. Up to now, the high virulence of EBOV in humans was generally thought to be maladaptive, the virus being adapted to circulating in wild animal populations (e.g. fruit bats [3]). As a logical consequence, EBOV virulence could be expected to decrease during long epidemics in humans. The present paper by Sofonea et al. [4] challenges this view and explores how, given EBOV’s life cycle and known epidemiological parameters, virulence is expected to evolve in the human host during long epidemics. The main finding of the paper is that EBOV’s virulence is not expected to decrease in either the short or the long term. The main underlying mechanism is that EBOV is also transmitted by dead bodies, which limits the cost of virulence. In itself, the idea that selection should favor higher virulence in diseases that are also transmitted after host death will sound intuitive to most evolutionary epidemiologists. The accomplishment of the paper is to make a very strong case that the parameter range where virulence could decrease is very small. The paper further provides scientifically grounded arguments in favor of the safe management of corpses. Safe burial of corpses is culturally difficult to impose. The present paper shows that, in addition to immediately decreasing the spread of the virus, safe burial may limit virulence increase in the short term and favor less virulent strains in the long term.
Altogether these results make a timely and important contribution to the knowledge and understanding of EBOV.

References

[1] World Health Organization. 2016. WHO: Ebola situation report - 10 June 2016.

[2] Kupferschmidt K. 2014. Imagining Ebola’s next move. Science 346: 151–152. doi: 10.1126/science.346.6206.151

[3] Leroy EM, Kumulungui B, Pourrut X, Rouquet P, Hassanin A, Yaba P, Délicat A, Paweska, Gonzalez JP and Swanepoel R. 2005. Fruit bats as reservoirs of Ebola virus. Nature 438: 575–576. doi: 10.1038/438575a

[4] Sofonea MT, Aldakak L, Boullosa LFVV and Alizon S. 2017. Can Ebola Virus evolve to be less virulent in humans? bioRxiv 108589, ver. 3 of 19th May 2017; doi: 10.1101/108589

Can Ebola Virus evolve to be less virulent in humans?
Authors: Mircea T. Sofonea, Lafi Aldakak, Luis Fernando Boullosa, Samuel Alizon
Abstract: Understanding Ebola Virus (EBOV) virulence evolution is not only timely but also raises specific questions because it causes one of the most virulent human infections and it is capable of transmission after the death of its host. Using a compartme...
Thematic fields: Evolutionary Epidemiology
Recommender: Virginie Ravigné
Submission date: 2017-02-15 13:25:58