Recommendation

An evolutionary view of a biomedically important gene family

Kateryna Makova based on reviews by 2 anonymous reviewers

A recommendation of:

Evolution of the DAN gene family in vertebrates

Juan C. Opazo, Federico G. Hoffmann, Kattina Zavala, Scott V. Edwards (2020), bioRxiv, 794404, ver. 3 recommended and peer-reviewed by Peer Community In Evolutionary Biology https://doi.org/10.1101/794404

Read preprint in preprint server Now published in a journal

Data used for results

Abstract

EN

AR

ES

FR

HI

JA

PT

RU

ZH-CN

Evolution of the DAN gene family in vertebrates

The DAN gene family (DAN, Differential screening-selected gene Aberrant in Neuroblastoma) is a group of genes that is expressed during development and plays fundamental roles in limb bud formation and digitation, kidney formation and morphogenesis and left-right axis specification. During adulthood the expression of these genes are associated with diseases, including cancer. Although most of the attention to this group of genes has been dedicated to understanding its role in physiology and development, its evolutionary history remains poorly understood. Thus, the goal of this study is to investigate the evolutionary history of the DAN gene family in vertebrates, with the objective of complementing the already abundant physiological information with an evolutionary context. Our results recovered the monophyly of all DAN gene family members and divide them into five main groups. In addition to the well-known DAN genes, our phylogenetic results revealed the presence of two new DAN gene lineages; one is only retained in cephalochordates, whereas the other one (GREM3) was only identified in cartilaginous fish, holostean fish, and coelacanth. According to the phyletic distribution of the genes, the ancestor of gnathostomes possessed a repertoire of eight DAN genes, and during the radiation of the group GREM1, GREM2, SOST, SOSTDC1, and NBL1 were retained in all major groups, whereas, GREM3, CER1, and DAND5 were differentially lost.

gene family evolution; cerberus; differential retention; evolutionary medicine; evolutionary slowdown; gremlin.

This is an automatically generated version. The authors and PCI decline all responsibility concerning its content

تطور عائلة الجينات DAN في الفقاريات

إن عائلة جينات DAN (DAN، الجين المُختار بالفحص التفاضلي الشاذ في ورم الخلايا البدائية العصبية) هي مجموعة من الجينات التي يتم التعبير عنها أثناء التطور وتلعب أدوارًا أساسية في تكوين برعم الأطراف ورقمها، وتكوين الكلى وتشكلها ومواصفات المحور الأيسر والأيمن. . خلال مرحلة البلوغ، يرتبط التعبير عن هذه الجينات بالأمراض، بما في ذلك السرطان. على الرغم من أن معظم الاهتمام بهذه المجموعة من الجينات قد تم تخصيصه لفهم دورها في علم وظائف الأعضاء والنمو، إلا أن تاريخها التطوري لا يزال غير مفهوم بشكل جيد. وبالتالي، فإن الهدف من هذه الدراسة هو دراسة التاريخ التطوري لعائلة جينات DAN في الفقاريات، بهدف استكمال المعلومات الفسيولوجية الوفيرة بالفعل بسياق تطوري. استعادت نتائجنا أحادية جميع أفراد عائلة الجينات DAN وقسمتهم إلى خمس مجموعات رئيسية. بالإضافة إلى جينات DAN المعروفة، كشفت نتائج النشوء والتطور لدينا عن وجود سلالتين جديدتين من جينات DAN؛ يتم الاحتفاظ بواحد فقط في الرأسيات الحبليات، في حين تم تحديد الآخر (GREM3) فقط في الأسماك الغضروفية، والأسماك الهولوستينية، والسيلكانث. وفقًا للتوزيع السليلي للجينات، امتلك سلف الجناثوستوم ذخيرة من ثمانية جينات DAN، وأثناء تشعيع المجموعة GREM1، وGREM2، وSOST، وSOSTDC1، وNBL1، تم الاحتفاظ بها في جميع المجموعات الرئيسية، في حين تم الاحتفاظ بـ GREM3، وCER1 ، وDAND5 تم فقدهما بشكل تفاضلي.

تطور عائلة الجينات. سيربيروس. الاحتفاظ التفاضلي الطب التطوري؛ التباطؤ التطوري. شبح.

This is an automatically generated version. The authors and PCI decline all responsibility concerning its content

Evolución de la familia de genes DAN en vertebrados

La familia de genes DAN (DAN, gen aberrante seleccionado para detección diferencial en neuroblastoma) es un grupo de genes que se expresa durante el desarrollo y desempeña funciones fundamentales en la formación y digitación de las yemas de las extremidades, la formación y morfogénesis de los riñones y la especificación del eje izquierda-derecha. . Durante la edad adulta la expresión de estos genes se asocia con enfermedades, incluido el cáncer. Aunque la mayor parte de la atención prestada a este grupo de genes se ha dedicado a comprender su papel en la fisiología y el desarrollo, su historia evolutiva sigue siendo poco conocida. Así, el objetivo de este estudio es investigar la historia evolutiva de la familia de genes DAN en vertebrados, con el objetivo de complementar la ya abundante información fisiológica con un contexto evolutivo. Nuestros resultados recuperaron la monofilia de todos los miembros de la familia del gen DAN y los dividieron en cinco grupos principales. Además de los genes DAN bien conocidos, nuestros resultados filogenéticos revelaron la presencia de dos nuevos linajes de genes DAN; uno solo se retiene en cefalocordados, mientras que el otro (GREM3) solo se identificó en peces cartilaginosos, peces holosteos y celacanto. Según la distribución filética de los genes, el antepasado de los gnatóstomos poseía un repertorio de ocho genes DAN, y durante la radiación del grupo GREM1, GREM2, SOST, SOSTDC1 y NBL1 se conservaron en todos los grupos principales, mientras que GREM3, CER1 y DAND5 se perdieron diferencialmente.

evolución de la familia de genes; cerbero; retención diferencial; medicina evolutiva; desaceleración evolutiva; duendecillo.

This is an automatically generated version. The authors and PCI decline all responsibility concerning its content

Evolution de la famille des gènes DAN chez les vertébrés

La famille de gènes DAN (DAN, Differential Screening-selected Gene Aberrant in Neuroblastoma) est un groupe de gènes qui s'expriment au cours du développement et jouent un rôle fondamental dans la formation et la digitalisation des bourgeons des membres, la formation et la morphogenèse des reins et la spécification de l'axe gauche-droite. . À l’âge adulte, l’expression de ces gènes est associée à des maladies, notamment le cancer. Bien que l’essentiel de l’attention portée à ce groupe de gènes ait été consacré à la compréhension de son rôle dans la physiologie et le développement, son histoire évolutive reste mal comprise. Ainsi, le but de cette étude est d'étudier l'histoire évolutive de la famille des gènes DAN chez les vertébrés, dans le but de compléter les informations physiologiques déjà abondantes avec un contexte évolutif. Nos résultats ont récupéré la monophylie de tous les membres de la famille des gènes DAN et les ont divisés en cinq groupes principaux. En plus des gènes DAN bien connus, nos résultats phylogénétiques ont révélé la présence de deux nouvelles lignées de gènes DAN ; l'un n'est retenu que chez les céphalochordés, tandis que l'autre (GREM3) n'a été identifié que chez les poissons cartilagineux, les poissons holostéens et le cœlacanthe. Selon la distribution phylétique des gènes, l'ancêtre des gnathostomes possédait un répertoire de huit gènes DAN et, lors de l'irradiation du groupe, GREM1, GREM2, SOST, SOSTDC1 et NBL1 étaient conservés dans tous les groupes majeurs, alors que GREM3, CER1 , et DAND5 ont été différentiellement perdus.

évolution de la famille des gènes ; cerbère; rétention différentielle; médecine évolutionniste; ralentissement évolutif ; diablotin.

This is an automatically generated version. The authors and PCI decline all responsibility concerning its content

कशेरुकियों में DAN जीन परिवार का विकास

डीएएन जीन परिवार (डीएएन, न्यूरोब्लास्टोमा में डिफरेंशियल स्क्रीनिंग-चयनित जीन एबर्रैंट) जीन का एक समूह है जो विकास के दौरान व्यक्त होता है और अंग कली निर्माण और डिजिटलीकरण, किडनी गठन और मॉर्फोजेनेसिस और बाएं-दाएं अक्ष विनिर्देश में मौलिक भूमिका निभाता है। . वयस्कता के दौरान इन जीनों की अभिव्यक्ति कैंसर सहित बीमारियों से जुड़ी होती है। यद्यपि जीन के इस समूह पर अधिकांश ध्यान शरीर विज्ञान और विकास में इसकी भूमिका को समझने के लिए समर्पित किया गया है, लेकिन इसके विकासवादी इतिहास को कम समझा गया है। इस प्रकार, इस अध्ययन का लक्ष्य कशेरुकियों में डीएएन जीन परिवार के विकासवादी इतिहास की जांच करना है, जिसका उद्देश्य पहले से ही प्रचुर शारीरिक जानकारी को विकासवादी संदर्भ के साथ पूरक करना है। हमारे परिणामों ने सभी DAN जीन परिवार के सदस्यों की मोनोफिली को पुनः प्राप्त किया और उन्हें पांच मुख्य समूहों में विभाजित किया। प्रसिद्ध DAN जीन के अलावा, हमारे फ़ाइलोजेनेटिक परिणामों से दो नए DAN जीन वंशों की उपस्थिति का पता चला; एक केवल सेफलोकॉर्डेट्स में बरकरार रहता है, जबकि दूसरा (GREM3) केवल कार्टिलाजिनस मछली, होलोस्टियन मछली और कोलैकैंथ में पहचाना गया था। जीन के फ़ाइलेटिक वितरण के अनुसार, ग्नथोस्टोम्स के पूर्वज के पास आठ DAN जीनों का भंडार था, और समूह GREM1, GREM2, SOST, SOSTDC1 और NBL1 के विकिरण के दौरान सभी प्रमुख समूहों में बनाए रखा गया था, जबकि, GREM3, CER1 , और DAND5 भिन्न रूप से खो गए।

जीन परिवार विकास; सेर्बेरस; विभेदक प्रतिधारण; विकासवादी चिकित्सा; विकासवादी मंदी; ग्रेमलिन.

This is an automatically generated version. The authors and PCI decline all responsibility concerning its content

脊椎動物におけるDAN遺伝子ファミリーの進化

DAN 遺伝子ファミリー (DAN、神経芽腫におけるディファレンシャルスクリーニング選択遺伝子異常) は、発生中に発現し、四肢芽の形成と指形成、腎臓の形成と形態形成、左右軸の指定において基本的な役割を果たす遺伝子群です。。成人期におけるこれらの遺伝子の発現は、がんなどの病気に関連します。この遺伝子グループに対する注目のほとんどは、生理学と発生におけるその役割を理解することに向けられてきましたが、その進化の歴史は依然としてほとんど理解されていません。したがって、この研究の目的は、すでに豊富な生理学的情報を進化の文脈で補完することを目的として、脊椎動物におけるDAN遺伝子ファミリーの進化の歴史を調査することです。私たちの結果は、すべての DAN 遺伝子ファミリーメンバーの単系統性を回復し、それらを 5 つの主要なグループに分類しました。よく知られている DAN 遺伝子に加えて、系統発生学的結果から 2 つの新しい DAN 遺伝子系統の存在が明らかになりました。 1 つは頭索動物にのみ保持され、もう 1 つは軟骨魚類、ホロステイン魚類、およびシーラカンスでのみ確認されました。遺伝子の系統分布によれば、顎口類の祖先は 8 つの DAN 遺伝子のレパートリーを有しており、グループの放射線照射中、GREM1、GREM2、SOST、SOSTDC1、および NBL1 はすべての主要グループで保持されていたのに対し、GREM3、CER1 は保持されていました。、および DAND5 は差分的に失われました。

遺伝子ファミリーの進化。ケルベロス。保持力の差。進化医学。進化の減速。グレムリン。

This is an automatically generated version. The authors and PCI decline all responsibility concerning its content

Evolução da família de genes DAN em vertebrados

A família de genes DAN (DAN, gene Aberrant in Neuroblastoma selecionado por triagem diferencial) é um grupo de genes que é expresso durante o desenvolvimento e desempenha papéis fundamentais na formação e digitação de botões de membros, formação e morfogênese renal e especificação do eixo esquerdo-direito . Durante a idade adulta a expressão destes genes está associada a doenças, incluindo o cancro. Embora a maior parte da atenção a este grupo de genes tenha sido dedicada à compreensão do seu papel na fisiologia e no desenvolvimento, a sua história evolutiva permanece pouco compreendida. Assim, o objetivo deste estudo é investigar a história evolutiva da família de genes DAN em vertebrados, com o objetivo de complementar a já abundante informação fisiológica com um contexto evolutivo. Nossos resultados recuperaram a monofilia de todos os membros da família do gene DAN e os dividiram em cinco grupos principais. Além dos genes DAN bem conhecidos, nossos resultados filogenéticos revelaram a presença de duas novas linhagens de genes DAN; um só é retido em cefalocordados, enquanto o outro (GREM3) só foi identificado em peixes cartilaginosos, peixes holosteanos e celacantos. De acordo com a distribuição filética dos genes, o ancestral dos gnatóstomos possuía um repertório de oito genes DAN, e durante a radiação do grupo GREM1, GREM2, SOST, SOSTDC1 e NBL1 foram retidos em todos os grupos principais, enquanto GREM3, CER1 , e DAND5 foram perdidos diferencialmente.

evolução da família genética; cérbero; retenção diferencial; medicina evolutiva; desaceleração evolutiva; Gremlin.

This is an automatically generated version. The authors and PCI decline all responsibility concerning its content

Эволюция семейства генов DAN у позвоночных

Семейство генов DAN (DAN, аберрантный ген, выбранный при дифференциальном скрининге в нейробластоме) представляет собой группу генов, которые экспрессируются во время развития и играют фундаментальную роль в формировании и пальцевых зачатках конечностей, формировании и морфогенезе почек, а также спецификации оси левая-правая. . Во взрослом возрасте экспрессия этих генов связана с заболеваниями, включая рак. Хотя большая часть внимания к этой группе генов была посвящена пониманию ее роли в физиологии и развитии, ее эволюционная история остается плохо изученной. Таким образом, цель этого исследования — изучить эволюционную историю семейства генов DAN у позвоночных с целью дополнения и без того обильной физиологической информации эволюционным контекстом. Наши результаты восстановили монофилию всех членов семейства генов DAN и разделили их на пять основных групп. В дополнение к хорошо известным генам DAN наши филогенетические результаты показали наличие двух новых линий генов DAN; один сохраняется только у головохордовых, тогда как другой (GREM3) идентифицирован только у хрящевых рыб, голостевых рыб и латимерии. Судя по филетическому распределению генов, предок челюстноротых обладал репертуаром из восьми генов DAN, и при радиации группы GREM1, GREM2, SOST, SOSTDC1 и NBL1 сохранились во всех основных группах, тогда как GREM3, CER1 , и DAND5 были потеряны по-разному.

эволюция семейства генов; цербер; дифференциальное удержание; эволюционная медицина; эволюционное замедление; гремлин.

This is an automatically generated version. The authors and PCI decline all responsibility concerning its content

脊椎动物 DAN 基因家族的进化

DAN基因家族（DAN，Differentialscreening-selectedgeneAberrantinNeuroblastoma）是一组在发育过程中表达的基因，在肢芽形成和指状化、肾脏形成和形态发生以及左右轴规范中发挥重要作用。在成年期间，这些基因的表达与疾病有关，包括癌症。尽管对这组基因的大部分关注都致力于了解其在生理和发育中的作用，但其进化历史仍然知之甚少。因此，本研究的目的是调查脊椎动物 DAN 基因家族的进化史，目的是用进化背景补充已经丰富的生理信息。我们的结果恢复了所有 DAN 基因家族成员的单系性，并将它们分为五个主要组。除了众所周知的 DAN 基因外，我们的系统发育结果还揭示了两个新的 DAN 基因谱系的存在；一种仅在头索动物中保留，而另一种 (GREM3) 仅在软骨鱼、全骨鱼和腔棘鱼中发现。根据基因的系统分布，颌口动物的祖先拥有8个DAN基因，在群体的辐射过程中GREM1、GREM2、SOST、SOSTDC1和NBL1在所有主要群体中均保留，而GREM3、CER1 、DAND5 有差异性丢失。

基因家族进化；地狱犬；差异保留；进化医学；进化放缓；格林姆林。

Submission: posted 15 October 2019
Recommendation: posted 23 July 2020, validated 27 July 2020

Cite this recommendation as:
Makova, K. (2020) An evolutionary view of a biomedically important gene family. Peer Community in Evolutionary Biology, 100104. https://doi.org/10.24072/pci.evolbiol.100104

Recommendation

This manuscript [1] investigates the evolutionary history of the DAN gene family—a group of genes important for embryonic development of limbs, kidneys, and left-right axis speciation. This gene family has also been implicated in a number of diseases, including cancer and nephropathies. DAN genes have been associated with the inhibition of the bone morphogenetic protein (BMP) signaling pathway. Despite this detailed biochemical and functional knowledge and clear importance for development and disease, evolution of this gene family has remained understudied. The diversification of this gene family was investigated in all major groups of vertebrates. The monophyly of the gene members belonging to this gene family was confirmed. A total of five clades were delineated, and two novel lineages were discovered. The first lineage was only retained in cephalochordates (amphioxus), whereas the second one (GREM3) was retained by cartilaginous fish, holostean fish, and coelanth. Moreover, the patterns of chromosomal synteny in the chromosomal regions harboring DAN genes were investigated. Additionally, the authors reconstructed the ancestral gene repertoires and studied the differential retention/loss of individual gene members across the phylogeny. They concluded that the ancestor of gnathostome vertebrates possessed eight DAN genes that underwent differential retention during the evolutionary history of this group. During radiation of vertebrates, GREM1, GREM2, SOST, SOSTDC1, and NBL1 were retained in all major vertebrate groups. At the same time, GREM3, CER1, and DAND5 were differentially lost in some vertebrate lineages. At least two DAN genes were present in the common ancestor of vertebrates, and at least three DAN genes were present in the common ancestor of chordates. Therefore the patterns of retention and diversification in this gene family appear to be complex. Evolutionary slowdown for the DAN gene family was observed in mammals, suggesting selective constraints. Overall, this article puts the biomedical importance of the DAN family in the evolutionary perspective.

References

[1] Opazo JC, Hoffmann FG, Zavala K, Edwards SV (2020) Evolution of the DAN gene family in vertebrates. bioRxiv, 794404, ver. 3 peer-reviewed and recommended by PCI Evolutionary Biology. doi: 10.1101/794404

PDF recommendation

Conflict of interest:
The recommender in charge of the evaluation of the article and the reviewers declared that they have no conflict of interest (as defined in the code of conduct of PCI) with the authors or with the content of the article. The authors declared that they comply with the PCI rule of having no financial conflicts of interest in relation to the content of the article.

Funding:
no declaration

Reviews

Evaluation round #1

DOI or URL of the preprint: 10.1101/794404

Version of the preprint: 1

Author's Reply, 11 Jun 2020

Download author's reply Download tracked changes file https://doi.org/10.24072/pci.evolbiol.100221.ar1

Decision by Kateryna Makova, posted 14 Jan 2020

Please revise the manuscript according to the reviewers' suggestions.

https://doi.org/10.24072/pci.evolbiol.100221.d1

Reviewed by anonymous reviewer 2, 13 Jan 2020

Review for "Evolution of the DAN gene family in vertebrates"

In this manuscript by Opazzo et al., the authors use homology searches to identify genes from the DAN gene family (Differential screening-selected gene Aberrant in Neuroblastoma) across chordate lineages. The phylogenetic relationships of these genes were inferred and the toplogy of the resulting tree was used to describe the evolutionary history of the gene family.

Interestingly, the authors identify a new family member related to the Gremlin genes, which they dub Grem3. Next, in the Gnathostome lineage, the authors show evidence for five genes being present in its MRCA that are also widely retained across its descendents (e.g. the major Gnathosome lineages listed in figure 4). These genes include Grem1, Grem2, SOST, SOSTDC1, and NBL1. The authors also identify 3 gene family members that they conclude are likely in the gnathostome ancestor, but have experienced loss in some of the ancestors: Grem3, Cer1, and DAND5.

Over all, the manuscript is well-written and lays out its case fairly well. And for the most part, I find the major arguments to be reasonable. However, there are a lot of areas that I feel would benefit from feedback described here.

Major comments

The methods are insufficiently detailed to permit the work to be repeated

The authors do not define the pool of sequences from which query and subject sequences are drawn. The specific implementation of blast and its version isn't cited. The filtering criteria used to determine whether hits are retained or discarded are not documented. The nature of the multiple alignment wasn't described. How much of the genes were alignable at the greatest divergences? In the introduction, the authors claim that there is "low inter-parallog conservation", indicating that the alignment may not be reliable in many regions. What was aligned? Nucleotides or amino acids (I assume amino acids)?
The results are fairly sparse on details

For example, display items aren't thoroughly described. The captions are very terse. For example, there appears to be a convention in the synteny plots where the absence of a bar indicates the absense of the gene (ag CER1 in Spotted Gar in Figure 2B). However, in Figures 5 and 6, dotted lines apparently indicate missing DAN genes but missing bars for flanking genes means that the gene isn't in the syntenic region. What is the scale in Figure 1? A bar with the number "0.7" is included. The caption doesn't elaborate. I'm accustomed to bootstrap support to be reported in Numerator/Denominator or explicity in %. The numbers corresponding to bootstrap support in Figure 1 are just bare integers.
The authors often point out disagreements with the literature, which is commendable. However, little effort is made to reconcile these observed disagreements. I'd feel better if the authors would discuss the discrepancies they point out.

Examples:
"Although the study of Walsh et al. (2010) supports..., two other studies report alternative topologies."
"Nolan et al. (2014) recovered NBL1 as sister... However, in support of our study Avsian-Kretchmer et al. (2004) recovered NBL1 as sister to the GREM lineages."
"However, in contrast to Petillon et al. (2013), we did not find..."
The claim of "recovering monophyly" is confusing to me.
"Our results recovered monophyly of all DAN gene family members"

My parsing of this statement in the abstract (and others like it throughout the manuscript) is probably not what the authors intended. To me, this sounds like "we confirmed that, as a group, all DAN genes are monophyletic". This doesn't make sense in an analysis where the recovery of a gene from EnsEMBL is viewed as conferring DAN membership on that gene. So, by definition, every gene in the analysis is DAN, and with no non-DAN genes for contrast, no determination about monophyly can be made.

While I can't confidently interpolate what the authors actually meant, perhaps the following is closer to the authors' meaning:

"For each member of the gene family (e.g. CER1, SOST, SOSTDC1, DAND5, NBL1, GREM1, GREM2, and a new member, GREM5), the group of species sequences corresponding to each gene is monophyletic."

Even this formulation is a bit confusing to me, as the monophyly seems to be how the authors would assign a particular sequence in a particular species to particular family member. And in any event, this gets a bit muddied when there is gene duplication. What is monophyly when for some taxa, there are duplicates, and others, there aren't? Is "recovery of monophyly" a result as implied by the authors? Or rather is it part of how the authors are classifying the sequences into family members like CER1, etc.?

Perhaps this "recovery of monophyly" could be reconciled if the authors inferred the full duplication history with synteny for every species they examined and then layered the phylogenetic analysis of the gene family on top of that. But, as far as I can tell, this was not the strategy the authors followed in most cases.

Finally, DAND5 doesn't appear to offer strong support for monophyly given that lack of support for placing the Coelacanth as sister to the other DAND5 genes. The strong synteny argument doesn't change this assessment, as it could be a brute fact that the Coelacanth sequence is simultaneously the DAND5 ortholog and there is no strong evidence of monophyly with the remaining DAND5 orthologs.
One comment relating to paralogy confused me.

"The fourth clade corresponds to the NBL1 gene, the founding member of the DAN gene family, and was recovered as monophyletic with strong support (pink clade; Fig. 1)."

This way of discussing paralogy (ie "founding member") seems clumsy to me. Barring clear mechanistic reasons to assign one paralog the label "founder" or "parent" (e.g. the template for the RNA in retrogenes or the copy maintaining the ancestral structure in a chimeric duplicate), immediately after duplication, the copies are provisionally assumed to be redundant. And as such, it would only be confusing to label one member the "founding member". The authors even discuss this in relation to the putative redundancy between DAND5 and CER1.
The discussion of cancer on pages 14 and 15 isn't well-integrated into the rest of the manuscript. The reference to RPRM and p53 in particular seems like it could be better incorporated into the narrative of the manuscript. Personally, I'd recommend dropping it, but a smoother integration could also work.
In a manuscript like this one, I would like to see more in depth discussion of sources of error. The task the authors set before themselves is quite ambitious and requires marshaling a lot of data from many genes across many different taxa. These taxa were sequenced by different groups, at different times, with different technology, exhibit different levels of contiguity and likely accuracy and completeness, etc. Sources of error can include errors in multiple alignment, misannotation of the genes, and evolution in gene structure, all of which can lead to aligned non-homologous residues. Moreover, low assembly or annotation completeness can lead to missing genes.

Minor comments

Why use the common name "elephant fish" when there is an "elephant fish" in both Actinopterygii and Chondrichthyes? Perhaps "elephant shark" would be better?
Why didn't the authors use Rhincodon typus (whale shark: https://www.ncbi.nlm.nih.gov/assembly/GCF_001642345.1/) in the analysis? It has a Genbank annotation and appears to be more contiguous than the elephant shark. Also, since this manuscript was posted, there is now a much better Chondrichthyes genome (Pristis pectinata): https://www.ncbi.nlm.nih.gov/assembly/?term=Pristis+pectinata

Perhaps either of these two could be valuable in the analysis.
Typo of DAND5: DADN5
Perhaps a labeled, high-level phylogeny would be useful in orienting the readers. One like Figure 1 in this would be a great service to the reader:

http://dx.doi.org/10.1016/j.cub.2017.02.029
Is Urochordate / Urochordata still in common use?

https://doi.org/10.24072/pci.evolbiol.100221.rev11

Reviewed by anonymous reviewer 1, 10 Dec 2019

This is a nice reconstruction of the evolution of a complex gene family, the DAN gene family. The authors show strong supporting evidence for the monophyly of 5 major groups and the inter-group relationships among them. While it is useful to see the information about this gene family all together, the novelty of this study is unclear as the authors often refer to previous literature that shows comparable, albeit partial, results.

Minor comments:
1. the authors should provide more information about the alignments produced (length, % gaps).
2. the authors used an evaluation of likelihood scores to determine convergence of the bayesian phylogenetic reconstruction. Although I generally agree with the authors that this method should produce accurate results, most researchers rely on the estimation of ESS values to determine convergence. It would be useful to know how the ESS values correlate with the number of generations required to reach an asymptotic trend in likelihood scores.
3. At the end of the page with the section entitled "Definition of ancestral gene repertoires" the authors state that the "lack of DAND5 in the elephant fish is an artifact of the current genome assembly". Please provide an explanation for this statement.
4.figure 2 and 3: what is the meaning of the double lines associated to some genes? Also, the grey lines represent intervening genes but no information is provided on how large these intervening sections of DNA may be. Depending on the size, they could be affecting the definition of synteny so more information is necessary to support the conclusions based on synteny.

https://doi.org/10.24072/pci.evolbiol.100221.rev12

User comments

No user comments yet

or Register
Submit a preprint