Six-state amino acid recoding is not an effective strategy to offset the effects of compositional heterogeneity and saturation in phylogenetic analyses

Published on Aug 8, 2019 in bioRxiv
DOI: 10.1101/729103
Alexandra M. Hernandez (UF: University of Florida),
Joseph F. Ryan (UF: University of Florida)
Six-state amino acid recoding strategies are commonly applied to combat the effects of compositional heterogeneity and substitution saturation in phylogenetic analyses. While these methods have been endorsed from a theoretical perspective, their performance has never been extensively tested. Here, we test the effectiveness of 6-state recoding approaches by comparing the performance of analyses on recoded and non-recoded datasets that have been simulated under gradients of compositional heterogeneity or saturation. In all of our simulation analyses, non-recoding approaches greatly outperformed 6-state recoding approaches. Our results suggest that 6-state recoding strategies are not effective in the face of high saturation. Further, while recoding strategies do buffer the effects of compositional heterogeneity, the loss of information that accompanies 6-state recoding outweighs its benefits, even in the most compositionally heterogeneous datasets. In addition, we evaluate recoding schemes with 9, 12, 15, and 18 states and show that these all outperform 6-state recoding. Our results have important implications for the more than 70 published papers that have incorporated 6-state recoding, many of which have significant bearing on relationships across the tree of life.
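For readers unfamiliar with recoding, the following is a minimal sketch of what a 6-state recoding step does to an amino acid sequence. It assumes the widely used Dayhoff grouping (AGPST, DENQ, HKR, ILMV, FWY, C); the abstract does not name the specific schemes tested, and the names `DAYHOFF6_GROUPS`, `RECODE_TABLE`, and `recode_sequence` are hypothetical, introduced here only for illustration.

```python
# Dayhoff-style 6-state grouping of the 20 amino acids.
# Assumption: this particular grouping is used for illustration only;
# the abstract does not state which 6-state schemes were evaluated.
DAYHOFF6_GROUPS = {
    "0": "AGPST",
    "1": "DENQ",
    "2": "HKR",
    "3": "ILMV",
    "4": "FWY",
    "5": "C",
}

# Invert the grouping into a per-residue lookup table, e.g. "A" -> "0".
RECODE_TABLE = {
    aa: state for state, residues in DAYHOFF6_GROUPS.items() for aa in residues
}


def recode_sequence(seq: str) -> str:
    """Replace each amino acid with its group label.

    Characters outside the 20 standard residues (gaps, 'X', etc.)
    are passed through unchanged.
    """
    return "".join(RECODE_TABLE.get(ch, ch) for ch in seq.upper())


if __name__ == "__main__":
    # Two different peptides collapse to the same recoded string,
    # which illustrates the information loss discussed in the abstract.
    print(recode_sequence("ILMV"))
    print(recode_sequence("VVVV"))
```

Because every residue within a group maps to the same label, substitutions inside a group become invisible after recoding. That reduces sensitivity to compositional shifts among similar residues, but it also discards signal, which is the trade-off the abstract evaluates.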