Journal Article

Computational Methods for Evaluating Phylogenetic Models of Coding Sequence Evolution with Dependence between Codons

Nicolas Rodrigue, Claudia L. Kleinman, Hervé Philippe and Nicolas Lartillot

in Molecular Biology and Evolution

Published on behalf of Society for Molecular Biology and Evolution

Volume 26, issue 7, pages 1663-1676
Published in print July 2009 | ISSN: 0737-4038
Published online April 2009 | e-ISSN: 1537-1719 | DOI: http://dx.doi.org/10.1093/molbev/msp078
Computational Methods for Evaluating Phylogenetic Models of Coding Sequence Evolution with Dependence between Codons

More Like This

Show all results sharing these subjects:

  • Evolutionary Biology
  • Molecular and Cell Biology

GO

Show Summary Details

Preview

In recent years, molecular evolutionary models formulated as site-interdependent Markovian codon substitution processes have been proposed as means of mechanistically accounting for selective features over long-range evolutionary scales. Under such models, site interdependencies are reflected in the use of a simplified protein tertiary structure representation and predefined statistical potential, which, along with mutational parameters, mediate nonsynonymous rates of substitution; rates of synonymous events are solely mediated by mutational parameters. Although theoretically attractive, the models are computationally challenging, and the methods used to manipulate them still do not allow for quantitative model evaluations in a multiple-sequence context. Here, we describe Markov chain Monte Carlo computational methodologies for sampling parameters from their posterior distribution under site-interdependent codon substitution models within a phylogenetic context and allowing for Bayesian model assessment and ranking. Specifically, the techniques we expound here can form the basis of posterior predictive checking under these models and can be embedded within thermodynamic integration algorithms for computing Bayes factors. We illustrate the methods using two data sets and find that although current forms of site-interdependent models of codon substitution provide an improved fit, they are outperformed by the extended site-independent versions. Altogether, the methodologies described here should enable a quantified contrasting of alternative ways of modeling structural constraints, or other site-interdependent criteria, and establish if such formulations can match (or supplant) site-independent model extensions.

Keywords: Markov chain Monte Carlo; data augmentation; auxiliary variables; posterior predictive checking; Bayes factors; protein tertiary structure

Journal Article.  9738 words.  Illustrated.

Subjects: Evolutionary Biology ; Molecular and Cell Biology

Full text: subscription required

How to subscribe Recommend to my Librarian

Users without a subscription are not able to see the full content. Please, subscribe or login to access all content.