52 May 2003: 187–191
Archibald & al. Bayesian inference of phylogeny
N E W T R E N D S I N P L A N T S Y S T E M AT I C S
Bayesian inference of phylogeny: a non-technical primer
Jenny K. Archibald1 , Mark E. Mort2 & Daniel J. Crawford2
of Evolution, Ecology, and Organismal Biology, The Ohio State University, Columbus, Ohio 43210, U.S.A. archibald.7@ osu.edu 2 Departmentof Ecology and Evolutionary Biology & Museum of Natural History and Biodiversity Research Center, University of Kansas, Lawrence, Kansas 66045, U.S.A. email@example.com (author for correspondence); firstname.lastname@example.org
In our initial contribution to this column, we briefly commented upon the impact that phylogeny reconstruction has had on systematic botany. We further indicated that we feel that plantsystematics is currently in a period of reevaluation of the data we have used as well as the methodology employed to estimate phylogeny. This is not to say that issues such as the use of morphological data versus molecular data or whether or not to combine data for phylogenetic analyses have not been debated in the past. Clearly, these and many other issues have received a considerable amount ofdebate in the phylogenetic literature. Possibly one of the most rigorously debated topics is the choice of an optimality criterion. In general, there are three basic methods that have been used to estimate phylogeny, including distance, maximum parsimony (MP), and maximum likelihood (ML). The relative merits and shortcomings of these methods have been debated for a number of years (e.g., Faith,1985; Swofford & Olsen, 1990; Kunhner & Felsenstein, 1994; Huelsenbeck, 1995; Farris & al., 1996; Lewis, 1998; Steel & Penny, 2000), and it is not within the scope of this column to reiterate these discussions. However, it is noteworthy that numerous comparative studies employing both known phylogenies and simulated data have been very useful in determining under what set of conditions each of themethods “out performs” the others. For example, it is now generally accepted that when rates of change along branches vary greatly, employing a parsimony optimality criterion may be misleading due to “long branch attraction” (Felsenstein, 1978; but see Siddall, 1998); whereas additional studies have shown that ML may be inconsistent in other situations, such as when the chosen model of evolution isinappropriate (e.g., Farris, 1999). Simulation studies indicate that distance methods (especially UPGMA) are highly susceptible to variations in evolutionary rates and typically perform more poorly than either MP or ML (e.g., Huelsenbeck & Hillis, 1993). Studies such as these have been important in laying a theoretical foundation for making decisions on how best to estimate phylogeny given thedata in hand. However,
under most sets of realistic conditions, comparison of ML and MP indicates that these methods perform similarly and often result in highly concordant topologies (e.g., Reed & al., 2002; Kimball & al., 2003). Recently, another round of comparative studies has begun to address a new approach for phylogeny reconstruction (e.g., Suzuki & al., 2002; Wilcox & al., 2002; Alfaro &al., 2003; Douady & al., 2003). This new approach, Bayesian analyses, was proposed in 1996 (Rannala & Yang, 1996; Mau, 1996; Li, 1996) and is now receiving much attention in the literature [e.g., see Systematic Biology 51 (5)]. Several excellent technical reviews have recently been provided by Huelsenbeck & al. (2001, 2002) and Lewis (2001). Although this approach is now a “hot” topic insystematics, Bayesian statistics actually dates back to the 18th century and its utility for reconstructing phylogeny was suggested initially in 1968 by Felsenstein (see Huelsenbeck & al., 2002). It is only recently, however, that these methods have become more widely known and that relevant computer programs have become available. Internet links for downloadable programs for Bayesian analyses (and other...
Leer documento completo
Regístrate para leer el documento completo.