Genetic algorithm for large-scale maximum parsimony phylogenetic analysis of proteins
2005 (English)In: Biochimica et Biophysica Acta - General Subjects, ISSN 0304-4165, E-ISSN 1872-8006, Vol. 1725, no 1, 19-29 p.Article in journal (Refereed) Published
Inferring phylogeny is a difficult computational problem. For example, for only 13 taxa, there are more then 13 billion possible unrooted phylogenetic trees. Heuristics are necessary to minimize the time spent evaluating non-optimal trees. We describe here an approach for heuristic searching, using a genetic algorithm, that can reduce the time required for weighted maximum parsimony phylogenetic inference, especially for data sets involving a large number of taxa. It is the first implementation of a weighted maximum parsimony criterion using amino acid sequences. To validate the weighted criterion, we used an artificial data set and compared it to a number of other phylogenetic methods. Genetic algorithms mimic the natural selection's ability to solve complex problems. We have identified several parameters affecting the genetic algorithm. Methods were developed to validate these parameters, ensuring optimal performance. This approach allows the construction of phylogenetic trees with over 200 taxa in practical time on a regular PC.
Place, publisher, year, edition, pages
2005. Vol. 1725, no 1, 19-29 p.
Medical and Health Sciences
IdentifiersURN: urn:nbn:se:uu:diva-95458DOI: 10.1016/j.bbagen.2005.04.027OAI: oai:DiVA.org:uu-95458DiVA: diva2:169674