Algorithms for Searching and Analyzing Sets of Evolutionary Trees

Brammer, Grant

dc.contributor.advisor	Williams, Tiffani
dc.creator	Brammer, Grant
dc.date.accessioned	2015-01-09T20:49:39Z
dc.date.available	2016-05-01T05:31:04Z
dc.date.created	2014-05
dc.date.issued	2014-04-18
dc.date.submitted	May 2014
dc.identifier.uri	https://hdl.handle.net/1969.1/152770
dc.description.abstract	The evolutionary relationships between organisms are represented as phylogenetic trees. These trees have important implications for understanding biodiversity, tracking disease, and designing medicine. Since the evolutionary process that led to modern biodiversity was not directly recorded, phylogenetic trees are inferred from modern observations. Inferring accurate phylogenies is computationally difficult, and many inference algorithms produce multiple phylogenetic trees of equal quality. The common method for presenting a set of trees is to summarize their common features into a single consensus tree. Consensus methods make it easy to tell which features are common to a set of trees, but how do you explore the hypotheses that are not found in the majority of trees? This question is best answered by a search algorithm. We present algorithms to query a set of trees based on their internal structure. Trees can be queried based on their bipartitions, quartets, clades, subtrees, or taxa, and we present a new concept which unifies edge-based relationships for search functions. To extend the power of our search functions, we provide the ability to combine the results of multiple searches using set operations. We also explore the differences between sets of trees. Clustering algorithms can detect if there are multiple distinct hypotheses within a set of trees. Decision tree depth and distinguishing bipartitions can be used to measure the similarity between sets of trees. For situations where a set of trees is made up of multiple distinct sets, we present p-support, a measure that quantifies the impact of the individual sets on a single consensus tree. The algorithms are presented within the context of TreeHouse, my open-source platform for querying and analyzing sets of trees. One goal of TreeHouse was to unite query and analysis algorithms under a single user interface.
The seamless interaction between fast filtering and analysis algorithms allows users to explore their data in a way not easily accomplished elsewhere. We believe that the algorithms in this document and in TreeHouse can shed new light on often unexplored territory.	en
dc.format.mimetype	application/pdf
dc.language.iso	en
dc.subject	computer science	en
dc.subject	phylogenetics	en
dc.subject	query	en
dc.subject	algorithms	en
dc.title	Algorithms for Searching and Analyzing Sets of Evolutionary Trees	en
dc.type	Thesis	en
thesis.degree.department	Computer Science and Engineering	en
thesis.degree.discipline	Computer Science and Engineering	en
thesis.degree.grantor	Texas A & M University	en
thesis.degree.name	Doctor of Philosophy	en
thesis.degree.level	Doctoral	en
dc.contributor.committeeMember	Amato, Nancy
dc.contributor.committeeMember	Welch, Jennifer
dc.contributor.committeeMember	Murphy, William
dc.type.material	text	en
dc.date.updated	2015-01-09T20:49:39Z
local.embargo.terms	2016-05-01
local.etdauthor.orcid	0000-0003-0725-791X

Files in this item

Name: BRAMMER-DISSERTATION-2014.pdf
Size: 22.11Mb
Format: PDF


This item appears in the following Collection(s)

Electronic Theses, Dissertations, and Records of Study (2002– )
Texas A&M University Theses, Dissertations, and Records of Study (2002– )
