Difference between revisions of "UPGMA"

From Christoph's Personal Wiki
Jump to: navigation, search
(Started article)
Line 6: Line 6:
 
* [[Phylogenetic trees]]
 
* [[Phylogenetic trees]]
  
 +
{{Phylogenetics}}
 
[[Category:Academic Research]]
 
[[Category:Academic Research]]
 
[[Category:Phylogenetics]]
 
[[Category:Phylogenetics]]

Revision as of 04:48, 13 September 2006

UPGMA (Unweighted Pair-Group Method using Arithmetic Averages) is a simple bottom-up data clustering method used in bioinformatics for the creation of phylogenetic trees. The input data is a collection of objects with their pairwise distances and the output is a rooted tree (dendrogram). It is sometimes used for creating rooted phylogenetic trees under the assumption of a constant evolutionary rate. Initially, each object is in its own cluster. At each step, the nearest two clusters are combined into a higher-level cluster. The distance between any two clusters A and B is taken to be the average of all distances between pairs of objects a in A and b in B. UPGMA is not a well-regarded method for inferring phylogenetic trees unless the constant-rate assumption (molecular clock hypothesis) has been tested and justified for the data set being used.

UPGMA involves clustering of closely distant species. At each stage of clustering, tree branches are being built, and the branch lengths are calculated. UPGMA assumes a constant evolutionary rate, and so the two species in a cluster are given the same branch length from the node. It is a simple and fast method; however, because of the assumption, it often produces incorrect topologies when the assumption is not met.

See also

Topics in phylogenetics
Relevant fields: phylogenetics | computational phylogenetics | molecular phylogeny | cladistics
Basic concepts: synapomorphy | phylogenetic tree | phylogenetic network | long branch attraction
Phylogeny inference methods: maximum parsimony | maximum likelihood | neighbour joining | UPGMA