Here are 376 phylogeny packages and 50 free
servers, all that I know about.
It is an attempt to be completely comprehensive. I have not made any attempt to
exclude programs that do not meet some standard of quality or importance.
Updates to these pages are made roughly weekly.
Here is a "waiting list" of new programs waiting to have
their full entries constructed.
Many of the programs in these pages are available on the web, and some of
the older ones are also available from ftp server machines.
The programs listed below include both free and non-free
ones; in some cases I do not know whether a program is free. I have listed as
free those that I knew were free; for the others you have to ask their
distributor. Usually when I say that a program is downloadable from a
web site, this means that it is available free.
Email addresses in these pages have had the @ symbol replaced by
(at) and also surrounded by invisible confusing tags and blank
characters in hopes of foiling spambots that harvest email addresses.
If you discover any inaccuracies, or feel that I have left any important
programs or facts out, or if links do not work properly, please
e-mail me at: (joe (at) gs.washington.edu). You can also use the submission form here to submit new entries.
|
Owing to past NSF support of these pages, I am required to note that
any opinions, findings, and conclusions or recommendations expressed in this
material are those of the author and do not necessarily reflect the views of
the National Science Foundation (NSF supported these pages from 1995-2003).
|
|
List of packages arranged ... |
|
Table of contents by methods available |
General-purpose packages
Parsimony programs
Distance matrix methods
Computation of distances
Maximum likelihood and Bayesian methods
Quartets methods
Artificial-intelligence and genetic algorithms methods
Invariants (or Evolutionary Parsimony) methods
Interactive tree manipulation
Looking for hybridization or recombination events
Bootstrapping and other measures of support
Compatibility analysis
Consensus trees, subtrees, supertrees, and distances between trees
Tree-based sequence alignment
Gene duplication and genomic analysis
Biogeographic analysis and host-parasite comparison
Comparative method analysis
Simulation of trees or data
Examination of shapes of trees
Clocks, dating and stratigraphy
Model Selection
Description or prediction of data from trees
Tree plotting/drawing
Sequence management/job submission
Teaching about phylogenies
Table of contents by computer systems
on which they work |
- Unix (source code in C or executables).
I have included programs that
are available as C source code because most Unix workstations have a C
compiler. (Programs in other compiled languages such as FORTRAN and Pascal,
and in interpreted languages such as Java, Perl, Python, or R are also
included), as are Java executables.
For many of these the programs can also be compiled or run on Windows or
Mac OS X systems if they have the appropriate compilers or interpreters loaded.
- PC's
- as Windows executables (not
counting executing in a "DOS box").
Programs available as source code which is Windows-specific are listed here.
Java executables are also included. (Note that compilers available on Windows
systems, particularly the free Cygwin and MinGW compilers, can also be used to
compile many of the programs listed above under Unix generic source code).
- under DOS (MSDOS, PCDOS) or in a Command Tool "DOS box" in Windows
- Macintosh Mac OS or Mac OS X
executables. Programs available in source code that is
Mac OS X specific are included. Java executables are also included.
(Note that most of the programs in the Unix/Linux list above can also
be compiled for Mac OS X using the GCC compiler that is distributed with
Mac OS X).
|
Analyzing particular types of data |
Here you will find lists of programs that analyze types of data other than
molecular sequence data. We will gradually expand this list of data types.
- Microsatellite data
-
- RAPDs, RFLPs, or AFLPs
-
- Continuous quantitative characters
-
- Gene frequencies (aside from microsatellite loci)
-
(under construction: more coming soon)
Here are the packages that have most recently been added to these listings: (the most recent ones first). Entries are retained in this list
for about 6 months. Note also below the "waiting list" area
listing programs that are to be added.
You can use the submission form here to submit new entries.
- MULTIDIVTIME HELPER
(26 April 2008) prepares constraint trees and folders for
Multidivtime
- GenoDive (25 April 2008)
computes genetic diversity measures including standardized distances.
- Codeml3X (19 April 2008) script
to run CODEML from PAML three times with
different values of ω
- NEPAL (19 April 2008) infers
networks by parsimony or likelihood.
- PHYLLAB (12 April 2008) A MATLAB toolbox for MCMC
Bayesian inference of phylogenies from sequences.
- Bioinformatics_Toolbox (12 April 2008)
A sequence analysis MATLAB toolbox which can make distance-based trees.
- DTdraw (5 April 2008) draws gene
duplication history trees.
- GAME (5 April 2008) estimates the
α shape parameter using distances and also infers the tree.
- Concaterpillar (29 March
2008)
performs a hierarchical likelihood ratio test for phylogenetic congruence.
- EvolSimulator (29 March
2008) simulates sequence evolution with genomic events and lateral gene transfer as well.
- PhyloNet (23 March 2008) infers networks by a
parsimony method allowing recombinations.
- TreeSnatcher (22 March 2008)
captures trees from graphic images.
- TreeToy (15 March 2008) displays
and
reroots trees, and selects subtrees.
- bootscore (9 March 2008) computes majority rule
consensus tree and puts bootstrap scores on it.
- gtp (9 March 2008) gene tree parsimony program.
- RPT (1 March 2008) makes circular plots of a tree
that show the values of characters graphically.
- EvolveAGene3 (23 February
2008) simulates evolution of protein sequences with gaps
- AxParafit (16 February 2008)
Parallel and optimized version of the ParaFit
program for host-parasite analyses.
- FAMD (15 February 2008) Distance and distance matrix
programs for RAPD fingerprinting with consensus and bootstraps.
- rRNA phylogeny (9 February 2008)
phylogenies from compensating substitutions for paired and unpaired rRNA sites.
- SLR (9 February 2008) sitewise
analysis of nonsynonymous / synonymous rates of substitution.
- NHML (2 February 2008) Infers maximum
likelihood phylogenies using a nonhomogenous model of nucleotide change.
- Leaphy (2 February 2008) infers
maximum likelihood phylogenies by its own search methods.
- HGT (26 January 2008) estimates the
rate of horizontal gene transfer.
- GRate (26 January 2008) does
relative rate tests comparing prespecified groups using PAUP*
- SeqState (19 January 2008)
recodes indels in various ways.
- PRAP (19 January 2008) implements
ratchet searches and Bremer support calculations in
PAUP*
- CTree (12 January 2008)
interactively views clusters according to measures of their diversity.
- AMBIORE (12 January 2008)
estimates rates of change of bases dependent on the bases at the two neighboring sites.
- MUSCLE (11 December 2007) multiple
sequence alignment using trees.
- PISE (3 December 2007) generates web interfaces from
XML for molecular evolution programs.
- PhyloSort (3 December 2007) finds monophyletic
groups with particular subsets of species in trees.
- PSODA (24 November 2007) parsimony
program for nucleotide sequences.
- LEVEL2 (24 November 2007)
constructs networks with limited degrees of looping from triplets.
- DAWG (17 November 2007) simulates
evolution of DNA sequences with recombination and gaps.
- GRIMM server (17 November
2007) computes
distances or phylogenies that minimize chromosome rearrangements.
- PhyloCoCo (11 November 2007)
infers model parameters and uses special search methods to find the maximum
likelihood tree.
- EEEP (11 November 2007) reconstructs
horizontal gene transfer events from a gene tree and a species tree.
- PROCOV (4 November 2007) does
maximum likelihood inference for proteins under covarion models.
- DART (4 November 2007) a package with
methods for RNA structure and sequence alignment.
- DTscore (27 October 2007) a
distance-based program to reconstruct phylogenies of tandem duplication copies.
- MBEToolbox (27 October 2007)
a MATLAB toolbox to analyze DNA and protein sequences.
|
Here are the packages whose entries have most recently been changed:
The date on which each change was entered is shown. Entries are retained in
this list for about 6 months.
(Note that changes may be as small as updated version numbers or a
modified web address). The most recent
changes are first.
|
Other lists of phylogeny software |
- There is one phylogeny software list even more complete and up-to-date
than this one: a more recent version of this list. If you are
reading this on the web pages at our server
evolution.gs.washington.edu, you are reading the most up-to-date
version. But if you are reading a version stored anywhere else, you might
want to
look here instead.
- The Institut Pasteur in Paris has the
Bio NetBook,
a search facility for biocomputing web pages, which include web pages of
other lists, web pages of programs or servers, and some pages that simply no
longer exist. It is located at
http://www.pasteur.fr/recherche/BNB/bnb-en.html. Programs for
phylogenies can be found by, for example, selecting software from the
Resource Type list and evolution from the Biological Domain list
without selecting any Organism. Other categories worth looking at include
Online Analysis Tools, Sequence Analysis and
Bio-computing.
- Sergios-Orestis Kolokotronis has posted
an extensive table
of phylogeny programs at his site at the American Museum of Natural History,
and near it are others under headings such as "molecular evolution" and
"alignment".
- Classification and clustering programs are described in
a useful Web page
maintained by Fionn Murtagh at
http://astro.u-strasbg.fr/~fmurtagh/mda-sw/online-sw.html.
Note, however, that inferring phylogenies and making clusters are different
tasks; the software described on that list will be of most use to people who
are trying to cluster or classify but not as much for inferring phylogenies.
- The Digital Taxonomy web site, maintained by Mauro José Cavalcanti
in Rio de Janeiro, Brazil, lists phylogeny programs (and programs in other
areas of systematics and morphometrics), with attention restricted to those
that are free and whose source code is publicly available.
- Genamics, a company located in Hamilton, New Zealand, maintains the
SoftwareSeek searchable index of bioinformatics software
at
http://genamics.com/software/index.htm
in a number of categories. One of them is Phylogenetic Analysis. They have
a reasonably large number
of entries under that heading, though it also includes some statistical
genetics software that is really not phylogenetic. Their listing has links
to the web sites of the software; for those programs that are not available
by Web they maintain copies for download at their server.
- Georg Fuellen at the University of Bielefeld, Germany, has a
very good page on
Multiple Alignment Resources at
http://www.techfak.uni-bielefeld.de/bcd/Curric/MulAli/welcome.html.
- David Robertson of the Bioinformatics Education and Research at the
University of Manchester, England, maintains a very informative web site at
listing programs and their web sites that test for the presence of
recombination or hybridization events in DNA sequence data. It lists some
programs that are covered here, and others that are outside the scope of
these web pages. That site is located at
http://bioinf.man.ac.uk/~robertson/recombination/.
- Mike Robeson at the University of Colorado maintains a page with
multiple programs listed as Bioinformatics
software for Mac OS X.
- The Bioinformatics Organization, a nonprofit group in Hudson, Massachusetts,
has posted the bioinformatics.org
web pages. These offer a free membership and host open source software projects
in bioinformatics. They also have a Molecular Linux listing of Linux programs to carry out bioinformatics
tasks.
- Don Gilbert, of the Department of Biology of the University of
Indiana, has a good web page on
Free Software in Molecular Biology for Macintosh and MS Windows computers
as of 1998 at http://iubio.bio.indiana.edu/soft/molbio/Listings.html.
It lists some popular packages and all packages and programs kept
at the IUBio ftp server at that time (see
our description of that server). Unfortunately
the web links on that page are not active so the addresses must be
retyped by hand.
- Andrea Ramge, of Biomax Informatics AG, Martinsried, Germany
has created the
bioinformatik.de
index of resources. It includes a
list of software located at
http://www.bioinformatik.de/cgi-bin/browse/Catalog/Software.
The phylogeny programs listings there are located within the categories
for different operating systems.
- Biodirectory.com
maintains a BioWiki site, with content edited by users. It requires a free
login and is supported by advertising. It has lists of phylogeny and alignment
programs.
- Silvio Nihei at the University of São Paulo in Brazil has produced
a list: Programas para Filogenia (Morfológica) e Biogeografi in
Portugese. It concentrates on a small number of programs that mostly
use parsimony methods.
|
New programs waiting to be added |
This is a "waiting list" showing links to the web pages of many new
phylogeny programs, which I have not yet had time to add to the main listing.
They will be listed there, with a single web link and
no detailed explanation. I hope that this list will gradually shrink as the new
programs are put into the main listing.
You can use the submission form here to submit new entries.
These are waiting to be added:
- Tree Tracker detects
overrepresented clusters in trees.
-
Ultrametric Check calculates root-to-tip distances for all species in a
tree.
- analysis
molecular population genetics package can compute
Kimura 2-parameter distances.
- CIPRES Web
Portal server runs parsimony, likelihood and Bayesian programs.
- CodeAxe estimates
neighbor-dependent mutation rates in DNA sequences.