T-Coffee

From Christoph's Personal Wiki
Jump to: navigation, search

T-Coffee (Tree-based Consistency Objective Function For alignment Evaluation) is a multiple sequence alignment software using a progressive approach. It generates a library of pairwise alignments to guide the multiple sequence alignment. It can also combine multiple sequences alignments obtained previously and in the latest versions can use structural information from PDB files (3D-Coffee). It has advanced features to evaluate the quality of the alignments and some capacity of identifying occurrence of motifs (Mocca). It produces alignment in the aln format (Clustal) by default, but can also produce PIR, MSF and FASTA format. The input files are always in FASTA format. Note that what T-Coffee terms "Clustal" format is sufficiently different from the output of ClustalW/X that many programs supporting Clustal format cannot read it; fortunately ClustalX can import T-Coffee output so the simplest fix for this issue is usually to import T-Coffee's output into ClustalX and then re-export.

Note: The latest version is 6.92 (2008-09-12)

Installation

  • Requirements: gcc, g77, makefile, CPAN
tar -zxvf t_coffee.tar.gz
cd T-COFFEE_distribution_Version_X.XX
./install

This installation procedure is semi-interactive. It will prompt questions here and there. You can interrupt it any time and resume it later.

The install procedure carries out three distinct tasks:

  1. compilation of T-Coffee (C program);
  2. compilation and installation of SOAP::Lite (Perl module); and
  3. download/compilation and installation of all the T-Coffee companion packages required for all possible T-Coffee flavors (tcoffee, expresso, 3dcoffee, mcoffee, rcoffee).

Except for T-Coffee, the installer will only install the packages that are NOT already on your computer. If you want a lighter or more specific installation, you can try any of the following:

./install tcoffee
./install rcoffee
./install expresso
./install 3dcoffee
./install 3dcoffee

While installing SOAP::Lite, CPAN will ask you many questions: say "Yes" to all or type return to keep the default values. If everything went well, the procedure has created in the bin directory two executables: t_coffee and TMalign (make sure these executables are on your $PATH).

If you have not managed to install SOAP::Lite, you can re-install it anytime (from anywhere) using steps 1-2

If you cannot log as root, or if for some reason this procedure does not work, see with your system manager and/or go directly to the CPAN repository of SOAP::Lite. You will still be able to use the most basic functions of T-Coffee.

IMPORTANT: The purpose of SOAP::Lite is to allow T-Coffee the use of the EBI webservices such as webblast. BLAST brings many functionalities to T-Coffee and if you cannot install SOAP we suggest you go to the "Installing BLAST for T-Coffee" section of the Technical Doumentation (in the "Installation" section). There you will find alternative ways of using BLAST without SOAP. It is also in this document that you will find all the information required for a full installation of T-Coffee.

Usage

t_coffee foo.fa

the default output is:

  • foo.aln (the multiple sequence alignment); and
  • foo.dnd (the guide tree in Newick format)

Residue types

  • charged: KRDE
  • polar: NQST
  • aliphatic: ILMV
  • aromatic: FYW
  • others: APCGH

See also

  • Clustal
  • MUSCLE
  • Bioinfo tutorial — by Wikiomics.org
  • LiSA — a software platform for structural alignment algorithms. (including: "T-Lara: RNA multiple Structural Alignment")
  • MARNA — a server for Multiple Alignment of RNAs.
  • SeaView — a graphical multiple sequence alignment editor. Able to read and write various alignment formats (NEXUS, MSF, CLUSTAL, FASTA, PHYLIP, MASE). It allows to manually edit the alignment, and also to run DOT-PLOT or CLUSTALW/MUSCLE programs to locally improve the alignment.
  • M-Coffee — computes a multiple sequence alignment and the associated phylogenetic tree by combining the output of several multiple sequence alignment packages (PCMA, Poa, Mafft, Muscle, T-Coffee, ClustalW, ProbCons, DialignT).

References

External links