Bioinformatics or computational biology is the use of techniques from applied mathematics, informatics, statistics, and computer science to solve biological problems.
Research in computational biology often overlaps with systems biology. Major research efforts in the field include sequence alignment, gene finding, genome assembly, protein structure alignment, protein structure prediction, prediction of gene expression and protein-protein interactions, and the modelling of evolution. The terms bioinformatics and computational biology are often used interchangeably, although the latter typically focuses on algorithm development and specific computational methods. (In the biology-mathematics-computer science triad, bioinformatics will intimately involve all three components while computational biology will focus on biology and mathematics.) A common thread in projects in bioinformatics and computational biology is the use of mathematical tools to extract useful information from noisy data produced by high-throughput biological techniques. (The field of data mining overlaps with computational biology in this regard.) Representative problems in computational biology include the assembly of high-quality DNA sequences from fragmentary "shotgun" DNA sequencing, and the prediction of gene regulation with data from mRNA microarrays or mass spectrometry.
Most of the scientific research I have done since May of 2000 could be described as bioinformatics or computational biology. For more information on this research, please see the following:
Projects / Servers
Below are links to external websites where I was the primary researcher. These are mainly Servers that I have setup:
- FastContact Server: a free energy scoring tool for protein-protein complex structures.
- Version 1.0: Programmer, Server architect, and administrator; July 2005–December 2006.
- Version 2.0: Programmer, Server architect, and administrator; January 2007–present.
- CBS Fungal Database: DNA structural atlases for complete chromosomes and genomes.
- Programmer, Server architect, and administrator; November 2005-December 2005.
- SmoothDock Server (currently under development):
- Programmer, Server architect, and administrator; January 2005–present (note: This server uses code optimised and run in parallel on 256 processors).
- LooseLoops Server (currently under development and construction):
- Programmer, Server architect, and administrator; November 2004–present.
- DomainSplit Server: predicts the number of domains in a given protein (PDB format)
- Server architect and administrator; September 2004–present.
- Programmer; May 2000-September 2004 (note: front end by Sandor Maurice; back end by Sandor Maurice and P. Christoph Champ.)
Bioinformatics in protein analysis
- Genome Sequencing and Databases: Genome Sequencing, Phred, Phrap, Consed, Sequin
- Database Searching for Similar Sequences: FASTA, BLAST, SSEARCH, Dotter, Align, Clustal, Meme, Fasta, Ssearch, Blast
- Phylogenetic Prediction: CN3D, MMDB, VAST, PREDATOR, Phylip
- Gene Analysis: HMMER, MAST, PSI-BLAST - Sim4, Genscan, GrailExp
- Persson B (2000). "Bioinformatics in protein analysis". EXS 88:215-31. PMID: 10803381.