Revision as of 20:33, 19 December 2004 editL33tminion (talk | contribs)Extended confirmed users, Pending changes reviewers2,124 editsm →Algorithms← Previous edit | Revision as of 18:15, 3 January 2005 edit undo146.203.3.237 (talk) →PackagesNext edit → | ||
Line 30: | Line 30: | ||
|- | |- | ||
| CE/CE-MC | | CE/CE-MC | ||
| '''C'''ombinatorial '''E'''xtension || Cα ||Multi || || I. Shindyalov || 2000 | | '''C'''ombinatorial '''E'''xtension -- '''M'''onte '''C'''arlo || Cα ||Multi || || I. Shindyalov || 2000 | ||
|- | |- | ||
| DaliLite | | DaliLite |
Revision as of 18:15, 3 January 2005
Protein structural alignment is a form of alignment which tries to establish equivalences between two or more protein structures based on their fold. In contrast to simple structural superposition, where at least some equivalent residues of the two structures are known, structural alignment requires no a priori knowledge of equivalent positions. Structural alignment is a valuable tool for the comparison of proteins in the so called "twilight zone" and "midnight zone" of homology (biology), where relationships between proteins can't be detected by sequence alignment methods. The method can therefore be used to establish evolutionary relationships between proteins that share no or nearly no common primary structure. This is especially important in the light of structural genomics and proteomics projects. The result of a structural alignment of two proteins is a superposition of their atomic coordinate sets with a minimal root mean square deviation (RMSD) between the two structures.
Algorithms
Up to now there is no definitive algorithmic solution to protien structural alignment. It could be shown that the alignment problem is NP-hard. All current algorithms employ heuristic methods. Therefore different algorithms may not produce exactly the same results for the same alignment problem.
Representation of structures
Protein structures have to be represented in some coordinate independent space to make them comparable. One possible representation is the so-called distance matrix, which is a two-dimensional matrix containing all pairwise distance between all Cα atoms of the protein backbone. This can also be represented as a set of overlapping sub-matrices spanning only fragments of the protein. Another possible representation is the reduction of the protein structure to the level of secondary structure elements (SSEs), which can be represented as vectors, and can carry additional information about relationships to other SSEs, as well as about certain biophysical properties.
Comparison and Optimization
In the case of distance matrix representation, the comparison algorithm breaks down the distance matrices into regions of overlap, which are then again combined if there is overlap between adjacent fragments, thereby extending the alignment. If the SSE representation is chosen, there are several possibilities. One can search for the maximum ensemble of equivalent SSE pairs using algorithms to solve the maximum clique problem from graph theory. Other approaches employ dynamic programming or combinatorial simulated annealing.
Packages
Several tools for pairwise and multiple structural alignments are available on the web:
NAME | Description | Class | Type | Link | Author | Year |
---|---|---|---|---|---|---|
MAMMOTH | MAtching Molecular Models Obtained from Theory | Cα | Multi | server | AR. Ortiz | 2002 |
CE/CE-MC | Combinatorial Extension -- Monte Carlo | Cα | Multi | server | I. Shindyalov | 2000 |
DaliLite | Distance Matrix Alignment | Contact Map | Multi | server | L. Holm | 1993 |
VAST | Vector Alignment Search Tool | SSE | Pair | server | S. Bryant | 1996 |
PrISM | Protein Informatics Systems for Modeling | SSE | Multi | server | B. Honig | 2000 |
SSAP | Sequential Structure Alignment Program | SSE | Multi | server | C. Orengo | 1989 |
SARF2 | Spatial Arrangements of Backbone Fragments | SSE | Pair | server | D. Fischer | 1996 |
KENOBI/K2 | NA | SSE | Pair | server | Z. Weng | 2000 |
STAMP | STructural Alignment of Multiple Protein | Sequence | Pair | server | G. Barton | 1992 |
MASS | Multiple Alignment by Secondary Structure | SSE | Multi | server | R. Nussinov | 2003 |
MALECON | NA | Geometry | Multi | NA | S. Wodak | 2004 |
MultiProt | NA | Geometry | Multi | server | R. Nussinov | 2004 |
KEY MAP:
* Cα -- Backbone Atom (Cα) Alignment; * SSE -- Secondary Structure Element Alignment; * Pair -- Pairwise Alignment (2 structures *only*); * Multi -- Multiple Structure Alignment (MStA);
See also
References
- Bourne, P.E & Shindyalov, I.N. (2003): Structure Comparison and Alignment. In: Bourne, P.E., Weissig, H. (Eds): Structural Bioinformatics. Hoboken NJ: Wiley-Liss. ISBN 0-471-20200-2
- Olmea O, Straus CE, Ortiz AR. (2002) MAMMOTH (matching molecular models obtained from theory): an automated method for model comparison. Protein Sci 11,2606-21