RESEARCH ARTICLE
Fast Multiple Alignment of Protein Structures Using Conformational Letter Blocks
Wang Sheng, Zheng Wei-Mou*
Article Information
Identifiers and Pagination:
Year: 2009Volume: 3
First Page: 69
Last Page: 83
Publisher ID: TOBIOIJ-3-69
DOI: 10.2174/1875036200903010069
Article History:
Received Date: 09/09/2009Revision Received Date: 09/10/2009
Acceptance Date: 09/10/2009
Electronic publication date: 19/11/2009
Collection year: 2009
open-access license: This is an open access article distributed under the terms of the Creative Commons Attribution 4.0 International Public License (CC-BY 4.0), a copy of which is available at: https://creativecommons.org/licenses/by/4.0/legalcode. This license permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Abstract
Most approaches for protein structure alignment start from a search for similar fragments since this local similarity is necessary to the alignment even though is insufficient. In contrary to the sequence alignment, any insignificant trial alignment for structures can be detected by structure superposition and then excluded. It is then practicable to select from locally similar fragments those responsible for alignment and build up it. An efficient way for local similarity search is to use a conformational alphabet, which is a discretized description of protein chain local geometry. Using our conformational alphabet and its substitution matrix CLESUM, we propose a tool called BLOMAPS for fast multiple structure alignment.
By means of the conformational alphabet, a structural fragment is mapped to a string, and two strings with their CLESUM score being higher than a preset threshold form a similar fragment pair (SFP). A string from one protein as a seed and its highly similar fragments from other proteins form a similar fragment block. Taking one protein as the pivot, BLOMAPS uses the rigid transformation for SFPs in a block to superimpose proteins and initiate an anchor-based alignment. BLOMAPS is greedy in nature, guided by CLESUM similarity scores. It consists of several steps including finding similar fragment blocks based on a pivot protein, removing block redundancy, constructing scaffold by checking consistency in spatial arrangement among fragments from different blocks, dealing with unanchored structures, and the final step of refinement where the average template for alignment is obtained and motifs missing from the pivot protein are found and added. The utility of BLOMAPS is tested on various protein structure ensembles including large scale ones, and compared with several other tools including MATT.
BLOMAPS is available at: www.itp.ac.cn/zheng/blomaps.rar