TY - JOUR
T1 - DALIX: Optimal DALI protein structure alignment
AU - Wohlers, Inken
AU - Andonov, Rumen
AU - Klau, Gunnar W.
PY - 2013
Y1 - 2013
N2 - We present a mathematical model and exact algorithm for optimally aligning protein structures using the dali scoring model. This scoring model is based on comparing the interresidue distance matrices of proteins and is used in the popular dali software tool, a heuristic method for protein structure alignment. Our model and algorithm extend an integer linear programming approach that has been previously applied for the related, but simpler, contact map overlap problem. To this end, we introduce a novel type of constraint that handles negative score values and relax it in a Lagrangian fashion. The new algorithm, which we call dalix, is applicable to any distance matrix-based scoring scheme. We also review options that allow to consider fewer pairs of interresidue distances explicitly because their large number hinders the optimization process. Using four known data sets of varying structural similarity, we compute many provably score-optimal dali alignments. This allowed, for the first time, to evaluate the dali heuristic in sound mathematical terms. The results indicate that dali usually computes optimal or close to optimal alignments. However, we detect a subset of small proteins for which dali fails to generate any significant alignment, although such alignments do exist.
AB - We present a mathematical model and exact algorithm for optimally aligning protein structures using the dali scoring model. This scoring model is based on comparing the interresidue distance matrices of proteins and is used in the popular dali software tool, a heuristic method for protein structure alignment. Our model and algorithm extend an integer linear programming approach that has been previously applied for the related, but simpler, contact map overlap problem. To this end, we introduce a novel type of constraint that handles negative score values and relax it in a Lagrangian fashion. The new algorithm, which we call dalix, is applicable to any distance matrix-based scoring scheme. We also review options that allow to consider fewer pairs of interresidue distances explicitly because their large number hinders the optimization process. Using four known data sets of varying structural similarity, we compute many provably score-optimal dali alignments. This allowed, for the first time, to evaluate the dali heuristic in sound mathematical terms. The results indicate that dali usually computes optimal or close to optimal alignments. However, we detect a subset of small proteins for which dali fails to generate any significant alignment, although such alignments do exist.
UR - http://www.scopus.com/inward/record.url?scp=84878307737&partnerID=8YFLogxK
U2 - 10.1109/TCBB.2012.143
DO - 10.1109/TCBB.2012.143
M3 - Journal articles
C2 - 23702541
AN - SCOPUS:84878307737
SN - 1545-5963
VL - 10
SP - 26
EP - 36
JO - IEEE/ACM Transactions on Computational Biology and Bioinformatics
JF - IEEE/ACM Transactions on Computational Biology and Bioinformatics
IS - 1
M1 - 6365174
ER -