Treffer: CombAlign: a code for generating a one-to-many sequence alignment from a set of pairwise structure-based sequence alignments.
Nucleic Acids Res. 2015 Jul 1;43(W1):W174-81. (PMID: 25883148)
Future Virol. 2015 May;10(5):537-546. (PMID: 26120351)
PLoS Pathog. 2015 Apr 30;11(4):e1004731. (PMID: 25928849)
J Virol. 2003 Feb;77(3):1812-9. (PMID: 12525615)
EMBO J. 2000 Dec 15;19(24):6732-41. (PMID: 11118208)
J Virol. 2011 Sep;85(17):8502-13. (PMID: 21697477)
Future Virol. 2009;4(6):621-635. (PMID: 20198110)
Nucleic Acids Res. 2005 Apr 22;33(7):2302-9. (PMID: 15849316)
Adv Protein Chem Struct Biol. 2014;94:121-75. (PMID: 24629187)
BMC Biol. 2007 May 08;5:17. (PMID: 17488521)
Bioinformatics. 2000 Jun;16(6):566-7. (PMID: 10980157)
Infect Dis Poverty. 2014 Nov 28;3:43. (PMID: 25699183)
Curr Opin Struct Biol. 2006 Apr;16(2):172-7. (PMID: 16510277)
EMBO J. 2000 Aug 15;19(16):4228-36. (PMID: 10944105)
Bioinformatics. 2005 Jul 15;21(14):3089-96. (PMID: 15905278)
PLoS One. 2012;7(7):e39978. (PMID: 22792204)
Weitere Informationen
Background: In order to better define regions of similarity among related protein structures, it is useful to identify the residue-residue correspondences among proteins. Few codes exist for constructing a one-to-many multiple sequence alignment derived from a set of structure or sequence alignments, and a need was evident for creating such a tool for combining pairwise structure alignments that would allow for insertion of gaps in the reference structure.
Results: This report describes a new Python code, CombAlign, which takes as input a set of pairwise sequence alignments (which may be structure based) and generates a one-to-many, gapped, multiple structure- or sequence-based sequence alignment (MSSA). The use and utility of CombAlign was demonstrated by generating gapped MSSAs using sets of pairwise structure-based sequence alignments between structure models of the matrix protein (VP40) and pre-small/secreted glycoprotein (sGP) of Reston Ebolavirus and the corresponding proteins of several other filoviruses. The gapped MSSAs revealed structure-based residue-residue correspondences, which enabled identification of structurally similar versus differing regions in the Reston proteins compared to each of the other corresponding proteins.
Conclusions: CombAlign is a new Python code that generates a one-to-many, gapped, multiple structure- or sequence-based sequence alignment (MSSA) given a set of pairwise sequence alignments (which may be structure based). CombAlign has utility in assisting the user in distinguishing structurally conserved versus divergent regions on a reference protein structure relative to other closely related proteins. CombAlign was developed in Python 2.6, and the source code is available for download from the GitHub code repository.