Rapid similarity search of proteins using alignments of domain arrangements

Terrapon,Nicolas N.,Weiner,January J.,Grath,Sonja S.,Moore,Andrew D. A.D.,Bornberg-Bauer,Erich G. E.G.,

Research article (journal) | Peer reviewed

Abstract

Motivation: Homology search methods are dominated by the central paradigm that sequence similarity is a proxy for common ancestry and, by extension, functional similarity. For determining sequence similarity in proteins, most widely used methods use models of sequence evolution and compare amino-acid strings in search for conserved linear stretches. Probabilistic models or sequence profiles capture the position-specific variation in an alignment of homologous sequences and can identify conserved motifs or domains. While profile-based search methods are generally more accurate than simple sequence comparison methods, they tend to be computationally more demanding. In recent years, several methods have emerged that perform protein similarity searches based on domain composition. However, few methods have considered the linear arrangements of domains when conducting similarity searches, despite strong evidence that domain order can harbour considerable functional and evolutionary signal. Results: Here, we introduce an alignment scheme that uses a classical dynamic programming approach to the global alignment of domains.We illustrate that representing proteins as strings of domains (domain arrangements) and comparing these strings globally allows for a both fast and sensitive homology search. Further, we demonstrate that the presented methods complement existing methods by finding similar proteins missed by popular amino-acid-based comparison methods. Availability: An implementation of the presented algorithms, a web-based interface as well as a command-line program for batch searching against the UniProt database can be found at http://rads. uni-muenster.de. Furthermore, we provide a JAVA API for programmatic access to domain-string-based search methods. © 2013 The Author.

Details about the publication

JournalBioinformatics
Volume30
Issue2
Page range281null
StatusPublished
Release year2014 (01/01/2014)
Language in which the publication is writtenEnglish
DOI10.1093/bioinformatics/btt379

Authors from the University of Münster

Bornberg-Bauer, Erich
Research Group Evolutionary Bioinformatics
Grath, Sonja
Institute for Evolution and Biodiversity (IEB)
Moore, Andrew
Research Group Evolutionary Bioinformatics
Terrapon, Nicolas
Institute for Evolution and Biodiversity (IEB)