Rapid similarity search of proteins using alignments of domain arrangements

Terrapon, Nicolas; Weiner, January; Grath, Sonja; Moore, Andrew D.; Bornberg-Bauer, Erich G.

Research article (journal) | Peer reviewed

Abstract

Motivation: Homology search methods are dominated by the central paradigm that sequence similarity is a proxy for common ancestry and, by extension, functional similarity. To determine sequence similarity in proteins, the most widely used methods rely on models of sequence evolution and compare amino-acid strings in search of conserved linear stretches. Probabilistic models or sequence profiles capture the position-specific variation in an alignment of homologous sequences and can identify conserved motifs or domains. While profile-based search methods are generally more accurate than simple sequence comparison methods, they tend to be computationally more demanding. In recent years, several methods have emerged that perform protein similarity searches based on domain composition. However, few methods have considered the linear arrangements of domains when conducting similarity searches, despite strong evidence that domain order can harbour considerable functional and evolutionary signal.

Results: Here, we introduce an alignment scheme that uses a classical dynamic programming approach to the global alignment of domains. We illustrate that representing proteins as strings of domains (domain arrangements) and comparing these strings globally allows for a homology search that is both fast and sensitive. Further, we demonstrate that the presented methods complement existing methods by finding similar proteins missed by popular amino-acid-based comparison methods.

Availability: An implementation of the presented algorithms, a web-based interface as well as a command-line program for batch searching against the UniProt database can be found at http://rads.uni-muenster.de. Furthermore, we provide a JAVA API for programmatic access to domain-string-based search methods. © 2013 The Author.
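
Illustration: The idea of globally aligning two domain arrangements with classical dynamic programming can be sketched as follows. This is a minimal Needleman-Wunsch-style example written for this record, not the authors' implementation and not the JAVA API mentioned above. The function name align_domains, the match, mismatch and gap scores, and the domain names in the usage note are all assumptions chosen for illustration; the published method may score domain pairs and gaps differently.

```python
from typing import List, Sequence, Tuple

GAP = "-"  # placeholder symbol for a gap in the aligned arrangement


def align_domains(
    a: Sequence[str],
    b: Sequence[str],
    match: int = 2,      # assumed score for identical domain identifiers
    mismatch: int = -1,  # assumed score for differing domain identifiers
    gap: int = -1,       # assumed linear gap penalty per skipped domain
) -> Tuple[int, List[Tuple[str, str]]]:
    """Globally align two domain strings; return (score, aligned pairs)."""
    n, m = len(a), len(b)

    # score[i][j] = best score for aligning a[:i] with b[:j]
    score = [[0] * (m + 1) for _ in range(n + 1)]
    for i in range(1, n + 1):
        score[i][0] = i * gap
    for j in range(1, m + 1):
        score[0][j] = j * gap

    for i in range(1, n + 1):
        for j in range(1, m + 1):
            sub = match if a[i - 1] == b[j - 1] else mismatch
            score[i][j] = max(
                score[i - 1][j - 1] + sub,  # align domain with domain
                score[i - 1][j] + gap,      # domain of a against a gap
                score[i][j - 1] + gap,      # domain of b against a gap
            )

    # Trace back from the bottom-right cell to recover one optimal alignment.
    pairs: List[Tuple[str, str]] = []
    i, j = n, m
    while i > 0 or j > 0:
        if i > 0 and j > 0:
            sub = match if a[i - 1] == b[j - 1] else mismatch
            if score[i][j] == score[i - 1][j - 1] + sub:
                pairs.append((a[i - 1], b[j - 1]))
                i, j = i - 1, j - 1
                continue
        if i > 0 and score[i][j] == score[i - 1][j] + gap:
            pairs.append((a[i - 1], GAP))
            i -= 1
        else:
            pairs.append((GAP, b[j - 1]))
            j -= 1
    pairs.reverse()
    return score[n][m], pairs
```

For example, calling align_domains(["SH3", "SH2", "Pkinase"], ["SH2", "Pkinase"]) with the default scores aligns SH3 against a gap and pairs the two shared domains. Because the alphabet is a short list of domain identifiers rather than hundreds of amino acids, the dynamic programming table stays small, which is consistent with the speed argument made in the abstract.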

Publication details

Journal: Bioinformatics
Volume: 30
Issue: 2
Page range: 281
Status: Published
Year of publication: 2014 (01.01.2014)
Language in which the publication is written: English
DOI: 10.1093/bioinformatics/btt379

Authors from the University of Münster

Bornberg-Bauer, Erich
Bioinformatics Group (Prof. Bornberg-Bauer)
Grath, Sonja
Institute for Evolution and Biodiversity (IEB)
Moore, Andrew
Bioinformatics Group (Prof. Bornberg-Bauer)
Terrapon, Nicolas
Institute for Evolution and Biodiversity (IEB)