SIMAP - the database of all-against-all protein sequence similarities and annotations with new interfaces and increased coverage.
Nucleic Acids Res. 42, D279-D284 (2014)
The Similarity Matrix of Proteins (SIMAP, http://mips.gsf.de/simap/) database has been designed to massively accelerate computationally expensive protein sequence analysis tasks in bioinformatics. It provides pre-calculated sequence similarities interconnecting the entire known protein sequence universe, complemented by pre-calculated protein features and domains, similarity clusters and functional annotations. SIMAP covers all major public protein databases as well as many consistently re-annotated metagenomes from different repositories. As of September 2013, SIMAP contains >163 million proteins corresponding to similar to 70 million non-redundant sequences. SIMAP uses the sensitive FASTA search heuristics, the Smith-Waterman alignment algorithm, the InterPro database of protein domain models and the BLAST2GO functional annotation algorithm. SIMAP assists biologists by facilitating the interactive exploration of the protein sequence universe. Web-Service and DAS interfaces allow connecting SIMAP with any other bioinformatic tool and resource. All-against-all protein sequence similarity matrices of project-specific protein collections are generated on request. Recent improvements allow SIMAP to cover the rapidly growing sequenced protein sequence universe. New Web-Service interfaces enhance the connectivity of SIMAP. Novel tools for interactive extraction of protein similarity networks have been added. Open access to SIMAP is provided through the web portal; the portal also contains instructions and links for software access and flat file downloads.
Impact Factor
Scopus SNIP
Web of Science
Times Cited
Scopus
Cited By
Altmetric
Publication type
Article: Journal article
Document type
Scientific Article
Thesis type
Editors
Keywords
Substitution Matrices; Visualization; Networks; Construction; Generation; Clusters; Genomes; Family; Tool
Keywords plus
Language
english
Publication Year
2014
Prepublished in Year
HGF-reported in Year
2014
ISSN (print) / ISBN
0305-1048
e-ISSN
1362-4962
ISBN
Book Volume Title
Conference Title
Conference Date
Conference Location
Proceedings Title
Quellenangaben
Volume: 42,
Issue: D1,
Pages: D279-D284
Article Number: ,
Supplement: ,
Series
Publisher
Oxford University Press
Publishing Place
Oxford
Day of Oral Examination
0000-00-00
Advisor
Referee
Examiner
Topic
University
University place
Faculty
Publication date
0000-00-00
Application date
0000-00-00
Patent owner
Further owners
Application country
Patent priority
Reviewing status
Peer reviewed
POF-Topic(s)
30505 - New Technologies for Biomedical Discoveries
Research field(s)
Enabling and Novel Technologies
PSP Element(s)
G-503700-001
Grants
Copyright
Erfassungsdatum
2014-03-24