PuSH - Publication Server of Helmholtz Zentrum München

Tetko, I.V. ; Engkvist, O.* ; Koch, U.* ; Reymond, J.L.* ; Chen, H.*

BIGCHEM: Challenges and opportunities for big data analysis in chemistry.

Mol. Inform. 35, 615-621 (2016)
Postprint DOI PMC
Open Access Green
Creative Commons Lizenzvertrag
The increasing volume of biomedical data in chemistry and life sciences requires the development of new methods and approaches for their handling. Here, we briefly discuss some challenges and opportunities of this fast growing area of research with a focus on those to be addressed within the BIGCHEM project. The article starts with a brief description of some available resources for "Big Data" in chemistry and a discussion of the importance of data quality. We then discuss challenges with visualization of millions of compounds by combining chemical and biological data, the expectations from mining the "Big Data" using advanced machine-learning methods, and their applications in polypharmacology prediction and target de-convolution in phenotypic screening. We show that the efficient exploration of billions of molecules requires the development of smart strategies. We also address the issue of secure information sharing without disclosing chemical structures, which is critical to enable bi-party or multi-party data sharing. Data sharing is important in the context of the recent trend of "open innovation" in pharmaceutical industry, which has led to not only more information sharing among academics and pharma industries but also the so-called "precompetitive" collaboration between pharma companies. At the end we highlight the importance of education in "Big Data" for further progress of this area.
Impact Factor
Scopus SNIP
Web of Science
Times Cited
Scopus
Cited By
Altmetric
1.570
0.596
60
67
Tags
Annotations
Special Publikation
Hide on homepage

Edit extra information
Edit own tags
Private
Edit own annotation
Private
Hide on publication lists
on hompage
Mark as special
publikation
Publication type Article: Journal article
Document type Scientific Article
Keywords Nonlinear Dimensionality Reduction; Interference Compounds Pains; Molecule Frequent Hitters; Drug Discovery; Chemical Space; Assay Interference; Applicability Domain; Compound Libraries; Bioassay Ontology; Natural-products
Language english
Publication Year 2016
HGF-reported in Year 2016
ISSN (print) / ISBN 1868-1743
e-ISSN 1868-1751
Quellenangaben Volume: 35, Issue: 11-12, Pages: 615-621 Article Number: , Supplement: ,
Publisher Wiley
Publishing Place Weinheim
Reviewing status Peer reviewed
POF-Topic(s) 30203 - Molecular Targets and Therapies
Research field(s) Enabling and Novel Technologies
PSP Element(s) G-503000-003
PubMed ID 27464907
Erfassungsdatum 2016-09-06