Open Access Green as soon as Postprint is submitted to ZB.
Nonlinear dimensionality reduction for visualizing toxicity data: Distance-based versus topology-based approaches.
ChemMedChem 9, 1047-1059 (2014)
Over the years, a number of dimensionality reduction techniques have been proposed and used in chemoinformatics to perform nonlinear mappings. In this study, four representatives of nonlinear dimensionality reduction methods related to two different families were analyzed: distance-based approaches (Isomap and Diffusion Maps) and topology-based approaches (Generative Topographic Mapping (GTM) and Laplacian Eigenmaps). The considered methods were applied for the visualization of three toxicity datasets by using four sets of descriptors. Two methods, GTM and Diffusion Maps, were identified as the best approaches, which thus made it impossible to prioritize a single family of the considered dimensionality reduction methods. The intrinsic dimensionality assessment of data was performed by using the Maximum Likelihood Estimation. It was observed that descriptor sets with a higher intrinsic dimensionality contributed maps of lower quality. A new statistical coefficient, which combines two previously known ones, was proposed to automatically rank the maps. Instead of relying on one of the best methods, we propose to automatically generate maps with different parameter values for different descriptor sets. By following this procedure, the maps with the highest values of the introduced statistical coefficient can be automatically selected and used as a starting point for visual inspection by the user.
Impact Factor
Scopus SNIP
Web of Science
Times Cited
Times Cited
Scopus
Cited By
Cited By
Altmetric
3.046
0.844
7
7
Annotations
Special Publikation
Hide on homepage
Publication type
Article: Journal article
Document type
Scientific Article
Keywords
Chemography ; Chemoinformatics ; Dimensionality Reduction ; Drug Design ; Topographic Mapping; Drug-induced Phospholipidosis; Structure-activity Landscapes; Classification Models; Compound Libraries; Activity-cliffs; Chemical Space; Discovery; Identification; Descriptors; Performance
Language
english
Publication Year
2014
HGF-reported in Year
2014
ISSN (print) / ISBN
1860-7179
e-ISSN
1860-7187
Journal
ChemMedChem
Quellenangaben
Volume: 9,
Issue: 5,
Pages: 1047-1059
Publisher
Wiley
Publishing Place
Weinheim
Reviewing status
Peer reviewed
Institute(s)
Institute of Structural Biology (STB)
POF-Topic(s)
30203 - Molecular Targets and Therapies
Research field(s)
Enabling and Novel Technologies
PSP Element(s)
G-503000-003
PubMed ID
24729490
WOS ID
WOS:000335001700020
Scopus ID
84899905848
Scopus ID
84899042118
Erfassungsdatum
2014-05-05