PuSH - Publikationsserver des Helmholtz Zentrums München

Brandmaier, S. ; Sahlin, U.* ; Tetko, I.V. ; Öberg, T.*

PLS-optimal: A stepwise D-optimal design based on latent variables.

J. Chem. Inf. Model. 52, 975-983 (2012)
Verlagsversion Volltext DOI PMC
Open Access Green möglich sobald Postprint bei der ZB eingereicht worden ist.
Several applications, such as risk assessment within REACH or drug discovery, require reliable methods for the design of experiments and efficient testing strategies. Keeping the number of experiments as low as possible is important from both a financial and an ethical point of view, as exhaustive testing of compounds requires significant financial resources and animal lives. With a large initial set of compounds, experimental design techniques can be used to select a representative subset for testing. Once measured, these compounds can be used to develop quantitative structure activity relationship models to predict properties of the remaining compounds. This reduces the required resources and time. D-Optimal design is frequently used to select an optimal set of compounds by analyzing data variance. We developed a new sequential approach to apply a D-Optimal design to latent variables derived from a partial least squares (PLS) model instead of principal components. The stepwise procedure selects a new set of molecules to be measured after each previous measurement cycle. We show that application of the D-Optimal selection generates models with a significantly improved performance on four different data sets with end points relevant for REACH. Compared to those derived from principal components, PLS models derived from the selection on latent variables had a lower root-mean-square error and a higher Q2 and R2. This improvement is statistically significant, especially for the small number of compounds selected.
Impact Factor
Scopus SNIP
Web of Science
Times Cited
Scopus
Cited By
Altmetric
4.675
1.747
15
20
Tags
Anmerkungen
Besondere Publikation
Auf Hompepage verbergern

Zusatzinfos bearbeiten
Eigene Tags bearbeiten
Privat
Eigene Anmerkung bearbeiten
Privat
Auf Publikationslisten für
Homepage nicht anzeigen
Als besondere Publikation
markieren
Publikationstyp Artikel: Journalartikel
Dokumenttyp Wissenschaftlicher Artikel
Schlagwörter TETRAHYMENA-PYRIFORMIS; REPRESENTATIVE SUBSET; APPLICABILITY DOMAIN; PRINCIPAL COMPONENTS; MULTIVARIATE DESIGN; COMPOUND SELECTION; QSAR; RECONSTRUCTION; PREDICTION; TOXICITY
Sprache
Veröffentlichungsjahr 2012
HGF-Berichtsjahr 2012
ISSN (print) / ISBN 0021-9576
e-ISSN 1520-5142
Quellenangaben Band: 52, Heft: 4, Seiten: 975-983 Artikelnummer: , Supplement: ,
Verlag American Chemical Society (ACS)
Begutachtungsstatus Peer reviewed
POF Topic(s) 30203 - Molecular Targets and Therapies
Forschungsfeld(er) Enabling and Novel Technologies
PSP-Element(e) G-503000-001
PubMed ID 22462577
Scopus ID 84862021618
Erfassungsdatum 2012-07-23