Menu

Javascript is not activated in your browser. This website needs javascript activated to work properly.
You are here

PLS-Optimal: A Stepwise D-Optimal Design Based on Latent Variables

Author:
  • Stefan Brandmaier
  • Ullrika Sahlin
  • Igor V. Tetko
  • Tomas Oberg
Publishing year: 2012
Language: English
Pages: 975-983
Publication/Series: Journal of Chemical Information and Modeling
Volume: 52
Issue: 4
Document type: Journal article
Publisher: The American Chemical Society

Abstract english

Several applications, such as risk assessment within REACH or drug discovery, require reliable methods for the design of experiments and efficient testing strategies. Keeping the number of experiments as low as possible is important from both a financial and an ethical point of view, as exhaustive testing of compounds requires significant financial resources and animal lives. With a large initial set of compounds, experimental design techniques can be used to select a representative subset for testing. Once measured, these compounds can be used to develop quantitative structure activity relationship models to predict properties of the remaining compounds. This reduces the required resources and time. D-Optimal design is frequently used to select an optimal set of compounds by analyzing data variance. We developed a new sequential approach to apply a D-Optimal design to latent variables derived from a partial least squares (PLS) model instead of principal components. The stepwise procedure selects a new set of molecules to be measured after each previous measurement cycle. We show that application of the D-Optimal selection generates models with a significantly improved performance on four different data sets with end points relevant for REACH. Compared to those derived from principal components, PLS models derived from the selection on latent variables had a lower root-mean-square error and a higher Q2 and R2. This improvement is statistically significant, especially for the small number of compounds selected.

Keywords

  • Earth and Related Environmental Sciences

Other

Published
  • ISSN: 1549-960X
me in Lundagård
E-mail: ullrika.sahlin [at] cec.lu.se

Researcher

Centre for Environmental and Climate Research (CEC)

+46 46 222 68 31

+46 73 827 44 32

E-D340

50

Read more on the blog

Bayes@Lund2017 19-12 April

2017-03-23
The program for Bayes@Lund2017 is now ready Don’t forget to sign up – and be fast to not miss out! We start in room MA7, Maths b…

Workshop on Bayesian Networks for risk assessment and decision making

2017-02-08
You most welcome to attend a workshop on Bayesian Networks in risk assessment and decision making in Lund March 28 and 29th, 2017. This work…

Studentprojekt: Tuggummin som en indikator för nedskräpning i urbana miljöer

2017-02-06
Studenter sökes för ett projekt under våren 2017 Detta projekt går ut på att ta reda på hur förekomst av tuggummin förhåller sig till föreko…

Centre for Environmental and Climate Research, CEC

Sölvegatan 37
223 62 Lund, Sweden

Visiting address
The Ecology building, Sölvegatan 37, Lund