Applicability Domain Dependent Predictive Uncertainty in QSAR Regressions

Author:
  • Ullrika Sahlin
  • N. Jeliazkova
  • T. Oberg
Publishing year: 2014
Language: English
Pages: 26-35
Publication/Series: Molecular Informatics
Volume: 33
Issue: 1
Document type: Journal article
Publisher: John Wiley & Sons

Abstract

Predictive models used in decision making, such as QSARs in chemical regulation or drug discovery, call for evaluated approaches to quantitatively assess the uncertainty associated with predictions. Uncertainty in less reliable predictions may be captured by locally varying predictive errors. In the current study, model-based bootstrapping was combined with analogy reasoning to generate predictive distributions varying in magnitude over a model's domain of applicability. A resampling experiment based on PLS regressions on four QSAR data sets demonstrated that predictive errors assessed by k nearest neighbour or weighted PRedicted Error Sum of Squares (PRESS) on samples of external test data or by internal cross-validation improved the performance of the uncertainty assessment. Analogy using similarity defined by Euclidean distances, or differences in standard deviation in perturbed predictions, resulted in better performance than similarity defined by distance to, or density of, the training data. Locally assessed predictive distributions had on average at least as good coverage as a Gaussian distribution with variance assessed from the PRESS. R code is provided that evaluates the performance of the suggested algorithms for assessing predictive error, based on log likelihood scores and empirical coverage graphs, and that applies these to derive confidence intervals or samples from the predictive distributions of query compounds.
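To illustrate the idea of a locally varying predictive error, the following is a minimal Python sketch of one ingredient named in the abstract: a k nearest neighbour error estimate using Euclidean distance in descriptor space. It is not the authors' R code and does not include the model-based bootstrapping step. The function names, the choice of k, the toy data, and the use of a Gaussian interval around the point prediction are all assumptions made for illustration.

```python
import math
from statistics import NormalDist


def knn_local_rmse(query, cal_X, cal_residuals, k=3):
    """Root mean squared residual over the k calibration compounds
    closest to `query` (Euclidean distance in descriptor space).

    Hypothetical helper: `cal_X` holds descriptor vectors of compounds
    with known prediction errors (e.g. external test data or
    cross-validation residuals), `cal_residuals` the matching errors.
    """
    by_distance = sorted(
        (math.dist(query, x), r) for x, r in zip(cal_X, cal_residuals)
    )
    nearest = by_distance[:k]
    return math.sqrt(sum(r * r for _, r in nearest) / len(nearest))


def local_interval(prediction, local_rmse, level=0.95):
    """Symmetric Gaussian interval around a point prediction, with the
    spread set by the locally assessed error (an assumed form)."""
    z = NormalDist().inv_cdf(0.5 + level / 2)
    return prediction - z * local_rmse, prediction + z * local_rmse


# Toy calibration data (invented): small errors near the origin,
# larger errors in a sparsely modelled region of descriptor space.
cal_X = [(0, 0), (0, 1), (1, 0), (5, 5), (5, 6), (6, 5)]
cal_residuals = [0.1, -0.1, 0.1, 1.0, -1.0, 1.0]

rmse_near = knn_local_rmse((0.2, 0.2), cal_X, cal_residuals, k=3)
rmse_far = knn_local_rmse((5.2, 5.2), cal_X, cal_residuals, k=3)

print(local_interval(2.0, rmse_near))
print(local_interval(2.0, rmse_far))
```

With these toy numbers, the query near the well-predicted compounds receives a narrower interval than the query in the region where calibration errors were larger, which is the behaviour a domain-dependent uncertainty estimate is meant to capture.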

Keywords

  • Earth and Related Environmental Sciences
  • Predictive error
  • Variance
  • Reliability
  • Bootstrap
  • Risk assessment

Other

Published
  • ISSN: 1868-1751
E-mail: ullrika.sahlin [at] cec.lu.se

Researcher

Centre for Environmental and Climate Research (CEC)