Introducing uncertainty in predictive modeling: Friend or foe?
2012 (English)In: Journal of Chemical Information and Modeling, ISSN 1549-9596, Vol. 52, no 11, 2815-2822 p.Article in journal (Refereed) Published
Uncertainty was introduced to chemical descriptors of 16 publicly available data sets to various degrees and in various ways in order to investigate the effect on the predictive performance of the state-of-the-art method decision tree ensembles. A number of strategies to handle uncertainty in decision tree ensembles were evaluated. The main conclusion of the study is that uncertainty to a large extent may be introduced in chemical descriptors without impairing the predictive performance of ensembles and without the predictive performance being significantly reduced from a practical point of view. The investigation further showed that even when distributions of uncertain values were provided, the ensembles method could generate equally effective models from single-point samples from these distributions. Hence, there seems to be no advantage in using more elaborate methods for handling uncertainty in chemical descriptors when using decision tree ensembles as a modeling method for the considered types of introduced uncertainty.
Place, publisher, year, edition, pages
2012. Vol. 52, no 11, 2815-2822 p.
IdentifiersURN: urn:nbn:se:uu:diva-188119DOI: 10.1021/ci3003446ISI: 000311461400003OAI: oai:DiVA.org:uu-188119DiVA: diva2:576658