Open this publication in new window or tab >>2025 (English)Doctoral thesis, comprehensive summary (Other academic)
Abstract [en]
Machine learning is increasingly used to support decision-making in high-stakes domains such as precision medicine. Unlike traditional predictive models, decision-making models must take into account the effects of future actions that may not be directly observed in the available data. This mismatch between training data and target distribution introduces challenges. In such cases, data may be biased, confounded, or lacking sufficient support to evaluate alternative actions, and standard statistical learning methods can be misleading. This thesis addresses the problem of evaluating and learning decision policies under the above challenges. A central goal is to enable valid predictions about the consequences of implementing new policies, even when the data are incomplete or collected under conditions different from those under which the policy will be applied. We develop methods that explicitly model uncertainty and bias, allowing for valid performance guarantees in these scenarios.
In the first research paper, we focus on multi-objective decision support by learning Pareto-efficient decisions and provide finite-sample guarantees. In the following two research papers, we address policy evaluation: first in the case of observational data, and then in the case of a randomized trial. We propose robust reweighting techniques to evaluate the distributional performance of a given policy. For observational data, where the past policy is unknown, we provide valid performance guarantees under confounding. For randomized controlled trials, we instead provide valid performance guarantees when generalizing the trial results to broader populations. The fourth research paper addresses trade-offs between minimizing treatment risk while reducing harm. We propose a learning method that controls harm in a partially identified setting. In the final research paper, we study decision-making with missing data. Instead of imputing missing values, we propose a method that can handle missingness directly in the policy learning to improve upon a baseline policy.
The thesis is focused on methods that are certified to be statistically valid under credible assumptions. The aim is to make data-driven decision-making in sensitive applications safer and more trustworthy.
Place, publisher, year, edition, pages
Uppsala: Acta Universitatis Upsaliensis, 2025. p. 71
Series
Digital Comprehensive Summaries of Uppsala Dissertations from the Faculty of Science and Technology, ISSN 1651-6214 ; 2574
Keywords
Policy learning, Policy evaluation, Treatment decision policy, Partial identifiability, Risk minimization, Risk control, Causal inference
National Category
Probability Theory and Statistics
Research subject
Machine learning
Identifiers
urn:nbn:se:uu:diva-565537 (URN)978-91-513-2565-1 (ISBN)
Public defence
2025-10-10, 10134, Polhem lecture hall, Lägerhyddsvägen 1, Uppsala, 09:15 (English)
Opponent
Supervisors
2025-09-172025-08-222025-09-17