Publications from Uppsala University
Publications (6 of 6)
Ek, S. (2025). Machine Learning for Decision-Making: Uncertainty, Inference and Trade-offs. (Doctoral dissertation). Uppsala: Acta Universitatis Upsaliensis
2025 (English) Doctoral thesis, comprehensive summary (Other academic)
Abstract [en]

Machine learning is increasingly used to support decision-making in high-stakes domains such as precision medicine. Unlike traditional predictive models, decision-making models must take into account the effects of future actions that may not be directly observed in the available data. This mismatch between training data and target distribution introduces challenges. In such cases, data may be biased, confounded, or lacking sufficient support to evaluate alternative actions, and standard statistical learning methods can be misleading. This thesis addresses the problem of evaluating and learning decision policies under the above challenges. A central goal is to enable valid predictions about the consequences of implementing new policies, even when the data are incomplete or collected under conditions different from those under which the policy will be applied. We develop methods that explicitly model uncertainty and bias, allowing for valid performance guarantees in these scenarios.

In the first research paper, we focus on multi-objective decision support by learning Pareto-efficient decisions and provide finite-sample guarantees. In the following two research papers, we address policy evaluation: first in the case of observational data, and then in the case of a randomized trial. We propose robust reweighting techniques to evaluate the distributional performance of a given policy. For observational data, where the past policy is unknown, we provide valid performance guarantees under confounding. For randomized controlled trials, we instead provide valid performance guarantees when generalizing the trial results to broader populations. The fourth research paper addresses the trade-off between minimizing treatment risk and reducing harm. We propose a learning method that controls harm in a partially identified setting. In the final research paper, we study decision-making with missing data. Instead of imputing missing values, we propose a method that handles missingness directly in the policy learning to improve upon a baseline policy.

The thesis is focused on methods that are certified to be statistically valid under credible assumptions. The aim is to make data-driven decision-making in sensitive applications safer and more trustworthy.

Place, publisher, year, edition, pages
Uppsala: Acta Universitatis Upsaliensis, 2025. p. 71
Series
Digital Comprehensive Summaries of Uppsala Dissertations from the Faculty of Science and Technology, ISSN 1651-6214 ; 2574
Keywords
Policy learning, Policy evaluation, Treatment decision policy, Partial identifiability, Risk minimization, Risk control, Causal inference
National Category
Probability Theory and Statistics
Research subject
Machine learning
Identifiers
urn:nbn:se:uu:diva-565537 (URN), 978-91-513-2565-1 (ISBN)
Public defence
2025-10-10, 10134, Polhem lecture hall, Lägerhyddsvägen 1, Uppsala, 09:15 (English)
Available from: 2025-09-17 Created: 2025-08-22 Last updated: 2025-09-17
Ek, S. & Zachariah, D. (2024). Externally Valid Policy Evaluation from Randomized Trials Using Additional Observational Data. Paper presented at The Thirty-eighth Annual Conference on Neural Information Processing Systems (NeurIPS).
2024 (English) Conference paper, Published paper (Other academic)
National Category
Probability Theory and Statistics
Identifiers
urn:nbn:se:uu:diva-565528 (URN)
Conference
The Thirty-eighth Annual Conference on Neural Information Processing Systems (NeurIPS)
Available from: 2025-08-22 Created: 2025-08-22 Last updated: 2025-09-15 Bibliographically approved
Ek, S., Zachariah, D., Johansson, F. D. & Stoica, P. (2023). Off-Policy Evaluation with Out-of-Sample Guarantees. Transactions on Machine Learning Research (06/2023)
2023 (English) In: Transactions on Machine Learning Research, E-ISSN 2835-8856, no 06/2023. Article in journal (Refereed) Published
Abstract [en]

We consider the problem of evaluating the performance of a decision policy using past observational data. The outcome of a policy is measured in terms of a loss (a.k.a. disutility or negative reward) and the main problem is making valid inferences about its out-of-sample loss when the past data were observed under a different and possibly unknown policy. Using a sample-splitting method, we show that it is possible to draw such inferences with finite-sample coverage guarantees about the entire loss distribution, rather than just its mean. Importantly, the method takes into account model misspecifications of the past policy, including unmeasured confounding. The evaluation method can be used to certify the performance of a policy using observational data under a specified range of credible model assumptions.
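
As a rough illustration of evaluating a loss distribution rather than only its mean, the following Python sketch computes an importance-weighted quantile of observed losses. It is not the method of the paper: it assumes the past policy is known, so that the weights (hypothetical ratios of target-policy to past-policy action probabilities) are exact, and it provides no finite-sample or confounding guarantees. The function name and the example numbers are illustrative only.

```python
def weighted_loss_quantile(losses, weights, level):
    """Smallest observed loss at which the normalized weighted CDF reaches `level`.

    losses  : losses observed under the past policy
    weights : nonnegative importance weights, one per loss (assumed known here)
    level   : target probability level in (0, 1]
    """
    if len(losses) != len(weights) or not losses:
        raise ValueError("losses and weights must be non-empty and of equal length")
    total = sum(weights)
    if total <= 0:
        raise ValueError("weights must have a positive sum")

    cumulative = 0.0
    # Walk through the losses in increasing order, accumulating weight mass.
    for loss, weight in sorted(zip(losses, weights)):
        cumulative += weight
        if cumulative / total >= level:
            return loss
    # Reached only through floating-point rounding at level == 1.
    return max(losses)


# Illustrative data: three past decisions, the second-largest loss upweighted.
observed_losses = [3.0, 1.0, 2.0]
importance_weights = [1.0, 1.0, 2.0]
median_loss = weighted_loss_quantile(observed_losses, importance_weights, 0.5)
```

With uniform weights this reduces to an ordinary empirical quantile; unequal weights shift probability mass toward the decisions that the target policy would take more often than the past policy did.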

National Category
Probability Theory and Statistics
Identifiers
urn:nbn:se:uu:diva-519244 (URN)
Available from: 2024-01-04 Created: 2024-01-04 Last updated: 2025-08-22 Bibliographically approved
Ek, S., Zachariah, D. & Stoica, P. (2022). Learning Pareto-Efficient Decisions with Confidence. In: Camps-Valls, G., Ruiz, F. J. R. & Valera, I. (Eds.), International Conference on Artificial Intelligence and Statistics. Paper presented at International Conference on Artificial Intelligence and Statistics, MAR 28-30, 2022, ELECTR NETWORK (pp. 9969-9981). JMLR-JOURNAL MACHINE LEARNING RESEARCH, 151
2022 (English) In: International Conference on Artificial Intelligence and Statistics / [ed] Camps-Valls, G., Ruiz, F. J. R. & Valera, I., JMLR-JOURNAL MACHINE LEARNING RESEARCH, 2022, Vol. 151, p. 9969-9981. Conference paper, Published paper (Refereed)
Abstract [en]

The paper considers the problem of multi-objective decision support when outcomes are uncertain. We extend the concept of Pareto-efficient decisions to take into account the uncertainty of decision outcomes across varying contexts. This enables quantifying trade-offs between decisions in terms of tail outcomes that are relevant in safety-critical applications. We propose a method for learning efficient decisions with statistical confidence, building on results from the conformal prediction literature. The method adapts to weak or nonexistent context covariate overlap and its statistical guarantees are evaluated using both synthetic and real data.
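
To make the reference to conformal prediction concrete, here is a minimal Python sketch of the standard split-conformal quantile rule applied to losses. It is a generic textbook construction, not the paper's algorithm: it assumes the calibration losses and the new loss are exchangeable, and the function name and data are hypothetical.

```python
import math


def conformal_upper_bound(calibration_losses, alpha):
    """Upper bound that a new exchangeable loss exceeds with probability at most alpha.

    Uses the ceil((n + 1) * (1 - alpha))-th smallest calibration loss.
    Returns infinity when n is too small to support the requested level.
    """
    if not 0 < alpha < 1:
        raise ValueError("alpha must lie strictly between 0 and 1")
    n = len(calibration_losses)
    rank = math.ceil((n + 1) * (1 - alpha))  # 1-based rank of the order statistic
    if rank > n:
        return math.inf
    return sorted(calibration_losses)[rank - 1]


# Illustrative calibration set of nine losses for one candidate decision.
calibration = [4.0, 1.0, 9.0, 3.0, 7.0, 2.0, 8.0, 5.0, 6.0]
bound_50 = conformal_upper_bound(calibration, 0.5)
bound_95 = conformal_upper_bound(calibration, 0.05)  # nine samples cannot support 95%
```

Computing such a bound per objective and per candidate decision gives tail-outcome summaries that could then be compared for Pareto efficiency; the infinite bound returned for small samples is one simple way a procedure can signal that the data do not support a confident statement.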

Place, publisher, year, edition, pages
JMLR-JOURNAL MACHINE LEARNING RESEARCH, 2022
Series
Proceedings of Machine Learning Research, ISSN 2640-3498
National Category
Probability Theory and Statistics
Identifiers
urn:nbn:se:uu:diva-487888 (URN), 000841852304022 ()
Conference
International Conference on Artificial Intelligence and Statistics, MAR 28-30, 2022, ELECTR NETWORK
Funder
Knut and Alice Wallenberg Foundation; Swedish Research Council, 2018-05040; Swedish Research Council, 2021-05022
Available from: 2022-11-14 Created: 2022-11-14 Last updated: 2025-08-22 Bibliographically approved
Ek, S. & Zachariah, D. Learning Robust Decision Policies with Missing Covariates.
(English) Manuscript (preprint) (Other academic)
National Category
Probability Theory and Statistics
Identifiers
urn:nbn:se:uu:diva-565531 (URN)
Available from: 2025-08-22 Created: 2025-08-22 Last updated: 2025-08-22
Ek, S. & Zachariah, D. Learning Treatment Allocations with Risk Control Under Partial Identifiability.
(English) Manuscript (preprint) (Other academic)
National Category
Probability Theory and Statistics
Identifiers
urn:nbn:se:uu:diva-565529 (URN)
Available from: 2025-08-22 Created: 2025-08-22 Last updated: 2025-08-25 Bibliographically approved
Identifiers
ORCID iD: orcid.org/0000-0003-1303-2901