Logo: to the web site of Uppsala University

uu.sePublications from Uppsala University
Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Learning Pareto-Efficient Decisions with Confidence
Uppsala University, Disciplinary Domain of Science and Technology, Mathematics and Computer Science, Department of Information Technology, Division of Systems and Control. Uppsala University, Disciplinary Domain of Science and Technology, Mathematics and Computer Science, Department of Information Technology, Artificial Intelligence.ORCID iD: 0000-0003-1303-2901
Uppsala University, Disciplinary Domain of Science and Technology, Mathematics and Computer Science, Department of Information Technology, Division of Systems and Control. Uppsala University, Disciplinary Domain of Science and Technology, Mathematics and Computer Science, Department of Information Technology, Artificial Intelligence.ORCID iD: 0000-0002-6698-0166
Uppsala University, Disciplinary Domain of Science and Technology, Mathematics and Computer Science, Department of Information Technology, Division of Systems and Control. Uppsala University, Disciplinary Domain of Science and Technology, Mathematics and Computer Science, Department of Information Technology, Automatic control.ORCID iD: 0000-0002-7957-3711
2022 (English)In: International Conference on Artificial Intelligence and Statistics / [ed] Camps-Valls, G Ruiz, FJR Valera, I, JMLR-JOURNAL MACHINE LEARNING RESEARCH , 2022, Vol. 151, p. 9969-9981Conference paper, Published paper (Refereed)
Abstract [en]

The paper considers the problem of multi-objective decision support when outcomes are uncertain. We extend the concept of Pareto-efficient decisions to take into account the uncertainty of decision outcomes across varying contexts. This enables quantifying trade-offs between decisions in terms of tail outcomes that are relevant in safety-critical applications. We propose a method for learning efficient decisions with statistical confidence, building on results from the conformal prediction literature. The method adapts to weak or nonexistent context covariate overlap and its statistical guarantees are evaluated using both synthetic and real data.

Place, publisher, year, edition, pages
JMLR-JOURNAL MACHINE LEARNING RESEARCH , 2022. Vol. 151, p. 9969-9981
Series
Proceedings of Machine Learning Research, ISSN 2640-3498
National Category
Probability Theory and Statistics
Identifiers
URN: urn:nbn:se:uu:diva-487888ISI: 000841852304022OAI: oai:DiVA.org:uu-487888DiVA, id: diva2:1710586
Conference
International Conference on Artificial Intelligence and Statistics, MAR 28-30, 2022, ELECTR NETWORK
Funder
Knut and Alice Wallenberg FoundationSwedish Research Council, 2018-05040Swedish Research Council, 2021-05022Available from: 2022-11-14 Created: 2022-11-14 Last updated: 2025-08-22Bibliographically approved
In thesis
1. Machine Learning for Decision-Making: Uncertainty, Inference and Trade-offs
Open this publication in new window or tab >>Machine Learning for Decision-Making: Uncertainty, Inference and Trade-offs
2025 (English)Doctoral thesis, comprehensive summary (Other academic)
Abstract [en]

Machine learning is increasingly used to support decision-making in high-stakes domains such as precision medicine. Unlike traditional predictive models, decision-making models must take into account the effects of future actions that may not be directly observed in the available data. This mismatch between training data and target distribution introduces challenges. In such cases, data may be biased, confounded, or lacking sufficient support to evaluate alternative actions, and standard statistical learning methods can be misleading. This thesis addresses the problem of evaluating and learning decision policies under the above challenges. A central goal is to enable valid predictions about the consequences of implementing new policies, even when the data are incomplete or collected under conditions different from those under which the policy will be applied. We develop methods that explicitly model uncertainty and bias, allowing for valid performance guarantees in these scenarios.

In the first research paper, we focus on multi-objective decision support by learning Pareto-efficient decisions and provide finite-sample guarantees. In the following two research papers, we address policy evaluation: first in the case of observational data, and then in the case of a randomized trial. We propose robust reweighting techniques to evaluate the distributional performance of a given policy. For observational data, where the past policy is unknown, we provide valid performance guarantees under confounding. For randomized controlled trials, we instead provide valid performance guarantees when generalizing the trial results to broader populations. The fourth research paper addresses trade-offs between minimizing treatment risk while reducing harm. We propose a learning method that controls harm in a partially identified setting. In the final research paper, we study decision-making with missing data. Instead of imputing missing values, we propose a method that can handle missingness directly in the policy learning to improve upon a baseline policy.

The thesis is focused on methods that are certified to be statistically valid under credible assumptions. The aim is to make data-driven decision-making in sensitive applications safer and more trustworthy.

Place, publisher, year, edition, pages
Uppsala: Acta Universitatis Upsaliensis, 2025. p. 71
Series
Digital Comprehensive Summaries of Uppsala Dissertations from the Faculty of Science and Technology, ISSN 1651-6214 ; 2574
Keywords
Policy learning, Policy evaluation, Treatment decision policy, Partial identifiability, Risk minimization, Risk control, Causal inference
National Category
Probability Theory and Statistics
Research subject
Machine learning
Identifiers
urn:nbn:se:uu:diva-565537 (URN)978-91-513-2565-1 (ISBN)
Public defence
2025-10-10, 10134, Polhem lecture hall, Lägerhyddsvägen 1, Uppsala, 09:15 (English)
Opponent
Supervisors
Available from: 2025-09-17 Created: 2025-08-22 Last updated: 2025-09-17

Open Access in DiVA

No full text in DiVA

Other links

Abstract

Authority records

Ek, SofiaZachariah, DaveStoica, Peter

Search in DiVA

By author/editor
Ek, SofiaZachariah, DaveStoica, Peter
By organisation
Division of Systems and ControlArtificial IntelligenceAutomatic control
Probability Theory and Statistics

Search outside of DiVA

GoogleGoogle Scholar

urn-nbn

Altmetric score

urn-nbn
Total: 197 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf