Logotyp: till Uppsala universitets webbplats

uu.sePublikationer från Uppsala universitet
Ändra sökning
RefereraExporteraLänk till posten
Permanent länk

Direktlänk
Referera
Referensformat
  • apa
  • ieee
  • modern-language-association
  • vancouver
  • Annat format
Fler format
Språk
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Annat språk
Fler språk
Utmatningsformat
  • html
  • text
  • asciidoc
  • rtf
Robust machine learning methods
Uppsala universitet, Teknisk-naturvetenskapliga vetenskapsområdet, Matematisk-datavetenskapliga sektionen, Institutionen för informationsteknologi, Avdelningen för systemteknik. Uppsala universitet, Teknisk-naturvetenskapliga vetenskapsområdet, Matematisk-datavetenskapliga sektionen, Institutionen för informationsteknologi, Reglerteknik.ORCID-id: 0000-0002-2294-004X
2022 (Engelska)Doktorsavhandling, sammanläggning (Övrigt vetenskapligt)
Abstract [en]

We are surrounded by data in our daily lives. The rent of our houses, the amount of electricity units consumed, the prices of different products at a supermarket, the daily temperature, our medicine prescriptions, our internet search history are all different forms of data. Data can be used in a wide range of applications. For example, one can use data to predict product prices in the future; to predict tomorrow's temperature; to recommend videos; or suggest better prescriptions. However in order to do the above, one is required to learn a model from data. A model is a mathematical description of how the phenomena we are interested in behaves e.g. how does the temperature vary? Is it periodic? What kinds of patterns does it have? Machine learning is about this process of learning models from data by building on disciplines such as statistics and optimization. 

Learning models comes with many different challenges. Some challenges are related to how flexible the model is, some are related to the size of data, some are related to computational efficiency etc. One of the challenges is that of data outliers. For instance, due to war in a country exports could stop and there could be a sudden spike in prices of different products. This sudden jump in prices is an outlier or corruption to the normal situation and must be accounted for when learning the model. Another challenge could be that data is collected in one situation but the model is to be used in another situation. For example, one might have data on vaccine trials where the participants were mostly old people. But one might want to make a decision on whether to use the vaccine or not for the whole population that contains people of all age groups. So one must also account for this difference when learning models because the conclusion drawn may not be valid for the young people in the population. Yet another challenge  could arise when data is collected from different sources or contexts. For example, a shopkeeper might have data on sales of paracetamol when there was flu and when there was no flu and she might want to decide how much paracetamol to stock for the next month. In this situation, it is difficult to know whether there will be a flu next month or not and so deciding on how much to stock is a challenge. This thesis tries to address these and other similar challenges.

In paper I, we address the challenge of data corruption i.e., learning models in a robust way when some fraction of the data is corrupted. In paper II, we apply the methodology of paper I to the problem of localization in wireless networks. Paper III addresses the challenge of estimating causal effect between an exposure and an outcome variable from spatially collected data (e.g. whether increasing number of police personnel in an area reduces number of crimes there). Paper IV addresses the challenge of learning improved decision policies e.g. which treatment to assign to which patient given past data on treatment assignments. In paper V, we look at the challenge of learning models when data is acquired from different contexts and the future context is unknown. In paper VI, we address the challenge of predicting count data across space e.g. number of crimes in an area and quantify its uncertainty. In paper VII, we address the challenge of learning models when data points arrive in a streaming fashion i.e., point by point. The proposed method enables online training and also yields some robustness properties.

Ort, förlag, år, upplaga, sidor
Uppsala: Acta Universitatis Upsaliensis, 2022. , s. 50
Serie
Digital Comprehensive Summaries of Uppsala Dissertations from the Faculty of Science and Technology, ISSN 1651-6214 ; 2147
Nyckelord [en]
artificial intelligence, machine learning, risk minimization, data corruption, decision policy, conformal methods, data from contexts, online learning, spice, robust, causal inference, point process, localization, distribution uncertainty, treatment rules, quantile treatment, predicting count data
Nationell ämneskategori
Elektroteknik och elektronik Signalbehandling Sannolikhetsteori och statistik
Forskningsämne
Elektroteknik med inriktning mot signalbehandling
Identifikatorer
URN: urn:nbn:se:uu:diva-472453ISBN: 978-91-513-1492-1 (tryckt)OAI: oai:DiVA.org:uu-472453DiVA, id: diva2:1651294
Disputation
2022-06-09, 101195, Ångström, Lägerhyddsvägen 1, Uppsala, 13:00 (Engelska)
Opponent
Handledare
Tillgänglig från: 2022-05-12 Skapad: 2022-04-11 Senast uppdaterad: 2022-06-15
Delarbeten
1. Robust Risk Minimization for Statistical Learning From Corrupted Data
Öppna denna publikation i ny flik eller fönster >>Robust Risk Minimization for Statistical Learning From Corrupted Data
2020 (Engelska)Ingår i: IEEE Open Journal of Signal Processing, E-ISSN 2644-1322, Vol. 1, s. 287-294Artikel i tidskrift (Refereegranskat) Published
Abstract [en]

We consider a general statistical learning problem where an unknown fraction of the training data is corrupted. We develop a robust learning method that only requires specifying an upper bound on the corrupted data fraction. The method minimizes a risk function defined by a non-parametric distribution with unknown probability weights. We derive and analyse the optimal weights and show how they provide robustness against corrupted data. Furthermore, we give a computationally efficient coordinate descent algorithm to solve the risk minimization problem. We demonstrate the wide range applicability of the method, including regression, classification, unsupervised learning and classic parameter estimation, with state-of-the-art performance.

Nyckelord
Data corruption, Huber contamination model, risk minimization, robustness
Nationell ämneskategori
Sannolikhetsteori och statistik
Identifikatorer
urn:nbn:se:uu:diva-429036 (URN)10.1109/OJSP.2020.3039632 (DOI)000722891600021 ()
Forskningsfinansiär
Vetenskapsrådet, 2017-04610Vetenskapsrådet, 2018-05040
Tillgänglig från: 2020-12-18 Skapad: 2020-12-18 Senast uppdaterad: 2024-01-08Bibliografiskt granskad
2. Robust localization in wireless networks from corrupted signals
Öppna denna publikation i ny flik eller fönster >>Robust localization in wireless networks from corrupted signals
2021 (Engelska)Ingår i: EURASIP Journal on Advances in Signal Processing, ISSN 1687-6172, E-ISSN 1687-6180, Vol. 2021, nr 1, artikel-id 79Artikel i tidskrift (Refereegranskat) Published
Abstract [en]

We address the problem of timing-based localization in wireless networks, when an unknown fraction of data is corrupted by non-ideal propagation conditions. While timing-based techniques can enable accurate localization, they are sensitive to corrupted data. We develop a robust method that is applicable to a range of localization techniques, including time-of-arrival, time-difference-of-arrival and time-difference in schedule-based transmissions. The method is distribution-free, is computationally efficient and requires only an upper bound on the fraction of corrupted data, thus obviating distributional assumptions on the corrupting noise. The robustness of the method is demonstrated in numerical experiments.

Ort, förlag, år, upplaga, sidor
SpringerSPRINGER, 2021
Nyckelord
Localization, Robustness, Wireless networks, Time-of-arrival, Time-difference-of-arrival
Nationell ämneskategori
Signalbehandling
Identifikatorer
urn:nbn:se:uu:diva-456481 (URN)10.1186/s13634-021-00786-8 (DOI)000695828100001 ()
Forskningsfinansiär
Vetenskapsrådet, 2016-06079Vetenskapsrådet, 2017-04610Vetenskapsrådet, 2018-05040
Tillgänglig från: 2021-10-21 Skapad: 2021-10-21 Senast uppdaterad: 2024-01-15Bibliografiskt granskad
3. Inferring Heterogeneous Causal Effects in Presence of Spatial Confounding
Öppna denna publikation i ny flik eller fönster >>Inferring Heterogeneous Causal Effects in Presence of Spatial Confounding
2019 (Engelska)Ingår i: Proceedings of the 36th International Conference on Machine Learning, 2019, s. 4942-4950Konferensbidrag, Publicerat paper (Refereegranskat)
Abstract [en]

We address the problem of inferring the causal effect of an exposure on an outcome across space, using observational data. The data is possibly subject to unmeasured confounding variables which, in a standard approach, must be adjusted for by estimating a nuisance function. Here we develop a method that eliminates the nuisance function, while mitigating the resulting errors-in-variables. The result is a robust and accurate inference method for spatially varying heterogeneous causal effects. The properties of the method are demonstrated on synthetic as well as real data from Germany and the US.

Serie
Proceedings of Machine Learning Research, ISSN 2640-3498 ; 97
Nationell ämneskategori
Sannolikhetsteori och statistik
Identifikatorer
urn:nbn:se:uu:diva-429033 (URN)000684034305010 ()
Konferens
International Conference on Machine Learning (ICML), 9-15 June 2019, Long Beach, California, USA
Forskningsfinansiär
Vetenskapsrådet, 2018-05040Stiftelsen för strategisk forskning (SSF), RIT15-0012Vetenskapsrådet, 621-2016-06079
Tillgänglig från: 2020-12-18 Skapad: 2020-12-18 Senast uppdaterad: 2022-06-16Bibliografiskt granskad
4. Learning Robust Decision Policies from Observational Data
Öppna denna publikation i ny flik eller fönster >>Learning Robust Decision Policies from Observational Data
2020 (Engelska)Ingår i: Advances in Neural Information Processing Systems 33 (NeurIPS 2020) / [ed] H. Larochelle; M. Ranzato; R. Hadsell; M.F. Balcan; H. Lin, Neural Information Processing Systems, 2020Konferensbidrag, Publicerat paper (Refereegranskat)
Abstract [en]

We address the problem of learning a decision policy from observational data of past decisions in contexts with features and associated outcomes. The past policy maybe unknown and in safety-critical applications, such as medical decision support, it is of interest to learn robust policies that reduce the risk of outcomes with high costs. In this paper, we develop a method for learning policies that reduce tails of the cost distribution at a specified level and, moreover, provide a statistically valid bound on the cost of each decision. These properties are valid under finite samples -- even in scenarios with uneven or no overlap between features for different decisions in the observed data -- by building on recent results in conformal prediction. The performance and statistical properties of the proposed method are illustrated using both real and synthetic data. 

Ort, förlag, år, upplaga, sidor
Neural Information Processing Systems, 2020
Nationell ämneskategori
Sannolikhetsteori och statistik
Identifikatorer
urn:nbn:se:uu:diva-429039 (URN)001207696401072 ()9781713829546 (ISBN)
Konferens
34th Conference on Neural Information Processing Systems (NeurIPS 2020), 6-12 December, 2020, Online
Forskningsfinansiär
Vetenskapsrådet, 2018-05040Knut och Alice Wallenbergs StiftelseWallenberg AI, Autonomous Systems and Software Program (WASP)
Tillgänglig från: 2020-12-18 Skapad: 2020-12-18 Senast uppdaterad: 2024-12-12Bibliografiskt granskad
5. Robust learning in heterogeneous contexts
Öppna denna publikation i ny flik eller fönster >>Robust learning in heterogeneous contexts
(Engelska)Manuskript (preprint) (Övrigt vetenskapligt)
Abstract [en]

We consider the problem of learning decision parameters from data obtained in different contexts. When future context information is inaccessible, we consider the resulting (i) worst-case and (ii) overall out-of-sample performance of the learned parameters. We propose a robust approach that trades off these two performance criteria based on the partial information obtained about the unknown context distribution. The proposed method overcomes the overly conservative nature of the minimax method, while robustifying the empirical risk minimization method in a statistically motivated manner. We illustrate the performance of the method in a classification task.

Nationell ämneskategori
Sannolikhetsteori och statistik
Identifikatorer
urn:nbn:se:uu:diva-472095 (URN)
Tillgänglig från: 2022-04-05 Skapad: 2022-04-05 Senast uppdaterad: 2026-04-24
6. Prediction of Spatial Point Processes: Regularized Method with Out-of-Sample Guarantees
Öppna denna publikation i ny flik eller fönster >>Prediction of Spatial Point Processes: Regularized Method with Out-of-Sample Guarantees
2019 (Engelska)Ingår i: ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019) / [ed] Wallach, H Larochelle, H Beygelzimer, A d'Alche-Buc, F Fox, E Garnett, R, Neural Information Processing Systems, 2019Konferensbidrag, Publicerat paper (Refereegranskat)
Abstract [en]

A spatial point process can be characterized by an intensity function which predicts the number of events that occur across space. In this paper, we develop a method to infer predictive intensity intervals by learning a spatial model using a regularized criterion. We prove that the proposed method exhibits out-of-sample prediction performance guarantees which, unlike standard estimators, are valid even when the spatial model is misspecified. The method is demonstrated using synthetic as well as real spatial data.

Ort, förlag, år, upplaga, sidor
Neural Information Processing Systems, 2019
Serie
Advances in Neural Information Processing Systems, ISSN 1049-5258 ; 32
Nationell ämneskategori
Sannolikhetsteori och statistik
Identifikatorer
urn:nbn:se:uu:diva-418895 (URN)000535866903054 ()
Konferens
33rd Conference on Neural Information Processing Systems (NeurIPS), DEC 08-14, 2019, Vancouver, CANADA
Forskningsfinansiär
Vetenskapsrådet, 2017 -04610Vetenskapsrådet, 2018 -05040
Tillgänglig från: 2020-09-09 Skapad: 2020-09-09 Senast uppdaterad: 2022-04-11Bibliografiskt granskad
7. Online Learning for Prediction via Covariance Fitting: Computation, Performance and Robustness
Öppna denna publikation i ny flik eller fönster >>Online Learning for Prediction via Covariance Fitting: Computation, Performance and Robustness
2023 (Engelska)Ingår i: Transactions on Machine Learning Research, E-ISSN 2835-8856, nr (01/2023Artikel i tidskrift (Refereegranskat) Published
Abstract [en]

We consider the online learning of linear smoother predictors based on a covariance model of the outcomes. To control its degrees of freedom in an appropriate manner, the covariance model parameters are often learned using cross-validation or maximum-likelihood techniques. However, neither technique is suitable when training data arrives in a streaming fashion. Here we consider a covariance-fitting method to learn the model parameters, initially used  in spectral estimation. We show that this results in a computation efficient online learning method in which the resulting predictor can be updated sequentially. We prove that, with high probability, its out-of-sample error approaches the minimum achievable level at root-n rate. Moreover, we show that the resulting predictor enjoys two different robustness properties. First, it minimizes the out-of-sample error with respect to the least favourable distribution within a given Wasserstein distance from the empirical distribution. Second, it is robust against errors in the covariate training data. We illustrate the performance of the proposed method in a numerical experiment.

Ort, förlag, år, upplaga, sidor
Transactions on Machine Learning Research, 2023
Nationell ämneskategori
Sannolikhetsteori och statistik Teknik och teknologier
Identifikatorer
urn:nbn:se:uu:diva-472451 (URN)2-s2.0-105000033546 (Scopus ID)
Forskningsfinansiär
Wallenberg AI, Autonomous Systems and Software Program (WASP)Knut och Alice Wallenbergs StiftelseKjell och Märta Beijers Stiftelse
Tillgänglig från: 2022-04-11 Skapad: 2022-04-11 Senast uppdaterad: 2026-02-25Bibliografiskt granskad

Open Access i DiVA

UUThesis_M-Osama-2022(1194 kB)1177 nedladdningar
Filinformation
Filnamn FULLTEXT01.pdfFilstorlek 1194 kBChecksumma SHA-512
0d0707e4d987d7fe280bf3bcb496654002b76f645f6f276f1290948341a2110b01b95ba6b0e1cbf2e1ac6bb0abaf17c5f20d5522d8c2c7bead917735198a3ca4
Typ fulltextMimetyp application/pdf

Person

Osama, Muhammad

Sök vidare i DiVA

Av författaren/redaktören
Osama, Muhammad
Av organisationen
Avdelningen för systemteknikReglerteknik
Elektroteknik och elektronikSignalbehandlingSannolikhetsteori och statistik

Sök vidare utanför DiVA

GoogleGoogle Scholar
Totalt: 1179 nedladdningar
Antalet nedladdningar är summan av nedladdningar för alla fulltexter. Det kan inkludera t.ex tidigare versioner som nu inte längre är tillgängliga.

isbn
urn-nbn

Altmetricpoäng

isbn
urn-nbn
Totalt: 1494 träffar
RefereraExporteraLänk till posten
Permanent länk

Direktlänk
Referera
Referensformat
  • apa
  • ieee
  • modern-language-association
  • vancouver
  • Annat format
Fler format
Språk
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Annat språk
Fler språk
Utmatningsformat
  • html
  • text
  • asciidoc
  • rtf