Logo: to the web site of Uppsala University

uu.sePublikasjoner fra Uppsala universitet
Endre søk
RefereraExporteraLink to record
Permanent link

Direct link
Referera
Referensformat
  • apa
  • ieee
  • modern-language-association
  • vancouver
  • Annet format
Fler format
Språk
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Annet språk
Fler språk
Utmatningsformat
  • html
  • text
  • asciidoc
  • rtf
Robust machine learning methods
Uppsala universitet, Teknisk-naturvetenskapliga vetenskapsområdet, Matematisk-datavetenskapliga sektionen, Institutionen för informationsteknologi, Avdelningen för systemteknik. Uppsala universitet, Teknisk-naturvetenskapliga vetenskapsområdet, Matematisk-datavetenskapliga sektionen, Institutionen för informationsteknologi, Reglerteknik.ORCID-id: 0000-0002-2294-004X
2022 (engelsk)Doktoravhandling, med artikler (Annet vitenskapelig)
Abstract [en]

We are surrounded by data in our daily lives. The rent of our houses, the amount of electricity units consumed, the prices of different products at a supermarket, the daily temperature, our medicine prescriptions, our internet search history are all different forms of data. Data can be used in a wide range of applications. For example, one can use data to predict product prices in the future; to predict tomorrow's temperature; to recommend videos; or suggest better prescriptions. However in order to do the above, one is required to learn a model from data. A model is a mathematical description of how the phenomena we are interested in behaves e.g. how does the temperature vary? Is it periodic? What kinds of patterns does it have? Machine learning is about this process of learning models from data by building on disciplines such as statistics and optimization. 

Learning models comes with many different challenges. Some challenges are related to how flexible the model is, some are related to the size of data, some are related to computational efficiency etc. One of the challenges is that of data outliers. For instance, due to war in a country exports could stop and there could be a sudden spike in prices of different products. This sudden jump in prices is an outlier or corruption to the normal situation and must be accounted for when learning the model. Another challenge could be that data is collected in one situation but the model is to be used in another situation. For example, one might have data on vaccine trials where the participants were mostly old people. But one might want to make a decision on whether to use the vaccine or not for the whole population that contains people of all age groups. So one must also account for this difference when learning models because the conclusion drawn may not be valid for the young people in the population. Yet another challenge  could arise when data is collected from different sources or contexts. For example, a shopkeeper might have data on sales of paracetamol when there was flu and when there was no flu and she might want to decide how much paracetamol to stock for the next month. In this situation, it is difficult to know whether there will be a flu next month or not and so deciding on how much to stock is a challenge. This thesis tries to address these and other similar challenges.

In paper I, we address the challenge of data corruption i.e., learning models in a robust way when some fraction of the data is corrupted. In paper II, we apply the methodology of paper I to the problem of localization in wireless networks. Paper III addresses the challenge of estimating causal effect between an exposure and an outcome variable from spatially collected data (e.g. whether increasing number of police personnel in an area reduces number of crimes there). Paper IV addresses the challenge of learning improved decision policies e.g. which treatment to assign to which patient given past data on treatment assignments. In paper V, we look at the challenge of learning models when data is acquired from different contexts and the future context is unknown. In paper VI, we address the challenge of predicting count data across space e.g. number of crimes in an area and quantify its uncertainty. In paper VII, we address the challenge of learning models when data points arrive in a streaming fashion i.e., point by point. The proposed method enables online training and also yields some robustness properties.

sted, utgiver, år, opplag, sider
Uppsala: Acta Universitatis Upsaliensis, 2022. , s. 50
Serie
Digital Comprehensive Summaries of Uppsala Dissertations from the Faculty of Science and Technology, ISSN 1651-6214 ; 2147
Emneord [en]
artificial intelligence, machine learning, risk minimization, data corruption, decision policy, conformal methods, data from contexts, online learning, spice, robust, causal inference, point process, localization, distribution uncertainty, treatment rules, quantile treatment, predicting count data
HSV kategori
Forskningsprogram
Elektroteknik med inriktning mot signalbehandling
Identifikatorer
URN: urn:nbn:se:uu:diva-472453ISBN: 978-91-513-1492-1 (tryckt)OAI: oai:DiVA.org:uu-472453DiVA, id: diva2:1651294
Disputas
2022-06-09, 101195, Ångström, Lägerhyddsvägen 1, Uppsala, 13:00 (engelsk)
Opponent
Veileder
Tilgjengelig fra: 2022-05-12 Laget: 2022-04-11 Sist oppdatert: 2022-06-15
Delarbeid
1. Robust Risk Minimization for Statistical Learning From Corrupted Data
Åpne denne publikasjonen i ny fane eller vindu >>Robust Risk Minimization for Statistical Learning From Corrupted Data
2020 (engelsk)Inngår i: IEEE Open Journal of Signal Processing, E-ISSN 2644-1322, Vol. 1, s. 287-294Artikkel i tidsskrift (Fagfellevurdert) Published
Abstract [en]

We consider a general statistical learning problem where an unknown fraction of the training data is corrupted. We develop a robust learning method that only requires specifying an upper bound on the corrupted data fraction. The method minimizes a risk function defined by a non-parametric distribution with unknown probability weights. We derive and analyse the optimal weights and show how they provide robustness against corrupted data. Furthermore, we give a computationally efficient coordinate descent algorithm to solve the risk minimization problem. We demonstrate the wide range applicability of the method, including regression, classification, unsupervised learning and classic parameter estimation, with state-of-the-art performance.

Emneord
Data corruption, Huber contamination model, risk minimization, robustness
HSV kategori
Identifikatorer
urn:nbn:se:uu:diva-429036 (URN)10.1109/OJSP.2020.3039632 (DOI)000722891600021 ()
Forskningsfinansiär
Swedish Research Council, 2017-04610Swedish Research Council, 2018-05040
Tilgjengelig fra: 2020-12-18 Laget: 2020-12-18 Sist oppdatert: 2024-01-08bibliografisk kontrollert
2. Robust localization in wireless networks from corrupted signals
Åpne denne publikasjonen i ny fane eller vindu >>Robust localization in wireless networks from corrupted signals
2021 (engelsk)Inngår i: EURASIP Journal on Advances in Signal Processing, ISSN 1687-6172, E-ISSN 1687-6180, Vol. 2021, nr 1, artikkel-id 79Artikkel i tidsskrift (Fagfellevurdert) Published
Abstract [en]

We address the problem of timing-based localization in wireless networks, when an unknown fraction of data is corrupted by non-ideal propagation conditions. While timing-based techniques can enable accurate localization, they are sensitive to corrupted data. We develop a robust method that is applicable to a range of localization techniques, including time-of-arrival, time-difference-of-arrival and time-difference in schedule-based transmissions. The method is distribution-free, is computationally efficient and requires only an upper bound on the fraction of corrupted data, thus obviating distributional assumptions on the corrupting noise. The robustness of the method is demonstrated in numerical experiments.

sted, utgiver, år, opplag, sider
SpringerSPRINGER, 2021
Emneord
Localization, Robustness, Wireless networks, Time-of-arrival, Time-difference-of-arrival
HSV kategori
Identifikatorer
urn:nbn:se:uu:diva-456481 (URN)10.1186/s13634-021-00786-8 (DOI)000695828100001 ()
Forskningsfinansiär
Swedish Research Council, 2016-06079Swedish Research Council, 2017-04610Swedish Research Council, 2018-05040
Tilgjengelig fra: 2021-10-21 Laget: 2021-10-21 Sist oppdatert: 2024-01-15bibliografisk kontrollert
3. Inferring Heterogeneous Causal Effects in Presence of Spatial Confounding
Åpne denne publikasjonen i ny fane eller vindu >>Inferring Heterogeneous Causal Effects in Presence of Spatial Confounding
2019 (engelsk)Inngår i: Proceedings of the 36th International Conference on Machine Learning, 2019, s. 4942-4950Konferansepaper, Publicerat paper (Fagfellevurdert)
Abstract [en]

We address the problem of inferring the causal effect of an exposure on an outcome across space, using observational data. The data is possibly subject to unmeasured confounding variables which, in a standard approach, must be adjusted for by estimating a nuisance function. Here we develop a method that eliminates the nuisance function, while mitigating the resulting errors-in-variables. The result is a robust and accurate inference method for spatially varying heterogeneous causal effects. The properties of the method are demonstrated on synthetic as well as real data from Germany and the US.

Serie
Proceedings of Machine Learning Research, ISSN 2640-3498 ; 97
HSV kategori
Identifikatorer
urn:nbn:se:uu:diva-429033 (URN)000684034305010 ()
Konferanse
International Conference on Machine Learning (ICML), 9-15 June 2019, Long Beach, California, USA
Forskningsfinansiär
Swedish Research Council, 2018-05040Swedish Foundation for Strategic Research, RIT15-0012Swedish Research Council, 621-2016-06079
Tilgjengelig fra: 2020-12-18 Laget: 2020-12-18 Sist oppdatert: 2022-06-16bibliografisk kontrollert
4. Learning Robust Decision Policies from Observational Data
Åpne denne publikasjonen i ny fane eller vindu >>Learning Robust Decision Policies from Observational Data
2020 (engelsk)Inngår i: Advances in Neural Information Processing Systems 33 (NeurIPS 2020) / [ed] H. Larochelle; M. Ranzato; R. Hadsell; M.F. Balcan; H. Lin, Neural Information Processing Systems, 2020Konferansepaper, Publicerat paper (Fagfellevurdert)
Abstract [en]

We address the problem of learning a decision policy from observational data of past decisions in contexts with features and associated outcomes. The past policy maybe unknown and in safety-critical applications, such as medical decision support, it is of interest to learn robust policies that reduce the risk of outcomes with high costs. In this paper, we develop a method for learning policies that reduce tails of the cost distribution at a specified level and, moreover, provide a statistically valid bound on the cost of each decision. These properties are valid under finite samples -- even in scenarios with uneven or no overlap between features for different decisions in the observed data -- by building on recent results in conformal prediction. The performance and statistical properties of the proposed method are illustrated using both real and synthetic data. 

sted, utgiver, år, opplag, sider
Neural Information Processing Systems, 2020
HSV kategori
Identifikatorer
urn:nbn:se:uu:diva-429039 (URN)001207696401072 ()9781713829546 (ISBN)
Konferanse
34th Conference on Neural Information Processing Systems (NeurIPS 2020), 6-12 December, 2020, Online
Forskningsfinansiär
Swedish Research Council, 2018-05040Knut and Alice Wallenberg FoundationWallenberg AI, Autonomous Systems and Software Program (WASP)
Tilgjengelig fra: 2020-12-18 Laget: 2020-12-18 Sist oppdatert: 2024-12-12bibliografisk kontrollert
5. Robust learning in heterogeneous contexts
Åpne denne publikasjonen i ny fane eller vindu >>Robust learning in heterogeneous contexts
(engelsk)Manuskript (preprint) (Annet vitenskapelig)
Abstract [en]

We consider the problem of learning decision parameters from data obtained in different contexts. When future context information is inaccessible, we consider the resulting (i) worst-case and (ii) overall out-of-sample performance of the learned parameters. We propose a robust approach that trades off these two performance criteria based on the partial information obtained about the unknown context distribution. The proposed method overcomes the overly conservative nature of the minimax method, while robustifying the empirical risk minimization method in a statistically motivated manner. We illustrate the performance of the method in a classification task.

HSV kategori
Identifikatorer
urn:nbn:se:uu:diva-472095 (URN)
Tilgjengelig fra: 2022-04-05 Laget: 2022-04-05 Sist oppdatert: 2026-04-24
6. Prediction of Spatial Point Processes: Regularized Method with Out-of-Sample Guarantees
Åpne denne publikasjonen i ny fane eller vindu >>Prediction of Spatial Point Processes: Regularized Method with Out-of-Sample Guarantees
2019 (engelsk)Inngår i: ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019) / [ed] Wallach, H Larochelle, H Beygelzimer, A d'Alche-Buc, F Fox, E Garnett, R, Neural Information Processing Systems, 2019Konferansepaper, Publicerat paper (Fagfellevurdert)
Abstract [en]

A spatial point process can be characterized by an intensity function which predicts the number of events that occur across space. In this paper, we develop a method to infer predictive intensity intervals by learning a spatial model using a regularized criterion. We prove that the proposed method exhibits out-of-sample prediction performance guarantees which, unlike standard estimators, are valid even when the spatial model is misspecified. The method is demonstrated using synthetic as well as real spatial data.

sted, utgiver, år, opplag, sider
Neural Information Processing Systems, 2019
Serie
Advances in Neural Information Processing Systems, ISSN 1049-5258 ; 32
HSV kategori
Identifikatorer
urn:nbn:se:uu:diva-418895 (URN)000535866903054 ()
Konferanse
33rd Conference on Neural Information Processing Systems (NeurIPS), DEC 08-14, 2019, Vancouver, CANADA
Forskningsfinansiär
Swedish Research Council, 2017 -04610Swedish Research Council, 2018 -05040
Tilgjengelig fra: 2020-09-09 Laget: 2020-09-09 Sist oppdatert: 2022-04-11bibliografisk kontrollert
7. Online Learning for Prediction via Covariance Fitting: Computation, Performance and Robustness
Åpne denne publikasjonen i ny fane eller vindu >>Online Learning for Prediction via Covariance Fitting: Computation, Performance and Robustness
2023 (engelsk)Inngår i: Transactions on Machine Learning Research, E-ISSN 2835-8856, nr (01/2023Artikkel i tidsskrift (Fagfellevurdert) Published
Abstract [en]

We consider the online learning of linear smoother predictors based on a covariance model of the outcomes. To control its degrees of freedom in an appropriate manner, the covariance model parameters are often learned using cross-validation or maximum-likelihood techniques. However, neither technique is suitable when training data arrives in a streaming fashion. Here we consider a covariance-fitting method to learn the model parameters, initially used  in spectral estimation. We show that this results in a computation efficient online learning method in which the resulting predictor can be updated sequentially. We prove that, with high probability, its out-of-sample error approaches the minimum achievable level at root-n rate. Moreover, we show that the resulting predictor enjoys two different robustness properties. First, it minimizes the out-of-sample error with respect to the least favourable distribution within a given Wasserstein distance from the empirical distribution. Second, it is robust against errors in the covariate training data. We illustrate the performance of the proposed method in a numerical experiment.

sted, utgiver, år, opplag, sider
Transactions on Machine Learning Research, 2023
HSV kategori
Identifikatorer
urn:nbn:se:uu:diva-472451 (URN)2-s2.0-105000033546 (Scopus ID)
Forskningsfinansiär
Wallenberg AI, Autonomous Systems and Software Program (WASP)Knut and Alice Wallenberg FoundationKjell and Marta Beijer Foundation
Tilgjengelig fra: 2022-04-11 Laget: 2022-04-11 Sist oppdatert: 2026-02-25bibliografisk kontrollert

Open Access i DiVA

UUThesis_M-Osama-2022(1194 kB)1177 nedlastinger
Filinformasjon
Fil FULLTEXT01.pdfFilstørrelse 1194 kBChecksum SHA-512
0d0707e4d987d7fe280bf3bcb496654002b76f645f6f276f1290948341a2110b01b95ba6b0e1cbf2e1ac6bb0abaf17c5f20d5522d8c2c7bead917735198a3ca4
Type fulltextMimetype application/pdf

Person

Osama, Muhammad

Søk i DiVA

Av forfatter/redaktør
Osama, Muhammad
Av organisasjonen

Søk utenfor DiVA

GoogleGoogle Scholar
Totalt: 1179 nedlastinger
Antall nedlastinger er summen av alle nedlastinger av alle fulltekster. Det kan for eksempel være tidligere versjoner som er ikke lenger tilgjengelige

isbn
urn-nbn

Altmetric

isbn
urn-nbn
Totalt: 1494 treff
RefereraExporteraLink to record
Permanent link

Direct link
Referera
Referensformat
  • apa
  • ieee
  • modern-language-association
  • vancouver
  • Annet format
Fler format
Språk
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Annet språk
Fler språk
Utmatningsformat
  • html
  • text
  • asciidoc
  • rtf