Logo: to the web site of Uppsala University

uu.sePublications from Uppsala University
Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Learning Proposals for Practical Energy-Based Regression
Uppsala University, Disciplinary Domain of Science and Technology, Mathematics and Computer Science, Department of Information Technology, Division of Systems and Control. Uppsala University, Disciplinary Domain of Science and Technology, Mathematics and Computer Science, Department of Information Technology, Artificial Intelligence.ORCID iD: 0000-0001-5456-5515
Swiss Fed Inst Technol, Comp Vis Lab, Zurich, Switzerland..
Uppsala University, Disciplinary Domain of Science and Technology, Mathematics and Computer Science, Department of Information Technology, Division of Systems and Control. Uppsala University, Disciplinary Domain of Science and Technology, Mathematics and Computer Science, Department of Information Technology, Artificial Intelligence.ORCID iD: 0000-0001-5183-234X
2022 (English)In: International conference on artificial intelligence and statistics, vol 151 / [ed] Camps-Valls, G Ruiz, FJR Valera, I, JMLR-JOURNAL MACHINE LEARNING RESEARCH , 2022, Vol. 151, p. 4685-4704Conference paper, Published paper (Refereed)
Abstract [en]

Energy-based models (EBMs) have experienced a resurgence within machine learning in recent years, including as a promising alternative for probabilistic regression. However, energy-based regression requires a proposal distribution to be manually designed for training, and an initial estimate has to be provided at test-time. We address both of these issues by introducing a conceptually simple method to automatically learn an effective proposal distribution, which is parameterized by a separate network head. To this end, we derive a surprising result, leading to a unified training objective that jointly minimizes the KL divergence from the proposal to the EBM, and the negative log-likelihood of the EBM. At test-time, we can then employ importance sampling with the trained proposal to efficiently evaluate the learned EBM and produce standalone predictions. Furthermore, we utilize our derived training objective to learn mixture density networks (MDNs) with a jointly trained energy-based teacher, consistently outperforming conventional MDN training on four real-world regression tasks within computer vision. Code is available at https://github.com/fregu856/ebms_proposals.

Place, publisher, year, edition, pages
JMLR-JOURNAL MACHINE LEARNING RESEARCH , 2022. Vol. 151, p. 4685-4704
Series
Proceedings of Machine Learning Research, ISSN 2640-3498
National Category
Business Administration
Identifiers
URN: urn:nbn:se:uu:diva-482673ISI: 000828072704035OAI: oai:DiVA.org:uu-482673DiVA, id: diva2:1696106
Conference
International Conference on Artificial Intelligence and Statistics, MAR 28-30, 2022, ELECTR NETWORK
Funder
Swedish Foundation for Strategic Research, RIT15-0012Swedish Research Council, 621-2016-06079Kjell and Marta Beijer FoundationAvailable from: 2022-09-15 Created: 2022-09-15 Last updated: 2023-10-10Bibliographically approved
In thesis
1. Towards Accurate and Reliable Deep Regression Models
Open this publication in new window or tab >>Towards Accurate and Reliable Deep Regression Models
2023 (English)Doctoral thesis, comprehensive summary (Other academic)
Abstract [en]

Regression is a fundamental machine learning task with many important applications within computer vision and other domains. In general, it entails predicting continuous targets from given inputs. Deep learning has become the dominant paradigm within machine learning in recent years, and a wide variety of different techniques have been employed to solve regression problems using deep models. There is however no broad consensus on how deep regression models should be constructed for best possible accuracy, or how the uncertainty in their predictions should be represented and estimated. 

These open questions are studied in this thesis, aiming to help take steps towards an ultimate goal of developing deep regression models which are both accurate and reliable enough for real-world deployment within medical applications and other safety-critical domains.

The first main contribution of the thesis is the formulation and development of energy-based probabilistic regression. This is a general and conceptually simple regression framework with a clear probabilistic interpretation, using energy-based models to represent the true conditional target distribution. The framework is applied to a number of regression problems and demonstrates particularly strong performance for 2D bounding box regression, improving the state-of-the-art when applied to the task of visual tracking.

The second main contribution is a critical evaluation of various uncertainty estimation methods. A general introduction to the problem of estimating the predictive uncertainty of deep models is first provided, together with an extensive comparison of the two popular methods ensembling and MC-dropout. A number of regression uncertainty estimation methods are then further evaluated, specifically examining their reliability under real-world distribution shifts. This evaluation uncovers important limitations of current methods and serves as a challenge to the research community. It demonstrates that more work is required in order to develop truly reliable uncertainty estimation methods for regression.

Place, publisher, year, edition, pages
Uppsala: Acta Universitatis Upsaliensis, 2023. p. 61
Series
Digital Comprehensive Summaries of Uppsala Dissertations from the Faculty of Science and Technology, ISSN 1651-6214 ; 2320
Keywords
Machine Learning, Deep Learning, Regression, Probabilistic Models, Energy-Based Models, Uncertainty Estimation
National Category
Signal Processing
Research subject
Machine learning
Identifiers
urn:nbn:se:uu:diva-513727 (URN)978-91-513-1925-4 (ISBN)
Public defence
2023-11-30, 101121 Sonja Lyttkens, Ångströmlaboratoriet, Lägerhyddsvägen 1, Uppsala, 09:15 (English)
Opponent
Supervisors
Available from: 2023-11-07 Created: 2023-10-10 Last updated: 2023-11-07

Open Access in DiVA

No full text in DiVA

Other links

Learning Proposals for Practical Energy-Based Regression

Authority records

Gustafsson, Fredrik K.Schön, Thomas B.

Search in DiVA

By author/editor
Gustafsson, Fredrik K.Schön, Thomas B.
By organisation
Division of Systems and ControlArtificial Intelligence
Business Administration

Search outside of DiVA

GoogleGoogle Scholar

urn-nbn

Altmetric score

urn-nbn
Total: 144 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf