Logo: to the web site of Uppsala University

uu.sePublications from Uppsala University
Change search
Link to record
Permanent link

Direct link
Schön, Thomas B., ProfessorORCID iD iconorcid.org/0000-0001-5183-234X
Alternative names
Publications (10 of 155) Show all publications
During, M. A. D., Matelsky, J. K., Gustafsson, F. K., Voeten, D. F. A., Chen, D., Wester, B. A., . . . Schön, T. B. (2025). Automated segmentation of synchrotron-scanned fossils. Fossil Record, 28(1), 103-114
Open this publication in new window or tab >>Automated segmentation of synchrotron-scanned fossils
Show others...
2025 (English)In: Fossil Record, ISSN 2193-0066, Vol. 28, no 1, p. 103-114Article in journal (Refereed) Published
Abstract [en]

Computed tomography has revolutionised the study of the internal three-dimensional structure of fossils. Historically, fossils typically spent years in preparation to be freed from the enclosing rock. Now, X-ray and synchrotron tomography reveal structures that are otherwise invisible, and data acquisition can be fast. However, manual segmentation of these 3D volumes can still take months to years. This is especially challenging for resource-poor teams, as scanning may be free, but the computing power and (AI-assisted) segmentation software required to handle the resulting large data sets are complex to use and expensive.

Here we present a free, browser-based segmentation tool that reduces computational overhead by splitting volumes into small chunks, allowing processing on low-memory, inexpensive hardware. Our tool also speeds up collaborative ground-truth generation and 3D visualisation, all in-browser. We developed and evaluated our pipeline on various open-data scans of differing contrast, resolution, textural complexity, and size. Our tool successfully isolated the Thrinaxodon and Broomistega pair from an Early Triassic burrow. It isolated cranial bones from the Cretaceous acipenseriform Parapsephurus willybemisi on both 45.53 µm and 13.67 µm resolution (voxel size) scanning data. We also isolated bones of the Middle Triassic sauropterygian Nothosaurus and a challenging scan of a squamate embryo inside an egg dating back to the Early Cretaceous. Our tool reliably reproduces expert-supervised segmentation at a fraction of the time and cost, offering greater accessibility than existing tools. Beyond the online tool, all our code is open source, enabling contributions from the palaeontology community to further this emerging machine-learning ecosystem.

Place, publisher, year, edition, pages
Pensoft Publishers, 2025
Keywords
AI-segmentation, Machine Learning, Open Source, Open Access, Propagation Phase-Contrast Synchrotron Radiation Micro-Computed Tomography (PPC-SRµCT), Random Forest
National Category
Computer Sciences Geology
Research subject
Artificial Intelligence
Identifiers
urn:nbn:se:uu:diva-538797 (URN)10.3897/fr.28.e139379 (DOI)001446793800001 ()2-s2.0-86000594308 (Scopus ID)
Funder
Swedish Research Council, 2020-03685Kjell and Marta Beijer Foundation
Note

De två första författarna delar förstaförfattarskapet

Available from: 2024-09-20 Created: 2024-09-20 Last updated: 2025-03-28Bibliographically approved
Pillonetto, G., Aravkin, A., Gedon, D., Ljung, L., Ribeiro, A. H. & Schön, T. B. (2025). Deep networks for system identification: A survey. Automatica, 171, Article ID 111907.
Open this publication in new window or tab >>Deep networks for system identification: A survey
Show others...
2025 (English)In: Automatica, ISSN 0005-1098, E-ISSN 1873-2836, Vol. 171, article id 111907Article in journal (Refereed) Published
Abstract [en]

Deep learning is a topic of considerable current interest. The availability of massive data collections and powerful software resources has led to an impressive amount of results in many application areas that reveal essential but hidden properties of the observations. System identification learns mathematical descriptions of dynamic systems from input-output data and can thus benefit from the advances of deep neural networks to enrich the possible range of models to choose from. For this reason, we provide a survey of deep learning from a system identification perspective. We cover a wide spectrum of topics to enable researchers to understand the methods, providing rigorous practical and theoretical insights into the benefits and challenges of using them. The main aim of the identified model is to predict new data from previous observations. This can be achieved with different deep learning-based modelling techniques and we discuss architectures commonly adopted in the literature, like feedforward, convolutional, and recurrent networks. Their parameters have to be estimated from past data to optimize the prediction performance. For this purpose, we discuss a specific set of first-order optimization tools that have emerged as efficient. The survey then draws connections to the well-studied area of kernel-based methods. They control the data fit by regularization terms that penalize models not in line with prior assumptions. We illustrate how to cast them in deep architectures to obtain deep kernel-based methods. The success of deep learning also resulted in surprising empirical observations, like the counter-intuitive behaviour of models with many parameters. We discuss the role of overparameterized models, including their connection to kernels, as well as implicit regularization mechanisms which affect generalization, specifically the interesting phenomena of benign overfitting and double-descent. Finally, we highlight numerical, computational and software aspects in the area with the help of applied examples.

Place, publisher, year, edition, pages
Elsevier, 2025
National Category
Control Engineering Computer Sciences
Identifiers
urn:nbn:se:uu:diva-540394 (URN)10.1016/j.automatica.2024.111907 (DOI)001322585600001 ()
Funder
Swedish Research Council, 2021-04301Wallenberg AI, Autonomous Systems and Software Program (WASP)Knut and Alice Wallenberg FoundationKjell and Marta Beijer Foundation
Available from: 2024-10-15 Created: 2024-10-15 Last updated: 2024-11-11Bibliographically approved
Strand, M., Wedlin, L. & Schön, T. B. (2025). Rättsliga hinder mot samverkan?: Juridik och ledningsfrågor vid doktorandsamverkan. Stockholm: SNS förlag
Open this publication in new window or tab >>Rättsliga hinder mot samverkan?: Juridik och ledningsfrågor vid doktorandsamverkan
2025 (Swedish)Report (Refereed)
Abstract [sv]

Universitet och högskolor spelar en viktig roll i samhällsutvecklingen, inte minst i samspel med näringslivet. Samverkan mellan akademi och näringsliv innebär en möjlighet för företag att ta del av den senaste forskningen och utveckla sina verksamheter genom forskningsbaserad kunskap. På samma sätt kan universitetens forskning bli mer relevant i samarbete med näringslivet. En vanlig form för samverkan av detta slag är företagsdoktorander, ibland också kallade industridoktorander, det vill säga att en anställd vid ett företag blir doktorand vid ett lärosäte.

Idag utgör företagsdoktorander en stor del av de forskare som har en fot både i näringslivet och i akademin. Andelen företagsdoktorander har ökat under de senaste 15 åren. Många företag och lärosäten skulle dock vilja öka omfattningen av programmen samverkansprogrammen ytterligare, men hindras till följd av brist på finansiering och komplexa avtal. av juridiska svårigheter i avtalsförhandlingarna. En viktig förutsättning för samverkan är att den är varaktig och organiserad. Det är därför problematiskt att det finns svårigheter med upprättande av anställningsavtal, både vad gäller gemensamma regelverk för hur avtalen bör reglerasatt få avtal och hur finansieringen ska lösasfinansiering på plats.

Place, publisher, year, edition, pages
Stockholm: SNS förlag, 2025. p. 75
Keywords
samverkan, företagsdoktorander, industridoktorander, forskning, immateriella rättigheter, samägande, konfidentialitet
National Category
Other Legal Research Criminology Peace and Conflict Studies Other Social Sciences not elsewhere specified
Identifiers
urn:nbn:se:uu:diva-542421 (URN)978-91-89754-60-7 (ISBN)
Available from: 2024-12-06 Created: 2024-12-06 Last updated: 2025-02-20
Strand, M., Wedlin, L. & Schön, T. B. (2025). Underlätta samarbeten mellan företag och universitet. Stockholm
Open this publication in new window or tab >>Underlätta samarbeten mellan företag och universitet
2025 (Swedish)Other (Other (popular science, discussion, etc.))
Place, publisher, year, pages
Stockholm: , 2025. p. 1
Keywords
samverkan, doktorander, ledarskap, standardavtal, immaterialrätt
National Category
Economics and Business
Identifiers
urn:nbn:se:uu:diva-553813 (URN)
Note

Debattartikel i Dagens industri 2025-01-27

Available from: 2025-04-03 Created: 2025-04-03 Last updated: 2025-04-03
Wullt, B., Mattsson, P., Schön, T. B. & Norrlöf, M. (2024). A Model Predictive Control Approach to Motion Planning in Dynamic Environments. In: 2024 European Control Conference (ECC): . Paper presented at 2024 European Control Conference (ECC), 25-28 June, 2024, Stockholm, Sweden (pp. 3247-3254). Institute of Electrical and Electronics Engineers (IEEE)
Open this publication in new window or tab >>A Model Predictive Control Approach to Motion Planning in Dynamic Environments
2024 (English)In: 2024 European Control Conference (ECC), Institute of Electrical and Electronics Engineers (IEEE), 2024, p. 3247-3254Conference paper, Published paper (Refereed)
Abstract [en]

The current state-of-the art motion planners for mobile robots typically do not consider the future movement of moving obstacles. Instead they work by rapid replanning, which makes them reactively adapt to any changes in the environment. This can result in a sub-optimal behavior, which we address in this work by proposing a predictive motion planner that integrates motion predictions into all planning steps. We demonstrate the validity of our approach by evaluating our proposed planner in a dynamic environment where the robot moves slower than the moving obstacles. We benchmark our predictive planner with a reactive planning approach and observe better performance, both in avoiding collisions and maintaining the robots position in the goal region.

Place, publisher, year, edition, pages
Institute of Electrical and Electronics Engineers (IEEE), 2024
National Category
Control Engineering
Identifiers
urn:nbn:se:uu:diva-547371 (URN)10.23919/ecc64448.2024.10591070 (DOI)001290216503001 ()2-s2.0-85200591162 (Scopus ID)978-3-9071-4410-7 (ISBN)979-8-3315-4092-0 (ISBN)
Conference
2024 European Control Conference (ECC), 25-28 June, 2024, Stockholm, Sweden
Funder
Knut and Alice Wallenberg FoundationWallenberg AI, Autonomous Systems and Software Program (WASP)
Available from: 2025-01-15 Created: 2025-01-15 Last updated: 2025-04-15Bibliographically approved
Luo, Z., Gustafsson, F. K., Zhao, Z., Sjölund, J. & Schön, T. B. (2024). Controlling Vision-Language Models for Multi-Task Image Restoration. In: : . Paper presented at The Twelfth International Conference on Learning Representations, Vienna, Austria, May 7, 2024. Vienna, Austria: The International Conference on Learning Representations (ICLR)
Open this publication in new window or tab >>Controlling Vision-Language Models for Multi-Task Image Restoration
Show others...
2024 (English)Conference paper, Poster (with or without abstract) (Other academic)
Abstract [en]

Vision-language models such as CLIP have shown great impact on diverse downstream tasks for zero-shot or label-free predictions. However, when it comes to low-level vision such as image restoration their performance deteriorates dramatically due to corrupted inputs. In this paper, we present a degradation-aware vision-language model (DA-CLIP) to better transfer pretrained vision-language models to low-level vision tasks as a universal framework for image restoration. More specifically, DA-CLIP trains an additional controller that adapts the fixed CLIP image encoder to predict high-quality feature embeddings. By integrating the embedding into an image restoration network via cross-attention, we are able to pilot the model to learn a high-fidelity image reconstruction. The controller itself will also output a degradation feature that matches the real corruptions of the input, yielding a natural classifier for different degradation types. In addition, we construct a mixed degradation dataset with synthetic captions for DA-CLIP training. Our approach advances state-of-the-art performance on both degradation-specific and unified image restoration tasks, showing a promising direction of prompting image restoration with large-scale pretrained vision-language models. Our code is available at https://github. com/Algolzw/daclip-uir.

Place, publisher, year, edition, pages
Vienna, Austria: The International Conference on Learning Representations (ICLR), 2024
National Category
Computer graphics and computer vision
Identifiers
urn:nbn:se:uu:diva-544058 (URN)
Conference
The Twelfth International Conference on Learning Representations, Vienna, Austria, May 7, 2024
Available from: 2024-11-28 Created: 2024-11-28 Last updated: 2025-02-07Bibliographically approved
Zhang, R., Luo, Z., Sjölund, J., Schön, T. B. & Mattsson, P. (2024). Entropy-regularized diffusion policy with Q-ensembles for offline reinforcement learning. In: Advances in Neural Information Processing Systems 37 (NeurIPS 2024): . Paper presented at Neural Information Processing Systems. , 37
Open this publication in new window or tab >>Entropy-regularized diffusion policy with Q-ensembles for offline reinforcement learning
Show others...
2024 (English)In: Advances in Neural Information Processing Systems 37 (NeurIPS 2024), 2024, Vol. 37Conference paper, Published paper (Refereed)
National Category
Other Computer and Information Science
Research subject
Machine learning
Identifiers
urn:nbn:se:uu:diva-545831 (URN)
Conference
Neural Information Processing Systems
Funder
Wallenberg AI, Autonomous Systems and Software Program (WASP)Kjell and Marta Beijer FoundationSwedish Research Council, 2021-04301Swedish Research Council, 2023-04546
Available from: 2024-12-25 Created: 2024-12-25 Last updated: 2025-01-09Bibliographically approved
von Bachmann, P., Gedon, D., Gustafsson, F. K., Ribeiro, A. H., Lampa, E., Gustafsson, S., . . . Schön, T. B. (2024). Evaluating regression and probabilistic methods for ECG-based electrolyte prediction. Scientific Reports, 14(1), Article ID 15273.
Open this publication in new window or tab >>Evaluating regression and probabilistic methods for ECG-based electrolyte prediction
Show others...
2024 (English)In: Scientific Reports, E-ISSN 2045-2322, Vol. 14, no 1, article id 15273Article in journal (Refereed) Published
Abstract [en]

Imbalances in electrolyte concentrations can have severe consequences, but accurate and accessible measurements could improve patient outcomes. The current measurement method based on blood tests is accurate but invasive and time-consuming and is often unavailable for example in remote locations or an ambulance setting. In this paper, we explore the use of deep neural networks (DNNs) for regression tasks to accurately predict continuous electrolyte concentrations from electrocardiograms (ECGs), a quick and widely adopted tool. We analyze our DNN models on a novel dataset of over 290,000 ECGs across four major electrolytes and compare their performance with traditional machine learning models. For improved understanding, we also study the full spectrum from continuous predictions to a binary classification of extreme concentration levels. Finally, we investigate probabilistic regression approaches and explore uncertainty estimates for enhanced clinical usefulness. Our results show that DNNs outperform traditional models but model performance varies significantly across different electrolytes. While discretization leads to good classification performance, it does not address the original problem of continuous concentration level prediction. Probabilistic regression has practical potential, but our uncertainty estimates are not perfectly calibrated. Our study is therefore a first step towards developing an accurate and reliable ECG-based method for electrolyte concentration level prediction—a method with high potential impact within multiple clinical scenarios.

Place, publisher, year, edition, pages
Springer Nature, 2024
Keywords
ECGs, Electrolytes, Probabilistic deep learning, Regression, Uncertainty estimation
National Category
Signal Processing
Identifiers
urn:nbn:se:uu:diva-513725 (URN)10.1038/s41598-024-65223-w (DOI)001262863000061 ()38961109 (PubMedID)
Funder
Uppsala UniversityKjell and Marta Beijer FoundationWallenberg AI, Autonomous Systems and Software Program (WASP)Knut and Alice Wallenberg FoundationEU, Horizon Europe, 101054643Swedish National Infrastructure for Computing (SNIC), sens2020005Swedish National Infrastructure for Computing (SNIC), sens2020598UPPMAXSwedish Research Council, 2018-05973
Note

Title in the list of papers of Fredrik K. Gustafsson's thesis: ECG-Based Electrolyte Prediction: Evaluating Regression and Probabilistic Methods

Available from: 2023-10-10 Created: 2023-10-10 Last updated: 2024-10-23Bibliographically approved
Arteaga, G. Y., Schön, T. B. & Pielawski, N. (2024). Hallucination Detection in LLMs: Fast and Memory-Efficient Finetuned Models. Paper presented at Northern Lights Deep Learning 2025.
Open this publication in new window or tab >>Hallucination Detection in LLMs: Fast and Memory-Efficient Finetuned Models
2024 (English)Manuscript (preprint) (Other academic)
Abstract [en]

Uncertainty estimation is a necessary component when implementing AI in high-risk settings, such as autonomous cars, medicine, or insurances. Large Language Models (LLMs) have seen a surge in popularity in recent years, but they are subject to hallucinations, which may cause serious harm in high-risk settings. Despite their success, LLMs are expensive to train and run: they need a large amount of computations and memory, preventing the use of ensembling methods in practice. In this work, we present a novel method that allows for fast and memory-friendly training of LLM ensembles. We show that the resulting ensembles can detect hallucinations and are a viable approach in practice as only one GPU is needed for training and inference.

Keywords
LLM, AI, Bayesian ensemble
National Category
Computer Systems
Research subject
Computer Science
Identifiers
urn:nbn:se:uu:diva-543911 (URN)
Conference
Northern Lights Deep Learning 2025
Funder
Wallenberg AI, Autonomous Systems and Software Program (WASP)Kjell and Marta Beijer FoundationNational Academic Infrastructure for Supercomputing in Sweden (NAISS), 2022-06725EU, European Research Council, 101054643
Available from: 2024-11-27 Created: 2024-11-27 Last updated: 2024-11-28Bibliographically approved
Galos, P., Hult, L., Zachariah, D., Lewén, A., Hånell, A., Howells, T., . . . Enblad, P. (2024). Machine Learning Based Prediction of Imminent ICP Insults During Neurocritical Care of Traumatic Brain Injury. Neurocritical Care
Open this publication in new window or tab >>Machine Learning Based Prediction of Imminent ICP Insults During Neurocritical Care of Traumatic Brain Injury
Show others...
2024 (English)In: Neurocritical Care, ISSN 1541-6933, E-ISSN 1556-0961Article in journal (Refereed) Epub ahead of print
Abstract [en]

Background

In neurointensive care, increased intracranial pressure (ICP) is a feared secondary brain insult in traumatic brain injury (TBI). A system that predicts ICP insults before they emerge may facilitate early optimization of the physiology, which may in turn lead to stopping the predicted ICP insult from occurring. The aim of this study was to evaluate the performance of different artificial intelligence models in predicting the risk of ICP insults.

Methods

The models were trained to predict risk of ICP insults starting within 30 min, using the Uppsala high frequency TBI dataset. A restricted dataset consisting of only monitoring data were used, and an unrestricted dataset using monitoring data as well as clinical data, demographic data, and radiological evaluations was used. Four different model classes were compared: Gaussian process regression, logistic regression, random forest classifier, and Extreme Gradient Boosted decision trees (XGBoost).

Results

Six hundred and two patients with TBI were included (total monitoring 138,411 h). On the task of predicting upcoming ICP insults, the Gaussian process regression model performed similarly on the Uppsala high frequency TBI dataset (sensitivity 93.2%, specificity 93.9%, area under the receiver operating characteristic curve [AUROC] 98.3%), as in earlier smaller studies. Using a more flexible model (XGBoost) resulted in a comparable performance (sensitivity 93.8%, specificity 94.6%, AUROC 98.7%). Adding more clinical variables and features further improved the performance of the models slightly (XGBoost: sensitivity 94.1%, specificity of 94.6%, AUROC 98.8%).

Conclusions

Artificial intelligence models have potential to become valuable tools for predicting ICP insults in advance during neurointensive care. The fact that common off-the-shelf models, such as XGBoost, performed well in predicting ICP insults opens new possibilities that can lead to faster advances in the field and earlier clinical implementations.

Place, publisher, year, edition, pages
Springer, 2024
Keywords
TBI, AI, Machine learning, Intracranial hypertension, Critical care
National Category
Signal Processing Neurology
Identifiers
urn:nbn:se:uu:diva-533622 (URN)10.1007/s12028-024-02119-7 (DOI)
Funder
Swedish Research Council, 2022-06725Swedish Research Council, 2018-05973Kjell and Marta Beijer FoundationSwedish National Infrastructure for Computing (SNIC)National Academic Infrastructure for Supercomputing in Sweden (NAISS)Uppsala UniversityRegion Uppsala
Note

De två första författarna delar förstaförfattarskapet

Available from: 2024-06-27 Created: 2024-06-27 Last updated: 2024-09-27Bibliographically approved
Projects
Probabilistic modeling of dynamical systems [2013-05524_VR]; Uppsala UniversityLearning flexible models of nonlinear dynamics [2017-03807_VR]; Uppsala UniversityDeep probabilistic regression - new models and learning algorithms [2021-04301_VR]; Uppsala UniversityPrecision Medicine in Traumatic Brain Injury: Transforming Care through Personalized Outcome Predictions and Humanized In Vivo model. [2024-02725_VR]; Uppsala University
Organisations
Identifiers
ORCID iD: ORCID iD iconorcid.org/0000-0001-5183-234X

Search in DiVA

Show all publications

Profile pages

Personal website