Logo: to the web site of Uppsala University

uu.sePublications from Uppsala University
Change search
Link to record
Permanent link

Direct link
Publications (10 of 10) Show all publications
Ju, L. (2026). Robust Learning from Distributed and Heterogeneous Data. (Doctoral dissertation). Uppsala: Acta Universitatis Upsaliensis
Open this publication in new window or tab >>Robust Learning from Distributed and Heterogeneous Data
2026 (English)Doctoral thesis, comprehensive summary (Other academic)
Abstract [en]

Modern machine learning is increasingly expanding beyond centralized, mono-modal training toward systems that must also learn from data across distributed edge devices and heterogeneous data modalities. This transition breaks the foundational identical and independent distribution (i.i.d.) assumptions of traditional models, making robustness a first-class requirement for real-world applications. This thesis studies the mechanisms and methodologies necessary to achieve algorithmic robustness across three intersecting dimensions: distributed optimization, geometry-aware uncertainty quantification, and simulation-based inference.

The first dimension addresses statistical heterogeneity in Federated Learning (FL), a distributed training framework in which multiple participants collaboratively train a shared model without exchanging their local data. In FL, the non-i.i.d. nature of distributed data often induces performance degradation, convergence issues and fairness problems. Through an empirical study on drug discovery and the development of new algorithms, this work demonstrates that adaptive optimization and dynamic hyperparameter adjustment can mitigate training instabilities. These methods ensure equitable performance across diverse data silos, preventing the global model from favoring specific participants.

The second dimension explores the structural challenges of multi-modal language models, which map data of heterogeneous modalities onto complex, non-Euclidean manifolds. This research models aleatoric and epistemic uncertainty with directional distributions via parametric models and Riemannian Flow Matching. This geometry-aware approach allows models to respect the intrinsic geometric structure of the embedding space, providing a mathematically grounded framework for models to quantify their ignorance when confronted with ambiguous or out-of-distribution inputs.

The final dimension addresses the robustness of a unified framework which supports both forward and inverse processes for Bayesian inference. The proposed framework utilizes a unified Flow Matching model to learn the joint distribution of parameters and observations. By employing randomized masking, this architecture robustly handles partially observed or noisy data, integrating forward and inverse processes into a single cohesive neural network without the need for specialized retraining.  Collectively, this thesis contributes theoretical analyses, novel algorithms, and empirical validations that advance the robustness of machine learning across federated optimization, multi-modal uncertainty quantification, and simulation-based inference, bridging the gap between idealized training assumptions and the demands of real-world applications.

Place, publisher, year, edition, pages
Uppsala: Acta Universitatis Upsaliensis, 2026. p. 62
Series
Digital Comprehensive Summaries of Uppsala Dissertations from the Faculty of Science and Technology, ISSN 1651-6214 ; 2665
Keywords
Machine Learning, Distributed Optimization, Federated Learning, Probabilistic Modeling, Multi-modal Learning
National Category
Artificial Intelligence
Research subject
Computer Science
Identifiers
urn:nbn:se:uu:diva-583490 (URN)978-91-513-2816-4 (ISBN)
Public defence
2026-06-04, 101195, Heinz-Otto Kreiss, Regementsvägen 10, Uppsala, 09:15 (English)
Opponent
Supervisors
Available from: 2026-05-07 Created: 2026-03-31 Last updated: 2026-05-13
Ju, L., Andersson, M., Fredriksson, S., Glöckner, E., Hellander, A., Vats, E. & Singh, P. (2025). Exploiting the Asymmetric Uncertainty Structure of Pre-trained VLMs on the Unit Hypersphere. In: : . Paper presented at 39th Conference on Neural Information Processing Systems (NeurIPS 2025).
Open this publication in new window or tab >>Exploiting the Asymmetric Uncertainty Structure of Pre-trained VLMs on the Unit Hypersphere
Show others...
2025 (English)Conference paper, Published paper (Refereed)
National Category
Artificial Intelligence Computer graphics and computer vision Computational Mathematics
Research subject
Machine learning; Scientific Computing
Identifiers
urn:nbn:se:uu:diva-572611 (URN)
Conference
39th Conference on Neural Information Processing Systems (NeurIPS 2025)
Funder
Swedish Research Council, 2023-05167Swedish Research Council, 2023-05593Kjell and Marta Beijer FoundationKnut and Alice Wallenberg Foundation
Available from: 2025-12-07 Created: 2025-12-07 Last updated: 2026-03-31Bibliographically approved
Ju, L., Zhang, T., Toor, S. & Hellander, A. (2024). Accelerating Fair Federated Learning: Adaptive Federated Adam. IEEE Transactions on Machine Learning in Communications and Networking, 2, 1017-1032
Open this publication in new window or tab >>Accelerating Fair Federated Learning: Adaptive Federated Adam
2024 (English)In: IEEE Transactions on Machine Learning in Communications and Networking, E-ISSN 2831-316X, Vol. 2, p. 1017-1032Article in journal (Refereed) Published
Abstract [en]

Federated learning is a distributed and privacy-preserving approach to train a statistical model collaboratively from decentralized data held by different parties. However, when the datasets are not independent and identically distributed, models trained by naive federated algorithms may be biased towards certain participants, and model performance across participants is non-uniform. This is known as the fairness problem in federated learning. In this paper, we formulate fairness-controlled federated learning as a dynamical multi-objective optimization problem to ensure the fairness and convergence with theoretical guarantee. To solve the problem efficiently, we study the convergence and bias of Adam as the server optimizer in federated learning, and propose Adaptive Federated Adam (AdaFedAdam) to accelerate fair federated learning with alleviated bias. We validated the effectiveness, Pareto optimality and robustness of AdaFedAdam with numerical experiments and show that AdaFedAdam outperforms existing algorithms, providing better convergence and fairness properties of the federated scheme.

Place, publisher, year, edition, pages
Institute of Electrical and Electronics Engineers (IEEE), 2024
Keywords
Distributed learning, fair machine learning, federated learning, federated optimization
National Category
Computer Sciences
Research subject
Machine learning
Identifiers
urn:nbn:se:uu:diva-544327 (URN)10.1109/tmlcn.2024.3423648 (DOI)001487803200002 ()
Projects
eSSENCE - An eScience Collaboration
Funder
National Academic Infrastructure for Supercomputing in Sweden (NAISS), 2023/22-1048eSSENCE - An eScience CollaborationUppsala University
Available from: 2024-12-03 Created: 2024-12-03 Last updated: 2026-03-31Bibliographically approved
Li, S., Ngai, E. C. H., Ye, F., Ju, L., Zhang, T. & Voigt, T. (2024). Blades: A Unified Benchmark Suite for Byzantine Attacks and Defenses in Federated Learning. In: 2024 IEEE/ACM Ninth International Conference on Internet-of-Things Design and Implementation (IoTDI): . Paper presented at 9th ACM/IEEE Conference on Internet of Things Design and Implementation (IoTDI), May 13-16, 2024, Hong Kong, Hong Kong (pp. 158-169). Institute of Electrical and Electronics Engineers (IEEE)
Open this publication in new window or tab >>Blades: A Unified Benchmark Suite for Byzantine Attacks and Defenses in Federated Learning
Show others...
2024 (English)In: 2024 IEEE/ACM Ninth International Conference on Internet-of-Things Design and Implementation (IoTDI), Institute of Electrical and Electronics Engineers (IEEE), 2024, p. 158-169Conference paper, Published paper (Refereed)
Abstract [en]

Federated learning (FL) facilitates distributed training across different IoT and edge devices, safeguarding the privacy of their data. The inherent distributed structure of FL introduces vulnerabilities, especially from adversarial devices aiming to skew local updates to their advantage. Despite the plethora of research focusing on Byzantine-resilient FL, the academic community has yet to establish a comprehensive benchmark suite, pivotal for impartial assessment and comparison of different techniques. This paper presents Blades, a scalable, extensible, and easily configurable benchmark suite that supports researchers and developers in efficiently implementing and validating novel strategies against baseline algorithms in Byzantine-resilient FL. Blades contains built-in implementations of representative attack and defense strategies and offers a user-friendly interface that seamlessly integrates new ideas. Using Blades, we re-evaluate representative attacks and defenses on wide-ranging experimental configurations (approximately 1,500 trials in total). Through our extensive experiments, we gained new insights into FL robustness and highlighted previously overlooked limitations due to the absence of thorough evaluations and comparisons of baselines under various attack settings. We maintain the source code and documents at https://github.com/lishenghui/blades.

Place, publisher, year, edition, pages
Institute of Electrical and Electronics Engineers (IEEE), 2024
Keywords
Byzantine attacks, distributed learning, federated learning, IoT, neural networks, robustness
National Category
Computer Sciences
Identifiers
urn:nbn:se:uu:diva-537577 (URN)10.1109/IoTDI61053.2024.00018 (DOI)001261370500014 ()2-s2.0-85196568437 (Scopus ID)979-8-3503-7025-6 (ISBN)979-8-3503-7026-3 (ISBN)
Conference
9th ACM/IEEE Conference on Internet of Things Design and Implementation (IoTDI), May 13-16, 2024, Hong Kong, Hong Kong
Funder
Swedish Research Council, 2017-04543
Available from: 2024-09-05 Created: 2024-09-05 Last updated: 2025-02-11Bibliographically approved
Li, S., Ngait, E.-H. C. -., Ye, F., Ju, L., Zhang, T. & Voigt, T. (2024). Demo Abstract: Blades: A Unified Benchmark Suite for Byzantine-Resilient in Federated Learning. In: 9TH ACM/IEEE CONFERENCE ON INTERNET OF THINGS DESIGN AND IMPLEMENTATION, IOTDI 2024: . Paper presented at 9th ACM/IEEE Conference on Internet of Things Design and Implementation (IoTDI), MAY 13-16, 2024, Hong Kong, PEOPLES R CHINA (pp. 229-230). IEEE Computer Society
Open this publication in new window or tab >>Demo Abstract: Blades: A Unified Benchmark Suite for Byzantine-Resilient in Federated Learning
Show others...
2024 (English)In: 9TH ACM/IEEE CONFERENCE ON INTERNET OF THINGS DESIGN AND IMPLEMENTATION, IOTDI 2024, IEEE Computer Society, 2024, p. 229-230Conference paper, Published paper (Refereed)
Abstract [en]

Federated learning (FL) facilitates distributed training across different IoT and edge devices, safeguarding the privacy of their data. The inherently distributed nature of FL introduces vulnerabilities, especially from adversarial devices aiming to skew local updates to their desire. Despite the plethora of research focusing on Byzantine-resilient FL, the academic conununity has yet to establish a comprehensive benchmark suite, pivotal for the assessment and comparison of different techniques. This demonstration presents Blades, a scalable, extensible, and easily configurable benchmark suite that supports researchers and developers in efficiently implementing and validating strategies against baseline algorithms in Byzantine-resilient FL.

Place, publisher, year, edition, pages
IEEE Computer Society, 2024
Keywords
Byzantine attacks, distributed learning, federated learning, IoT, neural networks, robustness
National Category
Computer Sciences
Identifiers
urn:nbn:se:uu:diva-537570 (URN)10.1109/IoTDI61053.2024.00030 (DOI)001261370500026 ()979-8-3503-7025-6 (ISBN)979-8-3503-7026-3 (ISBN)
Conference
9th ACM/IEEE Conference on Internet of Things Design and Implementation (IoTDI), MAY 13-16, 2024, Hong Kong, PEOPLES R CHINA
Available from: 2024-09-05 Created: 2024-09-05 Last updated: 2024-09-05Bibliographically approved
Ju, L., Hellander, A. & Spjuth, O. (2024). Federated learning for predicting compound mechanism of action based on image-data from cell painting. Artificial Intelligence in the Life Sciences, 5, Article ID 100098.
Open this publication in new window or tab >>Federated learning for predicting compound mechanism of action based on image-data from cell painting
2024 (English)In: Artificial Intelligence in the Life Sciences, E-ISSN 2667-3185, Vol. 5, article id 100098Article in journal (Refereed) Published
Abstract [en]

Having access to sufficient data is essential in order to train accurate machine learning models, but much data is not publicly available. In drug discovery this is particularly evident, as much data is withheld at pharmaceutical companies for various reasons. Federated Learning (FL) aims at training a joint model between multiple parties but without disclosing data between the parties. In this work, we leverage Federated Learning to predict compound Mechanism of Action (MoA) using fluorescence image data from cell painting. Our study evaluates the effectiveness and efficiency of FL, comparing to non-collaborative and data-sharing collaborative learning in diverse scenarios. Specifically, we investigate the impact of data heterogeneity across participants on MoA prediction, an essential concern in real-life applications of FL, and demonstrate the benefits for all involved parties. This work highlights the potential of federated learning in multi-institutional collaborative machine learning for drug discovery and assessment of chemicals, offering a promising avenue to overcome data-sharing constraints.

Place, publisher, year, edition, pages
Elsevier, 2024
Keywords
Federated learning, Cell profiling, Cell painting, Artificial intelligence, Collaborative learning
National Category
Computer and Information Sciences
Research subject
Scientific Computing; Machine learning
Identifiers
urn:nbn:se:uu:diva-544328 (URN)10.1016/j.ailsci.2024.100098 (DOI)001333825000001 ()2-s2.0-85193044761 (Scopus ID)
Projects
eSSENCE - An eScience Collaboration
Funder
Uppsala UniversityeSSENCE - An eScience CollaborationSwedish Research Council, 2020-03731Swedish Research Council, 2020-01865Swedish Research Council Formas, 2022-00940Swedish Cancer Society, 22 2412EU, Horizon Europe, 101057014EU, Horizon Europe, 101057442Swedish Research Council, 2022-06725
Available from: 2024-12-03 Created: 2024-12-03 Last updated: 2026-03-31Bibliographically approved
Ju, L., Singh, P. & Toor, S. (2021). Proactive Autoscaling for Edge Computing Systems with Kubernetes. In: : . Paper presented at 2021 IEEE/ACM 14th International Conference on Utility and Cloud Computing (UCC ’21) Companion, December 6-9, Leicester, UK. Association for Computing Machinery (ACM)
Open this publication in new window or tab >>Proactive Autoscaling for Edge Computing Systems with Kubernetes
2021 (English)Conference paper, Published paper (Refereed)
Abstract [en]

With the emergence of the Internet of Things and 5G technologies, the edge computing paradigm is playing increasingly important roles with better availability, latency-control and performance. However, existing autoscaling tools for edge computing applications do not utilize heterogeneous resources of edge systems efficiently, leaving scope for performance improvement. In this work, we propose a Proactive Pod Autoscaler (PPA) for edge computing applications on Kubernetes. The proposed PPA is able to forecast workloads in advance with multiple user-defined/customized metrics and to scale edge computing applications up and down correspondingly. The PPA is optimized and evaluated on an example CPU-intensive edge computing application further. It can be concluded that the proposed PPA outperforms the default pod autoscaler of Kubernetes on both efficiency of resource utilization and application performance. The article also highlights future possible improvements on the proposed PPA.

Place, publisher, year, edition, pages
Association for Computing Machinery (ACM), 2021
Keywords
edge computing, autoscaling, Kubernetes, proactive autoscaling
National Category
Computer Sciences
Identifiers
urn:nbn:se:uu:diva-464378 (URN)10.1145/3492323.3495588 (DOI)000938695200002 ()
Conference
2021 IEEE/ACM 14th International Conference on Utility and Cloud Computing (UCC ’21) Companion, December 6-9, Leicester, UK
Funder
eSSENCE - An eScience Collaboration
Available from: 2022-01-14 Created: 2022-01-14 Last updated: 2023-05-16Bibliographically approved
Ju, L., Nautiyal, M., Hellander, A., Vats, E. & Singh, P.Epistemic Uncertainty Quantification for Pre-trained VLMs via Riemannian Flow Matching.
Open this publication in new window or tab >>Epistemic Uncertainty Quantification for Pre-trained VLMs via Riemannian Flow Matching
Show others...
(English)Manuscript (preprint) (Other academic)
Abstract [en]

Vision-Language Models (VLMs) are typically deterministic in nature and lack intrinsic mechanisms to quantify epistemic uncertainty, which reflects the model's lack of knowledge or ignorance of its own representations. We theoretically motivate negative log-density of an embedding as a proxy for the epistemic uncertainty, where low-density regions signify model ignorance. The proposed method REPVLM computes the probability density on the hyperspherical manifold of the VLM embeddings using Riemannian Flow Matching. We empirically demonstrate that REPVLM achieves near-perfect correlation between uncertainty and prediction error, significantly outperforming existing baselines. Beyond classification, we also demonstrate that the model also provides a scalable metric for out-of-distribution detection and automated data curation.

National Category
Computer Vision and Learning Systems
Identifiers
urn:nbn:se:uu:diva-582516 (URN)
Available from: 2026-03-17 Created: 2026-03-17 Last updated: 2026-03-31Bibliographically approved
Zhang, T., Ju, L., Singh, P. & Toor, S.InfoHier: Hierarchical Information Extraction via Encoding and Embedding.
Open this publication in new window or tab >>InfoHier: Hierarchical Information Extraction via Encoding and Embedding
(English)Manuscript (preprint) (Other academic)
Abstract [en]

Analyzing large-scale datasets, especially involving complex and high-dimensional data like images, is particularly challenging. While self-supervised learning (SSL) has proven effective for learning representations from unlabeled data, it typically focuses on flat, non-hierarchical structures, missing the multi-level relationships present in many real-world datasets. Hierarchical clustering (HC) can uncover these relationships by organizing data into a tree-like structure, but it often relies on rigid similarity metrics that struggle to capture the complexity of diverse data types. To address these we envision InfoHier, a framework that combines SSL with HC to jointly learn robust latent representations and hierarchical structures. This approach leverages SSL to provide adaptive representations, enhancing HC's ability to capture complex patterns. Simultaneously, it integrates HC loss to refine SSL training, resulting in representations that are more attuned to the underlying information hierarchy. InfoHier has the potential to improve the expressiveness and performance of both clustering and representation learning, offering significant benefits for data analysis, management, and information retrieval.

Keywords
Hierarchical Representation, Hierarchical Clustering, Self-Supervised Learning, Joint Learning, Information Retrieval
National Category
Computer Sciences Information Systems
Research subject
Computer Science
Identifiers
urn:nbn:se:uu:diva-544717 (URN)
Available from: 2024-12-08 Created: 2024-12-08 Last updated: 2024-12-18Bibliographically approved
Nautiyal, M., Ju, L., Ernfors, M., Hagland, K., Holma, V., Werkö Söderholm, M., . . . Singh, P.OneFlowSBI: One Model, Many Queries for Simulation-Based Inference.
Open this publication in new window or tab >>OneFlowSBI: One Model, Many Queries for Simulation-Based Inference
Show others...
(English)Manuscript (preprint) (Other academic)
Abstract [en]

We introduce OneFlowSBI, a unified framework for simulation-based inference that learns a single flow-matching generative model over the joint distribution of parameters and observations. Leveraging a query-aware masking distribution during training, the same model supports multiple inference tasks, including posterior sampling, likelihood estimation, and arbitrary conditional distributions, without task-specific retraining. We evaluate OneFlowSBI on ten benchmark inference problems and two high-dimensional real-world inverse problems across multiple simulation budgets. OneFlowSBI is shown to deliver competitive performance against state-of-the-art generalized inference solvers and specialized posterior estimators, while enabling efficient sampling with few ODE integration steps and remaining robust under noisy and partially observed data.

National Category
Computer Sciences
Identifiers
urn:nbn:se:uu:diva-582537 (URN)
Available from: 2026-03-18 Created: 2026-03-18 Last updated: 2026-03-31Bibliographically approved
Organisations
Identifiers
ORCID iD: ORCID iD iconorcid.org/0000-0001-9500-1791

Search in DiVA

Show all publications