Logo: to the web site of Uppsala University

uu.sePublications from Uppsala University
System disruptions
We are currently experiencing disruptions on the search portals due to high traffic. We are working to resolve the issue, you may temporarily encounter an error message.
Change search
Link to record
Permanent link

Direct link
Publications (10 of 81) Show all publications
Ju, L., Zhang, T., Toor, S. & Hellander, A. (2024). Accelerating Fair Federated Learning: Adaptive Federated Adam. IEEE Transactions on Machine Learning in Communications and Networking, 2, 1017-1032
Open this publication in new window or tab >>Accelerating Fair Federated Learning: Adaptive Federated Adam
2024 (English)In: IEEE Transactions on Machine Learning in Communications and Networking, E-ISSN 2831-316X, Vol. 2, p. 1017-1032Article in journal (Refereed) Published
Abstract [en]

Federated learning is a distributed and privacy-preserving approach to train a statistical model collaboratively from decentralized data held by different parties. However, when the datasets are not independent and identically distributed, models trained by naive federated algorithms may be biased towards certain participants, and model performance across participants is non-uniform. This is known as the fairness problem in federated learning. In this paper, we formulate fairness-controlled federated learning as a dynamical multi-objective optimization problem to ensure the fairness and convergence with theoretical guarantee. To solve the problem efficiently, we study the convergence and bias of Adam as the server optimizer in federated learning, and propose Adaptive Federated Adam ( AdaFedAdam ) to accelerate fair federated learning with alleviated bias. We validated the effectiveness, Pareto optimality and robustness of AdaFedAdam with numerical experiments and show that AdaFedAdam outperforms existing algorithms, providing better convergence and fairness properties of the federated scheme.

Place, publisher, year, edition, pages
Institute of Electrical and Electronics Engineers (IEEE), 2024
National Category
Computer Sciences
Research subject
Machine learning
Identifiers
urn:nbn:se:uu:diva-544327 (URN)10.1109/tmlcn.2024.3423648 (DOI)
Projects
eSSENCE - An eScience Collaboration
Available from: 2024-12-03 Created: 2024-12-03 Last updated: 2025-01-07Bibliographically approved
Zhang, T., Gupta, A., Francisco Rodríguez, M. A., Spjuth, O., Hellander, A. & Toor, S. (2024). Data management of scientific applications in a reinforcement learning-based hierarchical storage system. Expert systems with applications, 237, Article ID 121443.
Open this publication in new window or tab >>Data management of scientific applications in a reinforcement learning-based hierarchical storage system
Show others...
2024 (English)In: Expert systems with applications, ISSN 0957-4174, E-ISSN 1873-6793, Vol. 237, article id 121443Article in journal (Refereed) Published
Abstract [en]

In many areas of data-driven science, large datasets are generated where the individual data objects are images, matrices, or otherwise have a clear structure. However, these objects can be information-sparse, and a challenge is to efficiently find and work with the most interesting data as early as possible in an analysis pipeline. We have recently proposed a new model for big data management where the internal structure and information of the data are associated with each data object (as opposed to simple metadata). There is then an opportunity for comprehensive data management solutions to account for data-specific internal structure as well as access patterns. In this article, we explore this idea together with our recently proposed hierarchical storage management framework that uses reinforcement learning (RL) for autonomous and dynamic data placement in different tiers in a storage hierarchy. Our case-study is based on four scientific datasets: Protein translocation microscopy images, Airfoil angle of attack meshes, 1000 Genomes sequences, and Phenotypic screening images. The presented results highlight that our framework is optimal and can quickly adapt to new data access requirements. It overall reduces the data processing time, and the proposed autonomous data placement is superior compared to any static or semi-static data placement policies.

Place, publisher, year, edition, pages
Elsevier, 2024
Keywords
Data management, Scientific application, Hierarchical storage system, Reinforcement learning, Large scientific datasets
National Category
Computer Sciences Computational Mathematics
Research subject
Computer Science with specialization in Database Technology; Computer Science
Identifiers
urn:nbn:se:uu:diva-513854 (URN)10.1016/j.eswa.2023.121443 (DOI)001081909200001 ()
Funder
Swedish Foundation for Strategic Research, BD15-0008Swedish National Infrastructure for Computing (SNIC), SNIC 2022/22-835eSSENCE - An eScience Collaboration
Available from: 2023-10-12 Created: 2023-10-12 Last updated: 2024-12-08Bibliographically approved
Zhang, T., Gupta, A., Francisco Rodríguez, M. A., Spjuth, O., Hellander, A. & Toor, S. (2024). Data management of scientific applications in a reinforcement learning-based hierarchical storage system. Expert systems with applications, 237, 121443-121443, Article ID 121443.
Open this publication in new window or tab >>Data management of scientific applications in a reinforcement learning-based hierarchical storage system
Show others...
2024 (English)In: Expert systems with applications, ISSN 0957-4174, E-ISSN 1873-6793, Vol. 237, p. 121443-121443, article id 121443Article in journal (Refereed) Published
National Category
Computer Sciences
Identifiers
urn:nbn:se:uu:diva-552596 (URN)10.1016/j.eswa.2023.121443 (DOI)
Funder
Swedish Foundation for Strategic Research, BD15-0008Swedish National Infrastructure for Computing (SNIC), SNIC 2022/22-835eSSENCE - An eScience Collaboration
Available from: 2025-03-17 Created: 2025-03-17 Last updated: 2025-03-17
Ju, L., Hellander, A. & Spjuth, O. (2024). Federated learning for predicting compound mechanism of action based on image-data from cell painting. Artificial Intelligence in the Life Sciences, 5, Article ID 100098.
Open this publication in new window or tab >>Federated learning for predicting compound mechanism of action based on image-data from cell painting
2024 (English)In: Artificial Intelligence in the Life Sciences, E-ISSN 2667-3185, Vol. 5, article id 100098Article in journal (Refereed) Published
Abstract [en]

Having access to sufficient data is essential in order to train accurate machine learning models, but much data is not publicly available. In drug discovery this is particularly evident, as much data is withheld at pharmaceutical companies for various reasons. Federated Learning (FL) aims at training a joint model between multiple parties but without disclosing data between the parties. In this work, we leverage Federated Learning to predict compound Mechanism of Action (MoA) using fluorescence image data from cell painting. Our study evaluates the effectiveness and efficiency of FL, comparing to non-collaborative and data-sharing collaborative learning in diverse scenarios. Specifically, we investigate the impact of data heterogeneity across participants on MoA prediction, an essential concern in real-life applications of FL, and demonstrate the benefits for all involved parties. This work highlights the potential of federated learning in multi-institutional collaborative machine learning for drug discovery and assessment of chemicals, offering a promising avenue to overcome data-sharing constraints.

Place, publisher, year, edition, pages
Elsevier, 2024
Keywords
Federated learning, Cell profiling, Cell painting, Artificial intelligence, Collaborative learning
National Category
Computer and Information Sciences
Research subject
Scientific Computing; Machine learning
Identifiers
urn:nbn:se:uu:diva-544328 (URN)10.1016/j.ailsci.2024.100098 (DOI)001333825000001 ()2-s2.0-85193044761 (Scopus ID)
Projects
eSSENCE - An eScience Collaboration
Funder
Uppsala UniversityeSSENCE - An eScience CollaborationSwedish Research Council, 2020-03731Swedish Research Council, 2020-01865Swedish Research Council Formas, 2022-00940Swedish Cancer Society, 22 2412EU, Horizon Europe, 101057014EU, Horizon Europe, 101057442Swedish Research Council, 2022-06725
Available from: 2024-12-03 Created: 2024-12-03 Last updated: 2025-01-07Bibliographically approved
Alawadi, S., Ait-Mlouk, A., Toor, S. & Hellander, A. (2024). Toward efficient resource utilization at edge nodes in federated learning. Progress in Artificial Intelligence, 13(2), 101-117
Open this publication in new window or tab >>Toward efficient resource utilization at edge nodes in federated learning
2024 (English)In: Progress in Artificial Intelligence, ISSN 2192-6352, E-ISSN 2192-6360, Vol. 13, no 2, p. 101-117Article in journal (Refereed) Published
Abstract [en]

Federated learning (FL) enables edge nodes to collaboratively contribute to constructing a global model without sharing their data. This is accomplished by devices computing local, private model updates that are then aggregated by a server. However, computational resource constraints and network communication can become a severe bottleneck for larger model sizes typical for deep learning (DL) applications. Edge nodes tend to have limited hardware resources (RAM, CPU), and the network bandwidth and reliability at the edge is a concern for scaling federated fleet applications. In this paper, we propose and evaluate a FL strategy inspired by transfer learning in order to reduce resource utilization on devices, as well as the load on the server and network in each global training round. For each local model update, we randomly select layers to train, freezing the remaining part of the model. In doing so, we can reduce both server load and communication costs per round by excluding all untrained layer weights from being transferred to the server. The goal of this study is to empirically explore the potential trade-off between resource utilization on devices and global model convergence under the proposed strategy. We implement the approach using the FL framework FEDn. A number of experiments were carried out over different datasets (CIFAR-10, CASA, and IMDB), performing different tasks using different DL model architectures. Our results show that training the model partially can accelerate the training process, efficiently utilizes resources on-device, and reduce the data transmission by around 75% and 53% when we train 25%, and 50% of the model layers, respectively, without harming the resulting global model accuracy. Furthermore, our results demonstrate a negative correlation between the number of participating clients in the training process and the number of layers that need to be trained on each client's side. As the number of clients increases, there is a decrease in the required number of layers. This observation highlights the potential of the approach, particularly in cross-device use cases.

Place, publisher, year, edition, pages
Springer Nature, 2024
Keywords
Distributed training, Data privacy, Federated learning, Machine learning, Training parallelization, Partial training
National Category
Computer Sciences
Identifiers
urn:nbn:se:uu:diva-543660 (URN)10.1007/s13748-024-00322-3 (DOI)001242726300001 ()2-s2.0-85195583160 (Scopus ID)
Available from: 2024-11-25 Created: 2024-11-25 Last updated: 2024-11-25Bibliographically approved
Mathias, S., Adameyko, I., Hellander, A. & Kursawe, J. (2023). Contributions of cell behavior to geometric order in embryonic cartilage. PloS Computational Biology, 19(11), Article ID e1011658.
Open this publication in new window or tab >>Contributions of cell behavior to geometric order in embryonic cartilage
2023 (English)In: PloS Computational Biology, ISSN 1553-734X, E-ISSN 1553-7358, Vol. 19, no 11, article id e1011658Article in journal (Refereed) Published
Abstract [en]

During early development, cartilage provides shape and stability to the embryo while serving as a precursor for the skeleton. Correct formation of embryonic cartilage is hence essential for healthy development. In vertebrate cranial cartilage, it has been observed that a flat and laterally extended macroscopic geometry is linked to regular microscopic structure consisting of tightly packed, short, transversal clonar columns. However, it remains an ongoing challenge to identify how individual cells coordinate to successfully shape the tissue, and more precisely which mechanical interactions and cell behaviors contribute to the generation and maintenance of this columnar cartilage geometry during embryogenesis. Here, we apply a three-dimensional cell-based computational model to investigate mechanical principles contributing to column formation. The model accounts for clonal expansion, anisotropic proliferation and the geometrical arrangement of progenitor cells in space. We confirm that oriented cell divisions and repulsive mechanical interactions between cells are key drivers of column formation. In addition, the model suggests that column formation benefits from the spatial gaps created by the extracellular matrix in the initial configuration, and that column maintenance is facilitated by sequential proliferative phases. Our model thus correctly predicts the dependence of local order on division orientation and tissue thickness. The present study presents the first cell-based simulations of cell mechanics during cranial cartilage formation and we anticipate that it will be useful in future studies on the formation and growth of other cartilage geometries.

Place, publisher, year, edition, pages
Public Library of Science (PLoS), 2023
Keywords
cell-based model, cell mechanics, developmental biology, cartilage formation
National Category
Bioinformatics (Computational Biology)
Research subject
Scientific Computing
Identifiers
urn:nbn:se:uu:diva-476892 (URN)10.1371/journal.pcbi.1011658 (DOI)001124342500001 ()38019884 (PubMedID)
Funder
NIH (National Institutes of Health), NIH/2R01EB014877-04A1eSSENCE - An eScience Collaboration
Note

Igor Adameyko, Andreas Hellander and Jochen Kursawe contributed equally to this work as co-senior authors

Available from: 2022-06-22 Created: 2022-06-22 Last updated: 2024-01-08Bibliographically approved
Zhang, T., Hellander, A. & Toor, S. (2023). Efficient Hierarchical Storage Management Empowered by Reinforcement Learning. IEEE Transactions on Knowledge and Data Engineering, 35, 5780-5793
Open this publication in new window or tab >>Efficient Hierarchical Storage Management Empowered by Reinforcement Learning
2023 (English)In: IEEE Transactions on Knowledge and Data Engineering, ISSN 1041-4347, E-ISSN 1558-2191, Vol. 35, p. 5780-5793Article in journal (Refereed) Published
Abstract [en]

With the rapid development of big data and cloud computing, data management has become increasingly challenging. Over the years, a number of frameworks for data management have become available. Most of them are highly efficient, but ultimately create data silos. It becomes difficult to move and work coherently with data as new requirements emerge. A possible solution is to use an intelligent hierarchical (multi-tier) storage system (HSS). A HSS is a meta solution that consists of different storage frameworks organized as a jointly constructed storage pool. A built-in data migration policy that determines the optimal placement of the datasets in the hierarchy is essential. Placement decisions is a non-trivial task since it should be made according to the characteristics of the dataset, the tier status in a hierarchy, and access patterns. This paper presents an open-source hierarchical storage framework with a dynamic migration policy based on reinforcement learning (RL). We present a mathematical model, a software architecture, and implementations based on both simulations and a live cloud-based environment. We compare the proposed RL-based strategy to a baseline of three rule-based policies, showing that the RL-based policy achieves significantly higher efficiency and optimal data distribution in different scenarios.

Place, publisher, year, edition, pages
IEEE, 2023
Keywords
Data Management, Cloud Computing, Hierarchical Storage System, Data Migration, Reinforcement Learning
National Category
Computer Sciences
Research subject
Computer Science
Identifiers
urn:nbn:se:uu:diva-490399 (URN)10.1109/tkde.2022.3176753 (DOI)000981944600024 ()
Projects
eSSENCE - An eScience Collaboration
Funder
Swedish Foundation for Strategic Research, BD15-0008
Available from: 2022-12-09 Created: 2022-12-09 Last updated: 2024-12-16Bibliographically approved
Zhang, T., Hellander, A. & Toor, S. (2023). Efficient Hierarchical Storage Management Empowered by Reinforcement Learning Extended Abstract. In: : . Paper presented at 2023 IEEE 39th International Conference on Data Engineering (ICDE), Anaheim, California, 3-7 April, 2023 (pp. 3869-3870). IEEE
Open this publication in new window or tab >>Efficient Hierarchical Storage Management Empowered by Reinforcement Learning Extended Abstract
2023 (English)Conference paper, Published paper (Refereed)
Abstract [en]

With the rapid development of big data and cloud computing, data management has become increasingly challenging. A possible solution is to use an intelligent hierarchical (multi-tier) storage system (HSS). An HSS is a meta solution that consists of different storage frameworks organized as a jointly constructed storage pool. A built-in data migration policy that determines the optimal placement of the datasets in the hierarchy is essential. Placement decisions are a non-trivial task since they should be made according to the characteristics of the dataset, the tier status in a hierarchy, and access patterns. This paper presents an open-source hierarchical storage framework with a dynamic migration policy based on reinforcement learning (RL).

Place, publisher, year, edition, pages
IEEE, 2023
Keywords
Cloud computing, Storage management, Reinforcement learning, Big Data, Data engineering
National Category
Computer Sciences
Research subject
Computer Science
Identifiers
urn:nbn:se:uu:diva-525454 (URN)10.1109/ICDE55515.2023.00361 (DOI)979-8-3503-2227-9 (ISBN)979-8-3503-2228-6 (ISBN)
Conference
2023 IEEE 39th International Conference on Data Engineering (ICDE), Anaheim, California, 3-7 April, 2023
Funder
Swedish Foundation for Strategic Research, BD15-0008
Available from: 2024-03-22 Created: 2024-03-22 Last updated: 2024-03-22Bibliographically approved
Kaucka, M., Araus, A. J., Tesarova, M., Currie, J. D., Boström, J., Kavkova, M., . . . Adameyko, I. (2022). Altered developmental programs and oriented cell divisions lead to bulky bones during salamander limb regeneration. Nature Communications, 13, Article ID 6949.
Open this publication in new window or tab >>Altered developmental programs and oriented cell divisions lead to bulky bones during salamander limb regeneration
Show others...
2022 (English)In: Nature Communications, E-ISSN 2041-1723, Vol. 13, article id 6949Article in journal (Refereed) Published
Abstract [en]

There are major differences in duration and scale at which limb development and regeneration proceed, raising the question to what extent regeneration is a recapitulation of development. We address this by analyzing skeletal elements using a combination of micro-CT imaging, molecular profiling and clonal cell tracing. We find that, in contrast to development, regenerative skeletal growth is accomplished based entirely on cartilage expansion prior to ossification, not limiting the transversal cartilage expansion and resulting in bulkier skeletal parts. The oriented extension of salamander cartilage and bone appear similar to the development of basicranial synchondroses in mammals, as we found no evidence for cartilage stem cell niches or growth plate-like structures during neither development nor regeneration. Both regenerative and developmental ossification in salamanders start from the cortical bone and proceeds inwards, showing the diversity of schemes for the synchrony of cortical and endochondral ossification among vertebrates. Normal limb development relies on synchronized formation of cartilage and bone. Here, the authors show that in salamander limb regeneration these processes are decoupled: ossification occurs after the final size of regenerating cartilage is reached, allowing fast regeneration and leading to bulky bones.

Place, publisher, year, edition, pages
Springer Nature, 2022
Keywords
Bone development, cartilage development
National Category
Medical Biotechnology (with a focus on Cell Biology (including Stem Cell Biology), Molecular Biology, Microbiology, Biochemistry or Biopharmacy)
Research subject
Biology with specialization in Microbiology
Identifiers
urn:nbn:se:uu:diva-490746 (URN)10.1038/s41467-022-34266-w (DOI)000883836600034 ()36376278 (PubMedID)
Projects
eSSENCE - An eScience Collaboration
Funder
EU, European Research Council, KILL-OR-DIFFERENTIATE 856529Knut and Alice Wallenberg FoundationSwedish Research Council, 2020-02298Swedish Cancer SocietyOlle Engkvists stiftelseSwedish Research Council, 2019-01919
Available from: 2022-12-15 Created: 2022-12-15 Last updated: 2023-03-28Bibliographically approved
Mathias, S., Coulier, A. & Hellander, A. (2022). CBMOS: a GPU-enabled Python framework for the numerical study of center-based models. BMC Bioinformatics, 23, Article ID 55.
Open this publication in new window or tab >>CBMOS: a GPU-enabled Python framework for the numerical study of center-based models
2022 (English)In: BMC Bioinformatics, E-ISSN 1471-2105, Vol. 23, article id 55Article in journal (Refereed) Published
Abstract [en]

Background: Cell-based models are becoming increasingly popular for applications in developmental biology. However, the impact of numerical choices on the accuracy and efficiency of the simulation of these models is rarely meticulously tested. Without concrete studies to differentiate between solid model conclusions and numerical artifacts, modelers are at risk of being misled by their experiments’ results. Most cell-based modeling frameworks offer a feature-rich environment, providing a wide range of biological components, but are less suitable for numerical studies. There is thus a need for software specifically targeted at this use case.

Results: We present CBMOS, a Python framework for the simulation of the center-based or cell-centered model. Contrary to other implementations, CBMOS’ focus is on facilitating numerical study of center-based models by providing access to multiple ordinary differential equation solvers and force functions through a flexible, user-friendly interface and by enabling rapid testing through graphics processing unit (GPU) acceleration. We show-case its potential by illustrating two common workflows: (1) comparison of the numerical properties of two solvers within a Jupyter notebook and (2) measuring average wall times of both solvers on a high performance computing cluster. More specifically, we confirm that although for moderate accuracy levels the backward Euler method allows for larger time step sizes than the commonly used forward Euler method, its additional computational cost due to being an implicit method prohibits its use for practical test cases.

Conclusions: CBMOS is a flexible, easy-to-use Python implementation of the center-based model, exposing both basic model assumptions and numerical components to the user. It is available on GitHub and PyPI under an MIT license. CBMOS allows for fast prototyping on a central processing unit for small systems through the use of NumPy. Using CuPy on a GPU, cell populations of up to 10,000 cells can be simulated within a few seconds. As such, it will substantially lower the time investment for any modeler to check the crucial assumption that model conclusions are independent of numerical issues.

Place, publisher, year, edition, pages
Springer NatureSpringer Nature, 2022
Keywords
Cell-based model, Numerical method, Implicit solver, Python, NumPy, CuPy
National Category
Bioinformatics and Computational Biology
Research subject
Scientific Computing
Identifiers
urn:nbn:se:uu:diva-442222 (URN)10.1186/s12859-022-04575-4 (DOI)000750510400001 ()35100968 (PubMedID)
Projects
eSSENCE - An eScience Collaboration
Funder
Swedish National Infrastructure for Computing (SNIC), 2019/8-227
Available from: 2021-05-11 Created: 2021-05-11 Last updated: 2025-02-07Bibliographically approved
Projects
From single cells to cancer tumors - multiscale simulation of stochastic multicellular systems [2015-03964_VR]; Uppsala UniversityComputational exploration of high-dimensional stochastic biochemical reaction networks: A machine learning-assisted approach [2023-05167_VR]; Uppsala University
Organisations
Identifiers
ORCID iD: ORCID iD iconorcid.org/0000-0001-7273-7923

Search in DiVA

Show all publications