Logo: to the web site of Uppsala University

uu.sePublications from Uppsala University
System disruptions
We are currently experiencing disruptions on the search portals due to high traffic. We are working to resolve the issue, you may temporarily encounter an error message.
Change search
Link to record
Permanent link

Direct link
Toor, Salman, Associate ProfessorORCID iD iconorcid.org/0000-0003-0302-6276
Alternative names
Publications (10 of 48) Show all publications
Ju, L., Zhang, T., Toor, S. & Hellander, A. (2024). Accelerating Fair Federated Learning: Adaptive Federated Adam. IEEE Transactions on Machine Learning in Communications and Networking, 2, 1017-1032
Open this publication in new window or tab >>Accelerating Fair Federated Learning: Adaptive Federated Adam
2024 (English)In: IEEE Transactions on Machine Learning in Communications and Networking, E-ISSN 2831-316X, Vol. 2, p. 1017-1032Article in journal (Refereed) Published
Abstract [en]

Federated learning is a distributed and privacy-preserving approach to train a statistical model collaboratively from decentralized data held by different parties. However, when the datasets are not independent and identically distributed, models trained by naive federated algorithms may be biased towards certain participants, and model performance across participants is non-uniform. This is known as the fairness problem in federated learning. In this paper, we formulate fairness-controlled federated learning as a dynamical multi-objective optimization problem to ensure the fairness and convergence with theoretical guarantee. To solve the problem efficiently, we study the convergence and bias of Adam as the server optimizer in federated learning, and propose Adaptive Federated Adam ( AdaFedAdam ) to accelerate fair federated learning with alleviated bias. We validated the effectiveness, Pareto optimality and robustness of AdaFedAdam with numerical experiments and show that AdaFedAdam outperforms existing algorithms, providing better convergence and fairness properties of the federated scheme.

Place, publisher, year, edition, pages
Institute of Electrical and Electronics Engineers (IEEE), 2024
National Category
Computer Sciences
Research subject
Machine learning
Identifiers
urn:nbn:se:uu:diva-544327 (URN)10.1109/tmlcn.2024.3423648 (DOI)
Projects
eSSENCE - An eScience Collaboration
Available from: 2024-12-03 Created: 2024-12-03 Last updated: 2025-01-07Bibliographically approved
Zhang, T., Gupta, A., Francisco Rodríguez, M. A., Spjuth, O., Hellander, A. & Toor, S. (2024). Data management of scientific applications in a reinforcement learning-based hierarchical storage system. Expert systems with applications, 237, Article ID 121443.
Open this publication in new window or tab >>Data management of scientific applications in a reinforcement learning-based hierarchical storage system
Show others...
2024 (English)In: Expert systems with applications, ISSN 0957-4174, E-ISSN 1873-6793, Vol. 237, article id 121443Article in journal (Refereed) Published
Abstract [en]

In many areas of data-driven science, large datasets are generated where the individual data objects are images, matrices, or otherwise have a clear structure. However, these objects can be information-sparse, and a challenge is to efficiently find and work with the most interesting data as early as possible in an analysis pipeline. We have recently proposed a new model for big data management where the internal structure and information of the data are associated with each data object (as opposed to simple metadata). There is then an opportunity for comprehensive data management solutions to account for data-specific internal structure as well as access patterns. In this article, we explore this idea together with our recently proposed hierarchical storage management framework that uses reinforcement learning (RL) for autonomous and dynamic data placement in different tiers in a storage hierarchy. Our case-study is based on four scientific datasets: Protein translocation microscopy images, Airfoil angle of attack meshes, 1000 Genomes sequences, and Phenotypic screening images. The presented results highlight that our framework is optimal and can quickly adapt to new data access requirements. It overall reduces the data processing time, and the proposed autonomous data placement is superior compared to any static or semi-static data placement policies.

Place, publisher, year, edition, pages
Elsevier, 2024
Keywords
Data management, Scientific application, Hierarchical storage system, Reinforcement learning, Large scientific datasets
National Category
Computer Sciences Computational Mathematics
Research subject
Computer Science with specialization in Database Technology; Computer Science
Identifiers
urn:nbn:se:uu:diva-513854 (URN)10.1016/j.eswa.2023.121443 (DOI)001081909200001 ()
Funder
Swedish Foundation for Strategic Research, BD15-0008Swedish National Infrastructure for Computing (SNIC), SNIC 2022/22-835eSSENCE - An eScience Collaboration
Available from: 2023-10-12 Created: 2023-10-12 Last updated: 2024-12-08Bibliographically approved
Zhang, T., Gupta, A., Francisco Rodríguez, M. A., Spjuth, O., Hellander, A. & Toor, S. (2024). Data management of scientific applications in a reinforcement learning-based hierarchical storage system. Expert systems with applications, 237, 121443-121443, Article ID 121443.
Open this publication in new window or tab >>Data management of scientific applications in a reinforcement learning-based hierarchical storage system
Show others...
2024 (English)In: Expert systems with applications, ISSN 0957-4174, E-ISSN 1873-6793, Vol. 237, p. 121443-121443, article id 121443Article in journal (Refereed) Published
National Category
Computer Sciences
Identifiers
urn:nbn:se:uu:diva-552596 (URN)10.1016/j.eswa.2023.121443 (DOI)
Funder
Swedish Foundation for Strategic Research, BD15-0008Swedish National Infrastructure for Computing (SNIC), SNIC 2022/22-835eSSENCE - An eScience Collaboration
Available from: 2025-03-17 Created: 2025-03-17 Last updated: 2025-03-17
Sun, Z., Teixeira, A. & Toor, S. (2024). GNN-IDS: Graph Neural Network based Intrusion Detection System. In: Proceedings of the 19th International Conference on Availability, Reliability and Security: . Paper presented at ARES 2024ARES, the 19th International Conference on Availability, Reliability and Security, JUL 30-AUG 02, 2024, Vienna, AUSTRIA. New York, NY, USA: Association for Computing Machinery (ACM), Article ID 14.
Open this publication in new window or tab >>GNN-IDS: Graph Neural Network based Intrusion Detection System
2024 (English)In: Proceedings of the 19th International Conference on Availability, Reliability and Security, New York, NY, USA: Association for Computing Machinery (ACM), 2024, article id 14Conference paper, Published paper (Refereed)
Abstract [en]

Intrusion detection systems (IDSs) are widely used to identify anomalies in computer networks and raise alarms on intrusive behaviors. ML-based IDSs generally take network traces or host logs as input to extract patterns from individual samples, whereas the inter-dependencies of network are often not captured and learned, which may result in large amounts of uncertain predictions, false positives, and false negatives. To tackle the challenges in intrusion detection, we propose a graph neural network-based intrusion detection system (GNN-IDS), which is data-driven and machine learning-empowered. In our proposed GNN-IDS, the attack graph and real-time measurements that represent static and dynamic attributes of computer networks, respectively, are incorporated and associated to represent complex computer networks. Graph neural networks are employed as the inference engine for intrusion detection. By learning network connectivity, graph neural networks can quantify the importance of neighboring nodes and node features to make more reliable predictions. Furthermore, by incorporating an attack graph, GNN-IDS could not only detect anomalies but also identify the malicious actions causing the anomalies. The experimental results on a use case network with two synthetic datasets (one generated from public IDS data) show that the proposed GNN-IDS achieves good performance. The results are analyzed from the aspects of uncertainty, explainability, and robustness.

Place, publisher, year, edition, pages
New York, NY, USA: Association for Computing Machinery (ACM), 2024
Keywords
Explainability, Graph Neural Network, Intrusion Detection System, Robustness, Uncertainty
National Category
Computer Systems
Research subject
Scientific Computing
Identifiers
urn:nbn:se:uu:diva-544329 (URN)10.1145/3664476.3664515 (DOI)001283894700038 ()2-s2.0-85200392088 (Scopus ID)9798400717185 (ISBN)
Conference
ARES 2024ARES, the 19th International Conference on Availability, Reliability and Security, JUL 30-AUG 02, 2024, Vienna, AUSTRIA
Projects
eSSENCE - An eScience Collaboration
Available from: 2024-12-03 Created: 2024-12-03 Last updated: 2025-02-03Bibliographically approved
Alawadi, S., Ait-Mlouk, A., Toor, S. & Hellander, A. (2024). Toward efficient resource utilization at edge nodes in federated learning. Progress in Artificial Intelligence, 13(2), 101-117
Open this publication in new window or tab >>Toward efficient resource utilization at edge nodes in federated learning
2024 (English)In: Progress in Artificial Intelligence, ISSN 2192-6352, E-ISSN 2192-6360, Vol. 13, no 2, p. 101-117Article in journal (Refereed) Published
Abstract [en]

Federated learning (FL) enables edge nodes to collaboratively contribute to constructing a global model without sharing their data. This is accomplished by devices computing local, private model updates that are then aggregated by a server. However, computational resource constraints and network communication can become a severe bottleneck for larger model sizes typical for deep learning (DL) applications. Edge nodes tend to have limited hardware resources (RAM, CPU), and the network bandwidth and reliability at the edge is a concern for scaling federated fleet applications. In this paper, we propose and evaluate a FL strategy inspired by transfer learning in order to reduce resource utilization on devices, as well as the load on the server and network in each global training round. For each local model update, we randomly select layers to train, freezing the remaining part of the model. In doing so, we can reduce both server load and communication costs per round by excluding all untrained layer weights from being transferred to the server. The goal of this study is to empirically explore the potential trade-off between resource utilization on devices and global model convergence under the proposed strategy. We implement the approach using the FL framework FEDn. A number of experiments were carried out over different datasets (CIFAR-10, CASA, and IMDB), performing different tasks using different DL model architectures. Our results show that training the model partially can accelerate the training process, efficiently utilizes resources on-device, and reduce the data transmission by around 75% and 53% when we train 25%, and 50% of the model layers, respectively, without harming the resulting global model accuracy. Furthermore, our results demonstrate a negative correlation between the number of participating clients in the training process and the number of layers that need to be trained on each client's side. As the number of clients increases, there is a decrease in the required number of layers. This observation highlights the potential of the approach, particularly in cross-device use cases.

Place, publisher, year, edition, pages
Springer Nature, 2024
Keywords
Distributed training, Data privacy, Federated learning, Machine learning, Training parallelization, Partial training
National Category
Computer Sciences
Identifiers
urn:nbn:se:uu:diva-543660 (URN)10.1007/s13748-024-00322-3 (DOI)001242726300001 ()2-s2.0-85195583160 (Scopus ID)
Available from: 2024-11-25 Created: 2024-11-25 Last updated: 2024-11-25Bibliographically approved
Zhang, T., Hellander, A. & Toor, S. (2023). Efficient Hierarchical Storage Management Empowered by Reinforcement Learning. IEEE Transactions on Knowledge and Data Engineering, 35, 5780-5793
Open this publication in new window or tab >>Efficient Hierarchical Storage Management Empowered by Reinforcement Learning
2023 (English)In: IEEE Transactions on Knowledge and Data Engineering, ISSN 1041-4347, E-ISSN 1558-2191, Vol. 35, p. 5780-5793Article in journal (Refereed) Published
Abstract [en]

With the rapid development of big data and cloud computing, data management has become increasingly challenging. Over the years, a number of frameworks for data management have become available. Most of them are highly efficient, but ultimately create data silos. It becomes difficult to move and work coherently with data as new requirements emerge. A possible solution is to use an intelligent hierarchical (multi-tier) storage system (HSS). A HSS is a meta solution that consists of different storage frameworks organized as a jointly constructed storage pool. A built-in data migration policy that determines the optimal placement of the datasets in the hierarchy is essential. Placement decisions is a non-trivial task since it should be made according to the characteristics of the dataset, the tier status in a hierarchy, and access patterns. This paper presents an open-source hierarchical storage framework with a dynamic migration policy based on reinforcement learning (RL). We present a mathematical model, a software architecture, and implementations based on both simulations and a live cloud-based environment. We compare the proposed RL-based strategy to a baseline of three rule-based policies, showing that the RL-based policy achieves significantly higher efficiency and optimal data distribution in different scenarios.

Place, publisher, year, edition, pages
IEEE, 2023
Keywords
Data Management, Cloud Computing, Hierarchical Storage System, Data Migration, Reinforcement Learning
National Category
Computer Sciences
Research subject
Computer Science
Identifiers
urn:nbn:se:uu:diva-490399 (URN)10.1109/tkde.2022.3176753 (DOI)000981944600024 ()
Projects
eSSENCE - An eScience Collaboration
Funder
Swedish Foundation for Strategic Research, BD15-0008
Available from: 2022-12-09 Created: 2022-12-09 Last updated: 2024-12-16Bibliographically approved
Zhang, T., Hellander, A. & Toor, S. (2023). Efficient Hierarchical Storage Management Empowered by Reinforcement Learning Extended Abstract. In: : . Paper presented at 2023 IEEE 39th International Conference on Data Engineering (ICDE), Anaheim, California, 3-7 April, 2023 (pp. 3869-3870). IEEE
Open this publication in new window or tab >>Efficient Hierarchical Storage Management Empowered by Reinforcement Learning Extended Abstract
2023 (English)Conference paper, Published paper (Refereed)
Abstract [en]

With the rapid development of big data and cloud computing, data management has become increasingly challenging. A possible solution is to use an intelligent hierarchical (multi-tier) storage system (HSS). An HSS is a meta solution that consists of different storage frameworks organized as a jointly constructed storage pool. A built-in data migration policy that determines the optimal placement of the datasets in the hierarchy is essential. Placement decisions are a non-trivial task since they should be made according to the characteristics of the dataset, the tier status in a hierarchy, and access patterns. This paper presents an open-source hierarchical storage framework with a dynamic migration policy based on reinforcement learning (RL).

Place, publisher, year, edition, pages
IEEE, 2023
Keywords
Cloud computing, Storage management, Reinforcement learning, Big Data, Data engineering
National Category
Computer Sciences
Research subject
Computer Science
Identifiers
urn:nbn:se:uu:diva-525454 (URN)10.1109/ICDE55515.2023.00361 (DOI)979-8-3503-2227-9 (ISBN)979-8-3503-2228-6 (ISBN)
Conference
2023 IEEE 39th International Conference on Data Engineering (ICDE), Anaheim, California, 3-7 April, 2023
Funder
Swedish Foundation for Strategic Research, BD15-0008
Available from: 2024-03-22 Created: 2024-03-22 Last updated: 2024-03-22Bibliographically approved
Chu, J., Singh, P. & Toor, S. (2023). Efficient Resource Scheduling for Distributed Infrastructures using Negotiation Capabilities. In: 2023 IEEE 16th International Conference on Cloud Computing (CLOUD): . Paper presented at IEEE 16th International Conference on Cloud Computing (CLOUD), 2-8 July, 2023, Chicago, IL, USA (pp. 486-492). Institute of Electrical and Electronics Engineers (IEEE)
Open this publication in new window or tab >>Efficient Resource Scheduling for Distributed Infrastructures using Negotiation Capabilities
2023 (English)In: 2023 IEEE 16th International Conference on Cloud Computing (CLOUD), Institute of Electrical and Electronics Engineers (IEEE), 2023, p. 486-492Conference paper, Published paper (Refereed)
Abstract [en]

The information explosion drives enterprises and individuals to rent cloud computing infrastructure for their applications in the cloud. However, the agreements between cloud computing providers and clients are often inefficient. We propose an agent-based auto-negotiation system for resource scheduling using fuzzy logic. Our method completes a one-to-one auto-negotiation process and generates optimal offers for providers and clients. We compare the impact of different member functions, fuzzy rule sets, and negotiation scenarios on the offers to optimize the system. Our proposed method efficiently utilizes resources and offers interpretability, high flexibility, and customization. We successfully train machine learning models to replace the fuzzy negotiation system, improving processing speed. The article also highlights potential future improvements to the proposed system and machine learning models. All codes and data are available as an open source repository.

Place, publisher, year, edition, pages
Institute of Electrical and Electronics Engineers (IEEE), 2023
Series
IEEE International Conference on Cloud Computing, ISSN 2159-6182, E-ISSN 2159-6190
Keywords
distributed infrastructure, cloud computing, fuzzy logic, negotiation, intelligent agent, machine learning
National Category
Computer Sciences
Research subject
Scientific Computing; Computer Science
Identifiers
urn:nbn:se:uu:diva-509838 (URN)10.1109/CLOUD60044.2023.00065 (DOI)001085065100055 ()979-8-3503-0481-7 (ISBN)979-8-3503-0482-4 (ISBN)
Conference
IEEE 16th International Conference on Cloud Computing (CLOUD), 2-8 July, 2023, Chicago, IL, USA
Projects
eSSENCE - An eScience Collaboration
Funder
Swedish National Infrastructure for Computing (SNIC)eSSENCE - An eScience Collaboration
Available from: 2023-08-23 Created: 2023-08-23 Last updated: 2023-11-14Bibliographically approved
Zhang, W., Toor, S. & Al-Naday, M. (2023). Federated Machine Learning for Resource Allocation in Multi-domain Fog Ecosystems. In: 2023 IEEE 12th International Conference on Cloud Networking, CLOUDNET: . Paper presented at 12th International Conference on Cloud Networking (CloudNet), NOVEMBER 01-03, 2023, Hoboken, NJ, USA (pp. 290-298). Institute of Electrical and Electronics Engineers (IEEE)
Open this publication in new window or tab >>Federated Machine Learning for Resource Allocation in Multi-domain Fog Ecosystems
2023 (English)In: 2023 IEEE 12th International Conference on Cloud Networking, CLOUDNET, Institute of Electrical and Electronics Engineers (IEEE), 2023, p. 290-298Conference paper, Published paper (Refereed)
Abstract [en]

The proliferation of the Internet of Things (IoT) has incentivised extending cloud resources to the edge in what is deemed fog computing. The latter is manifesting as an ecosystem of connected clouds, geo-dispersed and of diverse capacities. In such ecosystem, workload allocation to fog services becomes a non-trivial challenge. Users' demand at the edge is diverse, which does not lend to straightforward resource planning. Conversely, running services at the edge may leverage proximity, but it comes at higher operational cost let alone increasing risk of resource straining. Consequently, there is a need for intelligent yet scalable allocation solutions that counter the adversity of demand, while efficiently distributing load between the edge and farther clouds. Machine learning is increasingly adopted in resource planning. This paper proposes a federated deep reinforcement learning system, based on deep Q-learning network (DQN), for workload distribution in a fog ecosystem. The proposed solution adapts a DQN to optimize local workload allocations, made by single gateways. Federated learning is incorporated to allow multiple gateways in a network to collaboratively build knowledge of users' demand. This is leveraged to establish consensus on the fraction of workload allocated to different fog nodes, using lower data supply and computation resources. System performance is evaluated using realistic demand from Google Cluster Workload Traces 2019. Evaluation results show over 50% reduction in failed allocations when spreading users over larger number of gateways, given fixed number of fog nodes. The results further illustrate the trade-offs between performance and cost under different conditions.

Place, publisher, year, edition, pages
Institute of Electrical and Electronics Engineers (IEEE), 2023
Series
IEEE International Conference on Cloud Networking, ISSN 2374-3239, E-ISSN 2771-5663
Keywords
Workload Allocation, Federated Learning, Deep Q-network, Fog networks, Federated Average Aggregation
National Category
Computer Systems Computer Sciences Communication Systems
Research subject
Scientific Computing; Computer Science with specialization in Computer Communication
Identifiers
urn:nbn:se:uu:diva-536184 (URN)10.1109/CloudNet59005.2023.10490033 (DOI)001222507300018 ()9798350313062 (ISBN)9798350313079 (ISBN)
Conference
12th International Conference on Cloud Networking (CloudNet), NOVEMBER 01-03, 2023, Hoboken, NJ, USA
Projects
eSSENCE - An eScience Collaboration
Available from: 2024-08-14 Created: 2024-08-14 Last updated: 2025-01-07Bibliographically approved
AL-Naday, M., Reed, M., Dobre, V., Toor, S., Volckaert, B. & De Turck, F. (2023). Service-based Federated Deep Reinforcement Learning for Anomaly Detection in Fog Ecosystems. In: 2023 26th Conference on Innovation in Clouds, Internet and Networks and Workshops (ICIN): . Paper presented at 2023 26th Conference on Innovation in Clouds, Internet and Networks and Workshops (ICIN), 6-9 March, Paris, France (pp. 121-128). Institute of Electrical and Electronics Engineers (IEEE)
Open this publication in new window or tab >>Service-based Federated Deep Reinforcement Learning for Anomaly Detection in Fog Ecosystems
Show others...
2023 (English)In: 2023 26th Conference on Innovation in Clouds, Internet and Networks and Workshops (ICIN), Institute of Electrical and Electronics Engineers (IEEE), 2023, p. 121-128Conference paper, Published paper (Refereed)
Abstract [en]

With Digital transformation, the diversity of services and infrastructure in backhaul fog network(s) is rising to unprecedented levels. This is causing a rising threat of a wider range of cyber attacks coupled with a growing integration of constrained range of infrastructure, particularly seen at the network edge. Deep reinforcement-based learning is an attractive approach to detecting attacks, as it allows less dependency on labeled data with better ability to classify different attacks. However, current approaches to learning are known to be computationally expensive (cost) and the learning experience can be negatively impacted by the presence of outliers and noise (quality). This work tackles both the cost and quality challenges with a novel service-based federated deep reinforcement learning solution, enabling anomaly detection and attack classification at a reduced data cost and with better quality. The federated settings in the proposed approach enable multiple edge units to create clusters that follow a bottom-up learning approach. The proposed solution adapts deep Q-learning Network (DQN) for service-tunable flow classification, and introduces a novel federated DQN (FDQN) for federated learning. Through such targeted training and validation, variation in data patterns and noise is reduced. This leads to improved performance per service with lower training cost. Performance and cost of the solution, along with sensitivity to exploration parameters are evaluated using an example publicly available dataset (UNSW-NB15). Evaluation results show the proposed solution to maintain detection accuracy with lower data supply, while improving the classification rate by a factor of ≈ 2.

Place, publisher, year, edition, pages
Institute of Electrical and Electronics Engineers (IEEE), 2023
Series
International Conference on Intelligence in Next Generation Networks, ISSN 2162-3414, E-ISSN 2472-8144
Keywords
cyber security, federated deep reinforcement learning, Deep Q-Learning, anomaly detection, cloud-to-edge continuum, fog computing
National Category
Computer Sciences
Research subject
Scientific Computing
Identifiers
urn:nbn:se:uu:diva-508788 (URN)10.1109/ICIN56760.2023.10073495 (DOI)001006975300023 ()979-8-3503-9804-5 (ISBN)979-8-3503-9805-2 (ISBN)
Conference
2023 26th Conference on Innovation in Clouds, Internet and Networks and Workshops (ICIN), 6-9 March, Paris, France
Projects
eSSENCE - An eScience Collaboration
Available from: 2023-08-09 Created: 2023-08-09 Last updated: 2023-08-24Bibliographically approved
Projects
Computational exploration of high-dimensional stochastic biochemical reaction networks: A machine learning-assisted approach [2023-05167_VR]; Uppsala University
Organisations
Identifiers
ORCID iD: ORCID iD iconorcid.org/0000-0003-0302-6276

Search in DiVA

Show all publications