Logotyp: till Uppsala universitets webbplats

uu.sePublikationer från Uppsala universitet
Ändra sökning
Länk till posten
Permanent länk

Direktlänk
Toor, Salman, Associate ProfessorORCID iD iconorcid.org/0000-0003-0302-6276
Alternativa namn
Publikationer (10 of 48) Visa alla publikationer
Ju, L., Zhang, T., Toor, S. & Hellander, A. (2024). Accelerating Fair Federated Learning: Adaptive Federated Adam. IEEE Transactions on Machine Learning in Communications and Networking, 2, 1017-1032
Öppna denna publikation i ny flik eller fönster >>Accelerating Fair Federated Learning: Adaptive Federated Adam
2024 (Engelska)Ingår i: IEEE Transactions on Machine Learning in Communications and Networking, E-ISSN 2831-316X, Vol. 2, s. 1017-1032Artikel i tidskrift (Refereegranskat) Published
Abstract [en]

Federated learning is a distributed and privacy-preserving approach to train a statistical model collaboratively from decentralized data held by different parties. However, when the datasets are not independent and identically distributed, models trained by naive federated algorithms may be biased towards certain participants, and model performance across participants is non-uniform. This is known as the fairness problem in federated learning. In this paper, we formulate fairness-controlled federated learning as a dynamical multi-objective optimization problem to ensure the fairness and convergence with theoretical guarantee. To solve the problem efficiently, we study the convergence and bias of Adam as the server optimizer in federated learning, and propose Adaptive Federated Adam ( AdaFedAdam ) to accelerate fair federated learning with alleviated bias. We validated the effectiveness, Pareto optimality and robustness of AdaFedAdam with numerical experiments and show that AdaFedAdam outperforms existing algorithms, providing better convergence and fairness properties of the federated scheme.

Ort, förlag, år, upplaga, sidor
Institute of Electrical and Electronics Engineers (IEEE), 2024
Nationell ämneskategori
Datavetenskap (datalogi)
Forskningsämne
Maskininlärning
Identifikatorer
urn:nbn:se:uu:diva-544327 (URN)10.1109/tmlcn.2024.3423648 (DOI)
Projekt
eSSENCE - An eScience Collaboration
Tillgänglig från: 2024-12-03 Skapad: 2024-12-03 Senast uppdaterad: 2025-01-07Bibliografiskt granskad
Zhang, T., Gupta, A., Francisco Rodríguez, M. A., Spjuth, O., Hellander, A. & Toor, S. (2024). Data management of scientific applications in a reinforcement learning-based hierarchical storage system. Expert systems with applications, 237, Article ID 121443.
Öppna denna publikation i ny flik eller fönster >>Data management of scientific applications in a reinforcement learning-based hierarchical storage system
Visa övriga...
2024 (Engelska)Ingår i: Expert systems with applications, ISSN 0957-4174, E-ISSN 1873-6793, Vol. 237, artikel-id 121443Artikel i tidskrift (Refereegranskat) Published
Abstract [en]

In many areas of data-driven science, large datasets are generated where the individual data objects are images, matrices, or otherwise have a clear structure. However, these objects can be information-sparse, and a challenge is to efficiently find and work with the most interesting data as early as possible in an analysis pipeline. We have recently proposed a new model for big data management where the internal structure and information of the data are associated with each data object (as opposed to simple metadata). There is then an opportunity for comprehensive data management solutions to account for data-specific internal structure as well as access patterns. In this article, we explore this idea together with our recently proposed hierarchical storage management framework that uses reinforcement learning (RL) for autonomous and dynamic data placement in different tiers in a storage hierarchy. Our case-study is based on four scientific datasets: Protein translocation microscopy images, Airfoil angle of attack meshes, 1000 Genomes sequences, and Phenotypic screening images. The presented results highlight that our framework is optimal and can quickly adapt to new data access requirements. It overall reduces the data processing time, and the proposed autonomous data placement is superior compared to any static or semi-static data placement policies.

Ort, förlag, år, upplaga, sidor
Elsevier, 2024
Nyckelord
Data management, Scientific application, Hierarchical storage system, Reinforcement learning, Large scientific datasets
Nationell ämneskategori
Datavetenskap (datalogi) Beräkningsmatematik
Forskningsämne
Datavetenskap med inriktning mot databasteknik; Datavetenskap
Identifikatorer
urn:nbn:se:uu:diva-513854 (URN)10.1016/j.eswa.2023.121443 (DOI)001081909200001 ()
Forskningsfinansiär
Stiftelsen för strategisk forskning (SSF), BD15-0008Swedish National Infrastructure for Computing (SNIC), SNIC 2022/22-835eSSENCE - An eScience Collaboration
Tillgänglig från: 2023-10-12 Skapad: 2023-10-12 Senast uppdaterad: 2024-12-08Bibliografiskt granskad
Zhang, T., Gupta, A., Francisco Rodríguez, M. A., Spjuth, O., Hellander, A. & Toor, S. (2024). Data management of scientific applications in a reinforcement learning-based hierarchical storage system. Expert systems with applications, 237, 121443-121443, Article ID 121443.
Öppna denna publikation i ny flik eller fönster >>Data management of scientific applications in a reinforcement learning-based hierarchical storage system
Visa övriga...
2024 (Engelska)Ingår i: Expert systems with applications, ISSN 0957-4174, E-ISSN 1873-6793, Vol. 237, s. 121443-121443, artikel-id 121443Artikel i tidskrift (Refereegranskat) Published
Nationell ämneskategori
Datavetenskap (datalogi)
Identifikatorer
urn:nbn:se:uu:diva-552596 (URN)10.1016/j.eswa.2023.121443 (DOI)
Forskningsfinansiär
Stiftelsen för strategisk forskning (SSF), BD15-0008Swedish National Infrastructure for Computing (SNIC), SNIC 2022/22-835eSSENCE - An eScience Collaboration
Tillgänglig från: 2025-03-17 Skapad: 2025-03-17 Senast uppdaterad: 2025-03-17
Sun, Z., Teixeira, A. & Toor, S. (2024). GNN-IDS: Graph Neural Network based Intrusion Detection System. In: Proceedings of the 19th International Conference on Availability, Reliability and Security: . Paper presented at ARES 2024ARES, the 19th International Conference on Availability, Reliability and Security, JUL 30-AUG 02, 2024, Vienna, AUSTRIA. New York, NY, USA: Association for Computing Machinery (ACM), Article ID 14.
Öppna denna publikation i ny flik eller fönster >>GNN-IDS: Graph Neural Network based Intrusion Detection System
2024 (Engelska)Ingår i: Proceedings of the 19th International Conference on Availability, Reliability and Security, New York, NY, USA: Association for Computing Machinery (ACM), 2024, artikel-id 14Konferensbidrag, Publicerat paper (Refereegranskat)
Abstract [en]

Intrusion detection systems (IDSs) are widely used to identify anomalies in computer networks and raise alarms on intrusive behaviors. ML-based IDSs generally take network traces or host logs as input to extract patterns from individual samples, whereas the inter-dependencies of network are often not captured and learned, which may result in large amounts of uncertain predictions, false positives, and false negatives. To tackle the challenges in intrusion detection, we propose a graph neural network-based intrusion detection system (GNN-IDS), which is data-driven and machine learning-empowered. In our proposed GNN-IDS, the attack graph and real-time measurements that represent static and dynamic attributes of computer networks, respectively, are incorporated and associated to represent complex computer networks. Graph neural networks are employed as the inference engine for intrusion detection. By learning network connectivity, graph neural networks can quantify the importance of neighboring nodes and node features to make more reliable predictions. Furthermore, by incorporating an attack graph, GNN-IDS could not only detect anomalies but also identify the malicious actions causing the anomalies. The experimental results on a use case network with two synthetic datasets (one generated from public IDS data) show that the proposed GNN-IDS achieves good performance. The results are analyzed from the aspects of uncertainty, explainability, and robustness.

Ort, förlag, år, upplaga, sidor
New York, NY, USA: Association for Computing Machinery (ACM), 2024
Nyckelord
Explainability, Graph Neural Network, Intrusion Detection System, Robustness, Uncertainty
Nationell ämneskategori
Datorsystem
Forskningsämne
Beräkningsvetenskap
Identifikatorer
urn:nbn:se:uu:diva-544329 (URN)10.1145/3664476.3664515 (DOI)001283894700038 ()2-s2.0-85200392088 (Scopus ID)9798400717185 (ISBN)
Konferens
ARES 2024ARES, the 19th International Conference on Availability, Reliability and Security, JUL 30-AUG 02, 2024, Vienna, AUSTRIA
Projekt
eSSENCE - An eScience Collaboration
Tillgänglig från: 2024-12-03 Skapad: 2024-12-03 Senast uppdaterad: 2025-02-03Bibliografiskt granskad
Alawadi, S., Ait-Mlouk, A., Toor, S. & Hellander, A. (2024). Toward efficient resource utilization at edge nodes in federated learning. Progress in Artificial Intelligence, 13(2), 101-117
Öppna denna publikation i ny flik eller fönster >>Toward efficient resource utilization at edge nodes in federated learning
2024 (Engelska)Ingår i: Progress in Artificial Intelligence, ISSN 2192-6352, E-ISSN 2192-6360, Vol. 13, nr 2, s. 101-117Artikel i tidskrift (Refereegranskat) Published
Abstract [en]

Federated learning (FL) enables edge nodes to collaboratively contribute to constructing a global model without sharing their data. This is accomplished by devices computing local, private model updates that are then aggregated by a server. However, computational resource constraints and network communication can become a severe bottleneck for larger model sizes typical for deep learning (DL) applications. Edge nodes tend to have limited hardware resources (RAM, CPU), and the network bandwidth and reliability at the edge is a concern for scaling federated fleet applications. In this paper, we propose and evaluate a FL strategy inspired by transfer learning in order to reduce resource utilization on devices, as well as the load on the server and network in each global training round. For each local model update, we randomly select layers to train, freezing the remaining part of the model. In doing so, we can reduce both server load and communication costs per round by excluding all untrained layer weights from being transferred to the server. The goal of this study is to empirically explore the potential trade-off between resource utilization on devices and global model convergence under the proposed strategy. We implement the approach using the FL framework FEDn. A number of experiments were carried out over different datasets (CIFAR-10, CASA, and IMDB), performing different tasks using different DL model architectures. Our results show that training the model partially can accelerate the training process, efficiently utilizes resources on-device, and reduce the data transmission by around 75% and 53% when we train 25%, and 50% of the model layers, respectively, without harming the resulting global model accuracy. Furthermore, our results demonstrate a negative correlation between the number of participating clients in the training process and the number of layers that need to be trained on each client's side. As the number of clients increases, there is a decrease in the required number of layers. This observation highlights the potential of the approach, particularly in cross-device use cases.

Ort, förlag, år, upplaga, sidor
Springer Nature, 2024
Nyckelord
Distributed training, Data privacy, Federated learning, Machine learning, Training parallelization, Partial training
Nationell ämneskategori
Datavetenskap (datalogi)
Identifikatorer
urn:nbn:se:uu:diva-543660 (URN)10.1007/s13748-024-00322-3 (DOI)001242726300001 ()2-s2.0-85195583160 (Scopus ID)
Tillgänglig från: 2024-11-25 Skapad: 2024-11-25 Senast uppdaterad: 2024-11-25Bibliografiskt granskad
Zhang, T., Hellander, A. & Toor, S. (2023). Efficient Hierarchical Storage Management Empowered by Reinforcement Learning. IEEE Transactions on Knowledge and Data Engineering, 35, 5780-5793
Öppna denna publikation i ny flik eller fönster >>Efficient Hierarchical Storage Management Empowered by Reinforcement Learning
2023 (Engelska)Ingår i: IEEE Transactions on Knowledge and Data Engineering, ISSN 1041-4347, E-ISSN 1558-2191, Vol. 35, s. 5780-5793Artikel i tidskrift (Refereegranskat) Published
Abstract [en]

With the rapid development of big data and cloud computing, data management has become increasingly challenging. Over the years, a number of frameworks for data management have become available. Most of them are highly efficient, but ultimately create data silos. It becomes difficult to move and work coherently with data as new requirements emerge. A possible solution is to use an intelligent hierarchical (multi-tier) storage system (HSS). A HSS is a meta solution that consists of different storage frameworks organized as a jointly constructed storage pool. A built-in data migration policy that determines the optimal placement of the datasets in the hierarchy is essential. Placement decisions is a non-trivial task since it should be made according to the characteristics of the dataset, the tier status in a hierarchy, and access patterns. This paper presents an open-source hierarchical storage framework with a dynamic migration policy based on reinforcement learning (RL). We present a mathematical model, a software architecture, and implementations based on both simulations and a live cloud-based environment. We compare the proposed RL-based strategy to a baseline of three rule-based policies, showing that the RL-based policy achieves significantly higher efficiency and optimal data distribution in different scenarios.

Ort, förlag, år, upplaga, sidor
IEEE, 2023
Nyckelord
Data Management, Cloud Computing, Hierarchical Storage System, Data Migration, Reinforcement Learning
Nationell ämneskategori
Datavetenskap (datalogi)
Forskningsämne
Datavetenskap
Identifikatorer
urn:nbn:se:uu:diva-490399 (URN)10.1109/tkde.2022.3176753 (DOI)000981944600024 ()
Projekt
eSSENCE - An eScience Collaboration
Forskningsfinansiär
Stiftelsen för strategisk forskning (SSF), BD15-0008
Tillgänglig från: 2022-12-09 Skapad: 2022-12-09 Senast uppdaterad: 2024-12-16Bibliografiskt granskad
Zhang, T., Hellander, A. & Toor, S. (2023). Efficient Hierarchical Storage Management Empowered by Reinforcement Learning Extended Abstract. In: : . Paper presented at 2023 IEEE 39th International Conference on Data Engineering (ICDE), Anaheim, California, 3-7 April, 2023 (pp. 3869-3870). IEEE
Öppna denna publikation i ny flik eller fönster >>Efficient Hierarchical Storage Management Empowered by Reinforcement Learning Extended Abstract
2023 (Engelska)Konferensbidrag, Publicerat paper (Refereegranskat)
Abstract [en]

With the rapid development of big data and cloud computing, data management has become increasingly challenging. A possible solution is to use an intelligent hierarchical (multi-tier) storage system (HSS). An HSS is a meta solution that consists of different storage frameworks organized as a jointly constructed storage pool. A built-in data migration policy that determines the optimal placement of the datasets in the hierarchy is essential. Placement decisions are a non-trivial task since they should be made according to the characteristics of the dataset, the tier status in a hierarchy, and access patterns. This paper presents an open-source hierarchical storage framework with a dynamic migration policy based on reinforcement learning (RL).

Ort, förlag, år, upplaga, sidor
IEEE, 2023
Nyckelord
Cloud computing, Storage management, Reinforcement learning, Big Data, Data engineering
Nationell ämneskategori
Datavetenskap (datalogi)
Forskningsämne
Datavetenskap
Identifikatorer
urn:nbn:se:uu:diva-525454 (URN)10.1109/ICDE55515.2023.00361 (DOI)979-8-3503-2227-9 (ISBN)979-8-3503-2228-6 (ISBN)
Konferens
2023 IEEE 39th International Conference on Data Engineering (ICDE), Anaheim, California, 3-7 April, 2023
Forskningsfinansiär
Stiftelsen för strategisk forskning (SSF), BD15-0008
Tillgänglig från: 2024-03-22 Skapad: 2024-03-22 Senast uppdaterad: 2024-03-22Bibliografiskt granskad
Chu, J., Singh, P. & Toor, S. (2023). Efficient Resource Scheduling for Distributed Infrastructures using Negotiation Capabilities. In: 2023 IEEE 16th International Conference on Cloud Computing (CLOUD): . Paper presented at IEEE 16th International Conference on Cloud Computing (CLOUD), 2-8 July, 2023, Chicago, IL, USA (pp. 486-492). Institute of Electrical and Electronics Engineers (IEEE)
Öppna denna publikation i ny flik eller fönster >>Efficient Resource Scheduling for Distributed Infrastructures using Negotiation Capabilities
2023 (Engelska)Ingår i: 2023 IEEE 16th International Conference on Cloud Computing (CLOUD), Institute of Electrical and Electronics Engineers (IEEE), 2023, s. 486-492Konferensbidrag, Publicerat paper (Refereegranskat)
Abstract [en]

The information explosion drives enterprises and individuals to rent cloud computing infrastructure for their applications in the cloud. However, the agreements between cloud computing providers and clients are often inefficient. We propose an agent-based auto-negotiation system for resource scheduling using fuzzy logic. Our method completes a one-to-one auto-negotiation process and generates optimal offers for providers and clients. We compare the impact of different member functions, fuzzy rule sets, and negotiation scenarios on the offers to optimize the system. Our proposed method efficiently utilizes resources and offers interpretability, high flexibility, and customization. We successfully train machine learning models to replace the fuzzy negotiation system, improving processing speed. The article also highlights potential future improvements to the proposed system and machine learning models. All codes and data are available as an open source repository.

Ort, förlag, år, upplaga, sidor
Institute of Electrical and Electronics Engineers (IEEE), 2023
Serie
IEEE International Conference on Cloud Computing, ISSN 2159-6182, E-ISSN 2159-6190
Nyckelord
distributed infrastructure, cloud computing, fuzzy logic, negotiation, intelligent agent, machine learning
Nationell ämneskategori
Datavetenskap (datalogi)
Forskningsämne
Beräkningsvetenskap; Datavetenskap
Identifikatorer
urn:nbn:se:uu:diva-509838 (URN)10.1109/CLOUD60044.2023.00065 (DOI)001085065100055 ()979-8-3503-0481-7 (ISBN)979-8-3503-0482-4 (ISBN)
Konferens
IEEE 16th International Conference on Cloud Computing (CLOUD), 2-8 July, 2023, Chicago, IL, USA
Projekt
eSSENCE - An eScience Collaboration
Forskningsfinansiär
Swedish National Infrastructure for Computing (SNIC)eSSENCE - An eScience Collaboration
Tillgänglig från: 2023-08-23 Skapad: 2023-08-23 Senast uppdaterad: 2023-11-14Bibliografiskt granskad
Zhang, W., Toor, S. & Al-Naday, M. (2023). Federated Machine Learning for Resource Allocation in Multi-domain Fog Ecosystems. In: 2023 IEEE 12th International Conference on Cloud Networking, CLOUDNET: . Paper presented at 12th International Conference on Cloud Networking (CloudNet), NOVEMBER 01-03, 2023, Hoboken, NJ, USA (pp. 290-298). Institute of Electrical and Electronics Engineers (IEEE)
Öppna denna publikation i ny flik eller fönster >>Federated Machine Learning for Resource Allocation in Multi-domain Fog Ecosystems
2023 (Engelska)Ingår i: 2023 IEEE 12th International Conference on Cloud Networking, CLOUDNET, Institute of Electrical and Electronics Engineers (IEEE), 2023, s. 290-298Konferensbidrag, Publicerat paper (Refereegranskat)
Abstract [en]

The proliferation of the Internet of Things (IoT) has incentivised extending cloud resources to the edge in what is deemed fog computing. The latter is manifesting as an ecosystem of connected clouds, geo-dispersed and of diverse capacities. In such ecosystem, workload allocation to fog services becomes a non-trivial challenge. Users' demand at the edge is diverse, which does not lend to straightforward resource planning. Conversely, running services at the edge may leverage proximity, but it comes at higher operational cost let alone increasing risk of resource straining. Consequently, there is a need for intelligent yet scalable allocation solutions that counter the adversity of demand, while efficiently distributing load between the edge and farther clouds. Machine learning is increasingly adopted in resource planning. This paper proposes a federated deep reinforcement learning system, based on deep Q-learning network (DQN), for workload distribution in a fog ecosystem. The proposed solution adapts a DQN to optimize local workload allocations, made by single gateways. Federated learning is incorporated to allow multiple gateways in a network to collaboratively build knowledge of users' demand. This is leveraged to establish consensus on the fraction of workload allocated to different fog nodes, using lower data supply and computation resources. System performance is evaluated using realistic demand from Google Cluster Workload Traces 2019. Evaluation results show over 50% reduction in failed allocations when spreading users over larger number of gateways, given fixed number of fog nodes. The results further illustrate the trade-offs between performance and cost under different conditions.

Ort, förlag, år, upplaga, sidor
Institute of Electrical and Electronics Engineers (IEEE), 2023
Serie
IEEE International Conference on Cloud Networking, ISSN 2374-3239, E-ISSN 2771-5663
Nyckelord
Workload Allocation, Federated Learning, Deep Q-network, Fog networks, Federated Average Aggregation
Nationell ämneskategori
Datorsystem Datavetenskap (datalogi) Kommunikationssystem
Forskningsämne
Beräkningsvetenskap; Datavetenskap med inriktning mot datorkommunikation
Identifikatorer
urn:nbn:se:uu:diva-536184 (URN)10.1109/CloudNet59005.2023.10490033 (DOI)001222507300018 ()9798350313062 (ISBN)9798350313079 (ISBN)
Konferens
12th International Conference on Cloud Networking (CloudNet), NOVEMBER 01-03, 2023, Hoboken, NJ, USA
Projekt
eSSENCE - An eScience Collaboration
Tillgänglig från: 2024-08-14 Skapad: 2024-08-14 Senast uppdaterad: 2025-01-07Bibliografiskt granskad
AL-Naday, M., Reed, M., Dobre, V., Toor, S., Volckaert, B. & De Turck, F. (2023). Service-based Federated Deep Reinforcement Learning for Anomaly Detection in Fog Ecosystems. In: 2023 26th Conference on Innovation in Clouds, Internet and Networks and Workshops (ICIN): . Paper presented at 2023 26th Conference on Innovation in Clouds, Internet and Networks and Workshops (ICIN), 6-9 March, Paris, France (pp. 121-128). Institute of Electrical and Electronics Engineers (IEEE)
Öppna denna publikation i ny flik eller fönster >>Service-based Federated Deep Reinforcement Learning for Anomaly Detection in Fog Ecosystems
Visa övriga...
2023 (Engelska)Ingår i: 2023 26th Conference on Innovation in Clouds, Internet and Networks and Workshops (ICIN), Institute of Electrical and Electronics Engineers (IEEE), 2023, s. 121-128Konferensbidrag, Publicerat paper (Refereegranskat)
Abstract [en]

With Digital transformation, the diversity of services and infrastructure in backhaul fog network(s) is rising to unprecedented levels. This is causing a rising threat of a wider range of cyber attacks coupled with a growing integration of constrained range of infrastructure, particularly seen at the network edge. Deep reinforcement-based learning is an attractive approach to detecting attacks, as it allows less dependency on labeled data with better ability to classify different attacks. However, current approaches to learning are known to be computationally expensive (cost) and the learning experience can be negatively impacted by the presence of outliers and noise (quality). This work tackles both the cost and quality challenges with a novel service-based federated deep reinforcement learning solution, enabling anomaly detection and attack classification at a reduced data cost and with better quality. The federated settings in the proposed approach enable multiple edge units to create clusters that follow a bottom-up learning approach. The proposed solution adapts deep Q-learning Network (DQN) for service-tunable flow classification, and introduces a novel federated DQN (FDQN) for federated learning. Through such targeted training and validation, variation in data patterns and noise is reduced. This leads to improved performance per service with lower training cost. Performance and cost of the solution, along with sensitivity to exploration parameters are evaluated using an example publicly available dataset (UNSW-NB15). Evaluation results show the proposed solution to maintain detection accuracy with lower data supply, while improving the classification rate by a factor of ≈ 2.

Ort, förlag, år, upplaga, sidor
Institute of Electrical and Electronics Engineers (IEEE), 2023
Serie
International Conference on Intelligence in Next Generation Networks, ISSN 2162-3414, E-ISSN 2472-8144
Nyckelord
cyber security, federated deep reinforcement learning, Deep Q-Learning, anomaly detection, cloud-to-edge continuum, fog computing
Nationell ämneskategori
Datavetenskap (datalogi)
Forskningsämne
Beräkningsvetenskap
Identifikatorer
urn:nbn:se:uu:diva-508788 (URN)10.1109/ICIN56760.2023.10073495 (DOI)001006975300023 ()979-8-3503-9804-5 (ISBN)979-8-3503-9805-2 (ISBN)
Konferens
2023 26th Conference on Innovation in Clouds, Internet and Networks and Workshops (ICIN), 6-9 March, Paris, France
Projekt
eSSENCE - An eScience Collaboration
Tillgänglig från: 2023-08-09 Skapad: 2023-08-09 Senast uppdaterad: 2023-08-24Bibliografiskt granskad
Projekt
Beräkningsbaserad utforskning av högdimensionella stokastiska biokemiska nätverk [2023-05167_VR]; Uppsala universitet
Organisationer
Identifikatorer
ORCID-id: ORCID iD iconorcid.org/0000-0003-0302-6276

Sök vidare i DiVA

Visa alla publikationer