Logo: to the web site of Uppsala University

uu.sePublications from Uppsala University
Change search
Link to record
Permanent link

Direct link
Publications (10 of 54) Show all publications
Zheng, J., Zhao, Y., Li, Y., Li, J., Wang, L. & Yuan, D. (2025). Dynamic flexible flow shop scheduling via cross-attention networks and multi-agent reinforcement learning. Journal of manufacturing systems, 80, 395-411
Open this publication in new window or tab >>Dynamic flexible flow shop scheduling via cross-attention networks and multi-agent reinforcement learning
Show others...
2025 (English)In: Journal of manufacturing systems, ISSN 0278-6125, E-ISSN 1878-6642, Vol. 80, p. 395-411Article in journal (Refereed) Published
Abstract [en]

With the increasing uncertainty in production environments and changes in market demand, flexible and efficient scheduling solutions have become particularly critical. However, existing research mainly focuses on static scheduling or relatively simple dynamic scheduling problems, which are inadequate to address the complexities of actual production processes. This paper considers the dynamic flexible flow shop scheduling problem (DFFSP) characterized by diverse processes, complexity, and high flexibility, and proposes a multi-agent reinforcement learning algorithm based on cross-attention networks (MARL_CA). First, this paper proposes a novel state feature representation method, which represents the job processing data and the production Gantt chart as a state matrix, fully reflecting the environment state in the scheduling process. In addition, a cross-attention network is proposed to extract state features, enabling efficient discovery of complex relationships between jobs and machines, thereby enhancing the model's ability to understand intricate features. The model is trained using an independent proximal policy optimization (IPPO) based on the actor-critic method to help agents learn accurate and efficient scheduling strategies. Experimental results on a large number of static and dynamic scheduling instances demonstrate that the proposed algorithm outperforms traditional heuristic rules and other advanced algorithms, exhibiting strong learning efficiency and generalization capability.

Place, publisher, year, edition, pages
Elsevier, 2025
Keywords
Dynamic flexible flow shop scheduling problem, Cross-attention networks, Multi-agent reinforcement learning, Independent proximal policy optimization
National Category
Computer Sciences Production Engineering, Human Work Science and Ergonomics
Identifiers
urn:nbn:se:uu:diva-557209 (URN)10.1016/j.jmsy.2025.03.005 (DOI)001458531000001 ()2-s2.0-105001005184 (Scopus ID)
Available from: 2025-05-27 Created: 2025-05-27 Last updated: 2025-05-27Bibliographically approved
Chen, D., Deng, T., Jia, J., Feng, S. & Yuan, D. (2025). Mobility-aware decentralized federated learning with joint optimization of local iteration and leader selection for vehicular networks. Computer Networks, 263, Article ID 111232.
Open this publication in new window or tab >>Mobility-aware decentralized federated learning with joint optimization of local iteration and leader selection for vehicular networks
Show others...
2025 (English)In: Computer Networks, ISSN 1389-1286, E-ISSN 1872-7069, Vol. 263, article id 111232Article in journal (Refereed) Published
Abstract [en]

Federated learning (FL) emerges as a promising approach to empower vehicular networks, composed by intelligent connected vehicles equipped with advanced sensing, computing, and communication capabilities. While previous studies have explored the application of FL in vehicular networks, they have largely overlooked the intricate challenges arising from the mobility of vehicles and resource constraints. In this paper, we propose a framework of mobility-aware decentralized federated learning (MDFL) for vehicular networks. In this framework, nearby vehicles train an FL model collaboratively, yet in a decentralized manner. We formulate a local iteration and leader selection joint optimization problem (LSOP) to improve the training efficiency of MDFL. For problem solving, we first reformulate LSOP as a decentralized partially observable Markov decision process (Dec-POMDP), and then develop an effective optimization algorithm based on multi-agent proximal policy optimization (MAPPO) to solve Dec-POMDP. Finally, we verify the performance of the proposed algorithm by comparing it with other algorithms.

Place, publisher, year, edition, pages
Elsevier, 2025
Keywords
Decentralized federated learning, Mobility-aware, Multi-agent proximal policy optimization, Vehicular networks
National Category
Computer Sciences Control Engineering Communication Systems Telecommunications
Identifiers
urn:nbn:se:uu:diva-554672 (URN)10.1016/j.comnet.2025.111232 (DOI)001458274100001 ()2-s2.0-105001372566 (Scopus ID)
Funder
Swedish Research Council, 2022-04123
Available from: 2025-04-15 Created: 2025-04-15 Last updated: 2025-04-15Bibliographically approved
Yu, Z., Zhao, Y., Chu, X. & Yuan, D. (2025). Online Learning for Intelligent Thermal Management of Interference-Coupled and Passively Cooled Base Stations. IEEE Transactions on Machine Learning in Communications and Networking, 3, 64-79
Open this publication in new window or tab >>Online Learning for Intelligent Thermal Management of Interference-Coupled and Passively Cooled Base Stations
2025 (English)In: IEEE Transactions on Machine Learning in Communications and Networking, E-ISSN 2831-316X, Vol. 3, p. 64-79Article in journal (Refereed) Published
Abstract [en]

Passively cooled base stations (PCBSs) have emerged to deliver better cost and energy efficiency. However, passive cooling necessitates intelligent thermal control via traffic management, i.e., the instantaneous data traffic or throughput of a PCBS directly impacts its thermal performance. This is particularly challenging for outdoor deployment of PCBSs because the heat dissipation efficiency is uncertain and fluctuates over time. What is more, the PCBSs are interference-coupled in multi-cell scenarios. Thus, a higher-throughput PCBS leads to higher interference to the other PCBSs, which, in turn, would require more resource consumption to meet their respective throughput targets. In this paper, we address online decision-making for maximizing the total downlink throughput for a multi-PCBS system subject to constraints related on operating temperature. We demonstrate that a reinforcement learning (RL) approach, specifically soft actor-critic (SAC), can successfully perform throughput maximization while keeping the PCBSs cool, by adapting the throughput to time-varying heat dissipation conditions. Furthermore, we design a denial and reward mechanism that effectively mitigates the risk of overheating during the exploration phase of RL. Simulation results show that our approach achieves up to 88.6% of the global optimum. This is very promising, as our approach operates without prior knowledge of future heat dissipation efficiency, which is required by the global optimum.

Place, publisher, year, edition, pages
Institute of Electrical and Electronics Engineers (IEEE), 2025
Keywords
Reinforcement learning, interference, passive cooling, throughput maximization, thermal management
National Category
Telecommunications
Identifiers
urn:nbn:se:uu:diva-555954 (URN)10.1109/tmlcn.2024.3517619 (DOI)001487807300001 ()
Funder
Swedish Research Council, 2022-04123EU, European Research Council, 101086219
Available from: 2025-05-07 Created: 2025-05-07 Last updated: 2025-05-23Bibliographically approved
Zhao, Y., Yu, Z. & Yuan, D. (2024). Caching With Personalized and Incumbent-Aware Recommendation: Modeling and Optimization. IEEE Transactions on Mobile Computing, 23(10), 9595-9613
Open this publication in new window or tab >>Caching With Personalized and Incumbent-Aware Recommendation: Modeling and Optimization
2024 (English)In: IEEE Transactions on Mobile Computing, ISSN 1536-1233, E-ISSN 1558-0660, Vol. 23, no 10, p. 9595-9613Article in journal (Refereed) Published
Abstract [en]

Caching popular contents at cell edge has been recognized as a promising way to facilitate rapid content delivery and alleviate backhaul burden. The content popularity is greatly influenced by recommendations by content providers. In this paper, we leverage this fact to jointly optimize caching and recommendation towards higher caching efficiency. We focus on both personalized and incumbent-aware recommendation. The incumbent content refers to the content that a user is currently browsing, resulted by the user's short-term interest. We model and formulate the resulting cache efficiency maximization problem subject to user satisfaction requirements. We prove the NP-hardness of the problem, and reformulate it using integer linear programming, enabling to solve optimally small-scale instances. Based on problem analysis with a graph representation, we derive three polynomial-time algorithms, where the recommendation sub-problem is solved to global optimum. Among these algorithms, the first two are based on sub-modularity, with  1−e−1 approximation guarantee under mild conditions, while the last one is an alternation-based algorithm with fast convergence. Numerical results show the close-to-optimal performance of the proposed algorithms.

Place, publisher, year, edition, pages
Institute of Electrical and Electronics Engineers (IEEE), 2024
Keywords
Optimization, Approximation algorithms, Mobile computing, Capacity planning, Web sites, Videos, Video on demand, Recommendation, caching, content delivery network (CDN), approximation algorithm
National Category
Computer Sciences
Identifiers
urn:nbn:se:uu:diva-538826 (URN)10.1109/TMC.2024.3365465 (DOI)001306818600037 ()
Funder
Swedish Research Council
Available from: 2024-09-30 Created: 2024-09-30 Last updated: 2024-09-30Bibliographically approved
Yu, Z., Zhao, Y., You, L. & Yuan, D. (2024). Learn to Stay Cool: Online Load Management for Passively Cooled Base Stations. In: 2024 IEEE Wireless Communications and Networking Conference, WCNC 2024: . Paper presented at IEEE Wireless Communications and Networking Conference (IEEE WCNC), April 21-24, 2024, Dubai, United Arab Emirates (pp. 1-6). Institute of Electrical and Electronics Engineers (IEEE)
Open this publication in new window or tab >>Learn to Stay Cool: Online Load Management for Passively Cooled Base Stations
2024 (English)In: 2024 IEEE Wireless Communications and Networking Conference, WCNC 2024, Institute of Electrical and Electronics Engineers (IEEE), 2024, p. 1-6Conference paper, Published paper (Refereed)
Abstract [en]

Passively cooled base stations (PCBSs) are highly relevant for achieving better efficiency in cost and energy. However, dealing with the thermal issue via load management, particularly for outdoor deployment of PCBS, becomes crucial. This is a challenge because the heat dissipation efficiency is subject to (uncertain) fluctuation over time. Moreover, load management is an online decision-making problem by its nature. In this paper, we demonstrate that a reinforcement learning (RL) approach, specifically Soft Actor-Critic (SAC), enables to make a PCBS stay cool. The proposed approach has the capability of adapting the PCBS load to the time-varying heat dissipation. In addition, we propose a denial and reward mechanism to mitigate the risk of overheating from the exploration such that the proposed RL approach can be implemented directly in a practical environment, i.e., online RL. Numerical results demonstrate that the learning approach can achieve as much as 88.6% of the global optimum. This is impressive, as our approach is used in an online fashion to perform decision-making without the knowledge of future heat dissipation efficiency, whereas the global optimum is computed assuming the presence of oracle that fully eliminates uncertainty. This paper pioneers the approach to the online PCBSs load management problem.

Place, publisher, year, edition, pages
Institute of Electrical and Electronics Engineers (IEEE), 2024
Series
IEEE Wireless Communications and Networking Conference, ISSN 1525-3511
Keywords
Passive cooling, load management, deep reinforcement learning
National Category
Computer Sciences
Identifiers
urn:nbn:se:uu:diva-539623 (URN)10.1109/WCNC57260.2024.10571225 (DOI)001268569304052 ()9798350303582 (ISBN)9798350303599 (ISBN)
Conference
IEEE Wireless Communications and Networking Conference (IEEE WCNC), April 21-24, 2024, Dubai, United Arab Emirates
Funder
Swedish Research CouncilEU, Horizon 2020
Available from: 2024-10-02 Created: 2024-10-02 Last updated: 2025-05-07Bibliographically approved
Forghani, K., Carlsson, M., Flener, P., Fredriksson, M., Pearson, J. & Yuan, D. (2024). Maximizing Value Yield in Wood Industry through Flexible Sawing and Product Grading Based on Wane and Log Shape. Computers and Electronics in Agriculture, 216, Article ID 108513.
Open this publication in new window or tab >>Maximizing Value Yield in Wood Industry through Flexible Sawing and Product Grading Based on Wane and Log Shape
Show others...
2024 (English)In: Computers and Electronics in Agriculture, ISSN 0168-1699, E-ISSN 1872-7107, Vol. 216, article id 108513Article in journal (Refereed) Published
Abstract [en]

The optimization of sawing processes in the wood industry is critical for maximizing efficiency and profitability. The introduction of computerized tomography scanners provides sawmill operators with three-dimensional internal models of logs, which can be used to assess value and yield more accurately. We present a methodology for solving the sawing optimization problem employing a flexible sawing scheme that allows greater flexibility in cutting logs into products while considering product quality classes influenced by wane defects. The methodology has two phases: preprocessing and optimization. In the preprocessing phase, two alternative algorithms are given that generate and evaluate the potential sawing positions of products by considering the 3D surface of the log, product size requirements, and product quality classes. In the optimization phase, a maximum set-packing problem is solved for the preprocessed data using mixed-integer programming (MIP), aiming to obtain a feasible cut pattern that maximizes value yield. This is implemented in a system named FlexSaw, which takes advantage of parallel computation during the preprocessing phase and utilizes a MIP solver during the optimization phase. The proposed sawing methods are evaluated on the Swedish Pine Stem Bank. Additionally, FlexSaw is compared with an existing tool that utilizes cant sawing. Results demonstrate the superiority of flexible sawing. While the practical feasibility of implementing a flexible way of sawing logs is constrained by the limitations of current sawmill machinery, the potential increase in yield promotes the exploration of alternative machinery in the wood industry.

Place, publisher, year, edition, pages
Elsevier, 2024
National Category
Wood Science
Research subject
Wood Science and Engineering
Identifiers
urn:nbn:se:uu:diva-517316 (URN)10.1016/j.compag.2023.108513 (DOI)001139709900001 ()
Funder
Vinnova, 2020-03734
Available from: 2023-11-29 Created: 2023-12-06 Last updated: 2024-02-07Bibliographically approved
Yu, Z., Deng, T., Zhao, Y. & Yuan, D. (2024). Multi-cell content caching: Optimization for cost and information freshness. Computer Networks, 247, Article ID 110420.
Open this publication in new window or tab >>Multi-cell content caching: Optimization for cost and information freshness
2024 (English)In: Computer Networks, ISSN 1389-1286, E-ISSN 1872-7069, Vol. 247, article id 110420Article in journal (Refereed) Published
Abstract [en]

In multi-access edge computing (MEC) systems, there are multiple local cache servers caching contents to satisfy the users' requests, instead of letting the users download via the remote cloud server. In this paper, a multi -cell content scheduling problem (MCSP) in MEC systems is considered. Taking into account jointly the freshness of the cached contents and the traffic data costs, we study how to schedule content updates along time in a multi -cell setting. Different from single -cell scenarios, a user may have multiple candidate local cache servers, and thus the caching decisions in all cells must be jointly optimized. We first prove that MCSP is NP -hard, then we formulate MCSP using integer linear programming, by which the optimal scheduling can be obtained for small-scale instances. For problem solving of large scenarios, via a mathematical reformulation, we derive a scalable optimization algorithm based on repeated column generation. Our performance evaluation shows the effectiveness of the proposed algorithm in comparison to an off -the -shelf commercial solver and a popularity -based caching.

Place, publisher, year, edition, pages
Elsevier, 2024
Keywords
Age of information, Caching, Multi-cell
National Category
Communication Systems
Identifiers
urn:nbn:se:uu:diva-532164 (URN)10.1016/j.comnet.2024.110420 (DOI)001235269000001 ()
Funder
Swedish Research Council, 2022-04123
Available from: 2024-06-24 Created: 2024-06-24 Last updated: 2025-05-07Bibliographically approved
Zhao, Y. & Yuan, D. (2024). On optimization formulations for radio resource allocation subject to common transmission rate. Computers & Operations Research, 161, Article ID 106427.
Open this publication in new window or tab >>On optimization formulations for radio resource allocation subject to common transmission rate
2024 (English)In: Computers & Operations Research, ISSN 0305-0548, E-ISSN 1873-765X, Vol. 161, article id 106427Article in journal (Refereed) Published
Abstract [en]

We study a radio resource allocation problem in mobile communication systems. As the distinct characteristic of this problem, a common data transmission rate is used on all channels allocated to a user. Because the channels differ in their quality, for each user the achievable rate varies by channel. Thus allocating more channels does not necessarily increase the total rate, as the common rate is constrained to be the lowest one supported by the allocated channels. Radio resource allocation subject to the common-rate constraint is of practical relevance, though little attention has been paid to modeling and solving the problem. We take a mathematical optimization perspective with focus on modeling. We first provide a complexity analysis. Next, several integer linear programming (ILP) formulations for the problem, including compact as well as non-compact models, are derived. The bulk of our analysis consists in a rigorous comparative study of their linear programming (LP) relaxations, to reveal the relationship between the formulations in terms of bounding. Computational experiments are presented to illustrate the numerical performance in bounding and LP-assisted problem solving. Our theoretical analysis and numerical results together serve the aim of setting a ground for the next step of developing model-based and tailored optimization methods.

Place, publisher, year, edition, pages
Elsevier, 2024
Keywords
Integer programming, Modeling, Radio resource allocation
National Category
Communication Systems
Identifiers
urn:nbn:se:uu:diva-515465 (URN)10.1016/j.cor.2023.106427 (DOI)001084126800001 ()
Funder
Swedish Research Council
Available from: 2023-11-09 Created: 2023-11-09 Last updated: 2024-01-18Bibliographically approved
Ahani, G. & Yuan, D. (2024). Optimal Content Caching and Recommendation With Age of Information. IEEE Transactions on Mobile Computing, 23(1), 689-704
Open this publication in new window or tab >>Optimal Content Caching and Recommendation With Age of Information
2024 (English)In: IEEE Transactions on Mobile Computing, ISSN 1536-1233, E-ISSN 1558-0660, Vol. 23, no 1, p. 689-704Article in journal (Refereed) Published
Abstract [en]

Content caching at the network edge is an effective way of mitigating backhaul load and improving user experience. Caching efficiency can be enhanced by content recommendation and by keeping the information fresh. By content recommendation, a requested content that is not in the cache can be alternatively satisfied by a related cached content recommended by the system. Information freshness can be quantified by age of information (AoI). This article has the following contributions. First, we address optimal scheduling of cache updates for a time-slotted system accounting for content recommendation and AoI, and to the best of our knowledge, there is no work that has jointly taken into account these aspects. Next, we rigorously prove the problem's NP-hardness. Then, we derive an integer linear formulation, by which the optimal solution can be obtained for small-scale scenarios. On the algorithmic side, our contributions include the development of an effective algorithm based on Lagrangian decomposition, and efficient algorithms for solving the resulting subproblems. Our algorithm computes a bound that can be used to evaluate the performance of any suboptimal solution. We conduct simulations to show the effectiveness of our algorithm.

Place, publisher, year, edition, pages
IEEE Computer Society, 2024
Keywords
Age of information, caching, content recommendation, scheduling
National Category
Communication Systems Telecommunications Computer Sciences
Identifiers
urn:nbn:se:uu:diva-525049 (URN)10.1109/TMC.2022.3213782 (DOI)001136301500057 ()
Available from: 2024-03-19 Created: 2024-03-19 Last updated: 2024-03-19Bibliographically approved
Deng, T., Chen, D., Jia, J., Dong, M., Ota, K., Yu, Z. & Yuan, D. (2024). Optimizing Resource Allocation and Request Routing for AI-Generated Content (AIGC) Services in Mobile Edge Networks With Cell Coupling. IEEE Transactions on Vehicular Technology, 73(11), 17911-17916
Open this publication in new window or tab >>Optimizing Resource Allocation and Request Routing for AI-Generated Content (AIGC) Services in Mobile Edge Networks With Cell Coupling
Show others...
2024 (English)In: IEEE Transactions on Vehicular Technology, ISSN 0018-9545, E-ISSN 1939-9359, Vol. 73, no 11, p. 17911-17916Article in journal (Refereed) Published
Abstract [en]

In this paper, we investigate the deployment and service of pre-trained foundation models (PFMs) in mobile edge networks with cell coupling. We formulate a joint resource allocation and request routing optimization problem (RARP) to achieve a trade-off between the accuracy loss and cost of artificial intelligence-generated content (AIGC). For problem solving, we propose an alternating optimization algorithm (AOA) that decomposes RARP into two sub-problems and iteratively optimizes them. Specifically, for the first sub-problem, we reformulate it as a linear programming problem and use the off-the-shelf optimization solver to solve it. For the other sub-problem, we propose a deep reinforcement learning based algorithm to optimize the deployment to PFMs. Performance evaluations validate the efficiency of AOA.

Place, publisher, year, edition, pages
IEEE, 2024
Keywords
Servers, Costs, Accuracy, Computational modeling, Optimization, Artificial neural networks, Indexes, AIGC, mobile edge networks, deep reinforcement learning
National Category
Communication Systems Control Engineering Signal Processing Computational Mathematics Telecommunications
Identifiers
urn:nbn:se:uu:diva-545463 (URN)10.1109/TVT.2024.3421351 (DOI)001359239100011 ()
Funder
Swedish Research Council, 2022-04123
Available from: 2025-01-03 Created: 2025-01-03 Last updated: 2025-01-03Bibliographically approved
Projects
How to Empty the Queues Fast? - New Perspectives of Fundamental Performance Analysis of Wireless Networks via Mathematical Programming [2013-05649_VR]; Uppsala University5G Network Performance: A Mathematical Optimization Perspective [2018-05247_VR]; Uppsala UniversityOperations Research in a Millisecond: Real-Time Resource Optimization for Mobile Networks [2022-04123_VR]; Uppsala University; Publications
Chen, D., Deng, T., Jia, J., Feng, S. & Yuan, D. (2025). Mobility-aware decentralized federated learning with joint optimization of local iteration and leader selection for vehicular networks. Computer Networks, 263, Article ID 111232. Yu, Z., Zhao, Y., Chu, X. & Yuan, D. (2025). Online Learning for Intelligent Thermal Management of Interference-Coupled and Passively Cooled Base Stations. IEEE Transactions on Machine Learning in Communications and Networking, 3, 64-79
Organisations
Identifiers
ORCID iD: ORCID iD iconorcid.org/0000-0001-8119-5206

Search in DiVA

Show all publications