This paper presents an in-depth study of the application of Multi-Agent Deep Deterministic Policy Gradient (MADDPG) algorithms with an exploratory strategy for duty cycle scheduling (DCS) in the wireless sensor networks (WSNs). The focus is on optimizing the performance of sensor nodes in terms of energy efficiency and event detection rates under varying environmental conditions. Through a series of simulations, we investigate the impact of key parameters such as the sensor specificity constant α and the Poisson rate of events on the learning and operational efficacy of sensor nodes. Our results demonstrate that the MADDPG algorithm with an exploratory strategy outperforms traditional reinforcement learning algorithms, particularly in environments characterized by high event rates and the need for precise energy management. The exploratory strategy enables a more effective balance between exploration and exploitation, leading to improved policy learning and adaptation in dynamic and uncertain environments. Furthermore, we explore the sensitivity of different algorithms to the tuning of the sensor specificity constant α, revealing that lower values generally yield better performance by reducing energy consumption without significantly compromising event detection. The study also examines the algorithms' robustness against the variability introduced by different event Poisson rates, emphasizing the importance of algorithm selection and parameter tuning in practical WSN applications. The insights gained from this research provide valuable guidelines for the deployment of sensor networks in real-world scenarios, where the trade-off between energy consumption and event detection is critical. Our findings suggest that the integration of exploratory strategies in MADDPG algorithms can significantly enhance the performance and reliability of sensor nodes in WSNs.
Published in | International Journal of Sensors and Sensor Networks (Volume 12, Issue 1) |
DOI | 10.11648/j.ijssn.20241201.11 |
Page(s) | 1-12 |
Creative Commons |
This is an Open Access article, distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution and reproduction in any medium or format, provided the original work is properly cited. |
Copyright |
Copyright © The Author(s), 2024. Published by Science Publishing Group |
Multi-Agent Systems, Deep Reinforcement Learning, MADDPG, Wireless Sensor Networks, Energy Efficiency, Event Detection, Exploratory Strategy
[1] | Begum S, Wang S, Krishnamachari B, Helmy A. ELECTION: Energy-efficient and low-latency scheduling technique for wireless sensor networks. Proceedings of the 29th Annual IEEE Conference on Local Computer Networks (LCN). Tampa, FL, USA: IEEE; 2004. p. 16-18. https://orcid.org/10.1109/LCN.2004.49 |
[2] | Dantu R, Abbas K, O’Neill M II, Mikler A. Data centric modeling of environmental sensor networks. Proceedings of the IEEE Globecom. Dallas, TX, USA: IEEE, 2004: 447-452. https://orcid.org/10.1109/GLOCOMW.2004.1417621 |
[3] | Yang Y R, Lam S S. General AIMD congestion control. Proceedings of the 2000 International Conference on Network Protocols, 2000: 187-198. https://orcid.org/10.1109/ICNP.2000.896303 |
[4] | E. Ucer, M. C. Kisacikoglu and M. Yuksel. Analysis of Decentralized AIMD-based EV Charging Control. 2019 IEEE Power & Energy Society General Meeting (PESGM), Atlanta, GA, USA. 2019: 1-5. https://orcid.org/10.1109/PESGM40551.2019.8973725 |
[5] | Lee J, Lee D, Kim J, Cho W, Pajak J. A dynamic sensing cycle decision scheme for energy efficiency and data reliability in wireless sensor networks. Lecture Notes in Computer Science. 2007; 4681: 218-229. https://orcid.org/10.1007/978-3-540-74171-8_22 |
[6] | Jain A, Chang E Y. Adaptive sampling for sensor networks. Proceedings of the International Workshop on Data Management for Sensor Networks. Toronto, ON, Canada; 2004, 72 p. 10-16. https://orcid.org/10.1145/1052199.1052202 |
[7] | Stone P, Veloso M M. Team-partitioned, Opaque-transition Reinforcement Learning. Proceedings of the third annual conference on Autonomous Agents; 1999. |
[8] | Foerster J, Nardelli N, Farquhar G, Afouras T, Torr P H S, Kohli P, Whiteson S. Stabilising Experience Replay for Deep Multi-Agent Reinforcement Learning. ARXIV-CS.AI; 2017. |
[9] | Nguyen T T, Nguyen N D, Nahavandi S. Deep Reinforcement Learning For Multi-Agent Systems: A Review Of Challenges, Solutions And Applications. ARXIV-CS.LG; 2018. |
[10] | Samvelyan M, Rashid T, Schroeder de Witt C, Farquhar G, Nardelli N, Rudner T G J, Hung C M, Torr P H S, Foerster J, Whiteson S. The StarCraft Multi-Agent Challenge. ARXIV-CS.LG; 2019. |
[11] | Chu T, Wang J, Codecà L, Li Z. Multi-Agent Deep Reinforcement Learning For Large-scale Traffic Signal Control [J]. ARXIV-CS.LG, 2019. |
[12] | Lowe R, Wu Y I, Tamar A, Harb J, Abbeel P, Mordatch I. Multi-agent actor-critic for mixed cooperative-competitive environments. Advances in neural information processing systems. 2017, 30. |
[13] | Wang D, Zhang W, Song B, Du X, Guizani M. Market-Based Model In CR-WSN: A Q-Probabilistic Multi-agent Learning Approach. ARXIV-CS.MA; 2019. |
[14] | Zhang K, Yang Z, Başar T. Multi-Agent Reinforcement Learning: A Selective Overview of Theories and Algorithms. ARXIV-CS.LG; 2019. |
[15] | Wang C, Shen X, Wang H, Xie W, Zhang H, Mei H. Multi-Agent Reinforcement Learning-Based Routing Protocol for Underwater Wireless Sensor Networks with Value of Information. IEEE Sensors Journal. 2023. https://orcid.org/10.1109/JSEN.2023.3345947 |
[16] | Ren J, et al. MeFi: Mean Field Reinforcement Learning for Cooperative Routing in Wireless Sensor Network. IEEE Internet of Things Journal. 2024 Jan 1; 11(1): 995-1011. https://orcid.org/10.1109/JIOT.2023.3289888. |
[17] | Liang Y, Wu H, Wang H. Asynchronous Multi-Agent Reinforcement Learning for Collaborative Partial Charging in Wireless Rechargeable Sensor Networks. IEEE Transactions on Mobile Computing. 2023. https://orcid.org/10.1109/TMC.2023.3299238 |
[18] | Cao J, Liu J, Dou J, Hu C, Cheng J, Wang S. Multi-Agent Reinforcement Learning Charging Scheme for Underwater Rechargeable Sensor Networks. IEEE Communications Letters. 2023. https://orcid.org/10.1109/LCOMM.2023.3345362 |
[19] | Zhao M, Wang G, Fu Q, Guo X, Chen Y, Li T, Liu X. MW-MADDPG: a meta-learning based decision-making method for collaborative UAV swarm. Frontiers in Neurorobotics. 2023; 17. https://orcid.org/10.3389/fnbot.2023.1243174 |
[20] | Liu X, Tan Y. Feudal Latent Space Exploration for Coordinated Multi-Agent Reinforcement Learning. IEEE Transactions on Neural Networks and Learning Systems. 2023 Oct; 34(10): 7775-7783. https://orcid.org/10.1109/TNNLS.2022.3146201 |
APA Style
Wu, L., Liu, P., Qu, J., Zhang, C., Zhang, B. (2024). Duty Cycle Scheduling in Wireless Sensor Networks Using an Exploratory Strategy-Directed MADDPG Algorithm. International Journal of Sensors and Sensor Networks, 12(1), 1-12. https://doi.org/10.11648/j.ijssn.20241201.11
ACS Style
Wu, L.; Liu, P.; Qu, J.; Zhang, C.; Zhang, B. Duty Cycle Scheduling in Wireless Sensor Networks Using an Exploratory Strategy-Directed MADDPG Algorithm. Int. J. Sens. Sens. Netw. 2024, 12(1), 1-12. doi: 10.11648/j.ijssn.20241201.11
@article{10.11648/j.ijssn.20241201.11, author = {Liangshun Wu and Peilin Liu and Junsuo Qu and Cong Zhang and Bin Zhang}, title = {Duty Cycle Scheduling in Wireless Sensor Networks Using an Exploratory Strategy-Directed MADDPG Algorithm}, journal = {International Journal of Sensors and Sensor Networks}, volume = {12}, number = {1}, pages = {1-12}, doi = {10.11648/j.ijssn.20241201.11}, url = {https://doi.org/10.11648/j.ijssn.20241201.11}, eprint = {https://article.sciencepublishinggroup.com/pdf/10.11648.j.ijssn.20241201.11}, abstract = {This paper presents an in-depth study of the application of Multi-Agent Deep Deterministic Policy Gradient (MADDPG) algorithms with an exploratory strategy for duty cycle scheduling (DCS) in the wireless sensor networks (WSNs). The focus is on optimizing the performance of sensor nodes in terms of energy efficiency and event detection rates under varying environmental conditions. Through a series of simulations, we investigate the impact of key parameters such as the sensor specificity constant α and the Poisson rate of events on the learning and operational efficacy of sensor nodes. Our results demonstrate that the MADDPG algorithm with an exploratory strategy outperforms traditional reinforcement learning algorithms, particularly in environments characterized by high event rates and the need for precise energy management. The exploratory strategy enables a more effective balance between exploration and exploitation, leading to improved policy learning and adaptation in dynamic and uncertain environments. Furthermore, we explore the sensitivity of different algorithms to the tuning of the sensor specificity constant α, revealing that lower values generally yield better performance by reducing energy consumption without significantly compromising event detection. The study also examines the algorithms' robustness against the variability introduced by different event Poisson rates, emphasizing the importance of algorithm selection and parameter tuning in practical WSN applications. The insights gained from this research provide valuable guidelines for the deployment of sensor networks in real-world scenarios, where the trade-off between energy consumption and event detection is critical. Our findings suggest that the integration of exploratory strategies in MADDPG algorithms can significantly enhance the performance and reliability of sensor nodes in WSNs. }, year = {2024} }
TY - JOUR T1 - Duty Cycle Scheduling in Wireless Sensor Networks Using an Exploratory Strategy-Directed MADDPG Algorithm AU - Liangshun Wu AU - Peilin Liu AU - Junsuo Qu AU - Cong Zhang AU - Bin Zhang Y1 - 2024/02/28 PY - 2024 N1 - https://doi.org/10.11648/j.ijssn.20241201.11 DO - 10.11648/j.ijssn.20241201.11 T2 - International Journal of Sensors and Sensor Networks JF - International Journal of Sensors and Sensor Networks JO - International Journal of Sensors and Sensor Networks SP - 1 EP - 12 PB - Science Publishing Group SN - 2329-1788 UR - https://doi.org/10.11648/j.ijssn.20241201.11 AB - This paper presents an in-depth study of the application of Multi-Agent Deep Deterministic Policy Gradient (MADDPG) algorithms with an exploratory strategy for duty cycle scheduling (DCS) in the wireless sensor networks (WSNs). The focus is on optimizing the performance of sensor nodes in terms of energy efficiency and event detection rates under varying environmental conditions. Through a series of simulations, we investigate the impact of key parameters such as the sensor specificity constant α and the Poisson rate of events on the learning and operational efficacy of sensor nodes. Our results demonstrate that the MADDPG algorithm with an exploratory strategy outperforms traditional reinforcement learning algorithms, particularly in environments characterized by high event rates and the need for precise energy management. The exploratory strategy enables a more effective balance between exploration and exploitation, leading to improved policy learning and adaptation in dynamic and uncertain environments. Furthermore, we explore the sensitivity of different algorithms to the tuning of the sensor specificity constant α, revealing that lower values generally yield better performance by reducing energy consumption without significantly compromising event detection. The study also examines the algorithms' robustness against the variability introduced by different event Poisson rates, emphasizing the importance of algorithm selection and parameter tuning in practical WSN applications. The insights gained from this research provide valuable guidelines for the deployment of sensor networks in real-world scenarios, where the trade-off between energy consumption and event detection is critical. Our findings suggest that the integration of exploratory strategies in MADDPG algorithms can significantly enhance the performance and reliability of sensor nodes in WSNs. VL - 12 IS - 1 ER -