A4 Article in conference proceedings
Generative Diffusion Model-Based Deep Reinforcement Learning for Uplink Rate-Splitting Multiple Access in LEO Satellite Networks (2024)


Wang, X., Wang, K., Zhang, D., Li, J., Zhou, M., & Hämäläinen, T. (2024). Generative Diffusion Model-Based Deep Reinforcement Learning for Uplink Rate-Splitting Multiple Access in LEO Satellite Networks. In 2024 IEEE Symposium on Computers and Communications (ISCC). IEEE Computer Society Press. Proceedings : IEEE Symposium on Computers and Communications. https://doi.org/10.1109/iscc61673.2024.10733704


JYU authors or editors


Publication details

All authors or editorsWang, Xingjie; Wang, Kan; Zhang, Di; Li, Junhuai; Zhou, Momiao; Hämäläinen, Timo

Parent publication2024 IEEE Symposium on Computers and Communications (ISCC)

Place and date of conferenceParis, France26.-29.6.2024

ISBN979-8-3503-5424-9

eISBN979-8-3503-5423-2

Journal or seriesProceedings : IEEE Symposium on Computers and Communications

ISSN1530-1346

eISSN2642-7389

Publication year2024

Publication date31/10/2024

PublisherIEEE Computer Society Press

Publication countryUnited States

Publication languageEnglish

DOIhttps://doi.org/10.1109/iscc61673.2024.10733704

Publication open accessNot open

Publication channel open access

Publication is parallel published (JYX)https://jyx.jyu.fi/handle/123456789/98474

Additional informationS05: Non-terrestrial networks


Abstract

This work studies the joint transmit power control and receive beamforming in uplink rate splitting multiple access (RSMA)-based low earth orbit (LEO) satellite networks, using both generative diffusion model and proximal policy optimization (PPO) learning framework. In particular, using RSMA, interference is partially decoded and partially treated as noise, thereby improving the spectral efficiency, while the dynamics and uncertainty in LEO satellite networks would pose challenges to the real-time power control and receive beamforming optimization. First, a long-run sum data rate maximization problem is formulated, subject to the individual data rate requirement, and then the Markov decision process (MDP) is used to model it. Second, on the basis of MDP, a generative diffusion model-based proximal policy optimization (PPO) framework is proposed, where a denoising network is taken as the actor network in PPO to output the optimal continuous policy, thereby facilitating the hyperparameter tuning and improve the sample efficiency. Finally, experiments are conducted to show advantages of merging diffusion model into PPO, in terms of larger spectral efficiency, by comparing proposed framework with benchmarks.


Keywordsdata communications networkscommunications satellitessatellite communicationswireless data transmissionoptimisationalgorithms

Free keywordsLEO satellite networks; RSMA; generative diffusion model; PPO


Contributing organizations


Ministry reportingYes

VIRTA submission year2024

Preliminary JUFO rating1


Last updated on 2024-30-11 at 20:05