EDGE360: Edge-Enabled Multi-Agent DRL for Region-Aware Rate Adaptation Solution to Enhance Quality of 360° Video Streaming

Subhan, Fazal E.; Yaqoob, Abid; Muntean, Cristina Hava; Muntean, Gabriel-Miro

EDGE360: Edge-Enabled Multi-Agent DRL for Region-Aware Rate Adaptation Solution to Enhance Quality of 360° Video Streaming

Tools

Subhan, Fazal E., Yaqoob, Abid, Muntean, Cristina Hava and Muntean, Gabriel-Miro (2025) EDGE360: Edge-Enabled Multi-Agent DRL for Region-Aware Rate Adaptation Solution to Enhance Quality of 360° Video Streaming. IEEE Transactions on Mobile Computing. ISSN 15361233

Full text not available from this repository.

Official URL: https://doi.org/10.1109/TMC.2025.3605849

Abstract

Optimal tile-based bitrate allocation improves the Quality of Experience (QoE) for adaptive 360° video streaming across multiple clients in heterogeneous network environments; however, it is challenging as it implies accurate viewport prediction, finest tile-based bitrate reservation, and maintaining QoE fairness, particularly under constrained network conditions. This paper proposes a strategy named EDGE360, that employs an edge-driven Multi-Agent Deep Reinforcement Learning (MADRL) solution for rate adaptation to improve the joint QoE in DASH-based rich media content delivery based on adaptive viewport prediction and Video Multi-method Assessment Fusion (VMAF) corresponding tiling granularity selection. Cooperative strategies among agents in the central critic network are crucial for addressing the complexity of network instances at the edge and optimizing media streaming bitrate assignment in multiple-client scenarios. Therefore, EDGE360 aims to implement the Counterfactual Multi-Agent Policy Gradients (COMA) based on 5G network traces to train agents in policies that optimize individual client QoE and fairness among clients, resulting in an improved rich streaming experience. At the edge, a tile-based quality monitor evaluates viewport trajectories, buffer status, and network throughput, employing deep learning to forecast optimal tile bitrate allocation, which is formulated as an MDP and solved with MADRL. Based on extensive experimentation, EDGE360 surpasses state-of-the-art adaptive bitrate algorithms by achieving the highest average reward, outperforming RAPT360, 360SRL, and BOLA360 by 8.12%, 11.86%, and 18.00%, respectively, demonstrating superior convergence and refinement.

Item Type:	Article
Uncontrolled Keywords:	Bitrate adaptation; Deep Reinforcement Learning; Edge Computing; MPEG DASH; QoE Fairness
Subjects:	Q Science > QA Mathematics > Electronic computers. Computer science T Technology > T Technology (General) > Information Technology > Electronic computers. Computer science Z Bibliography. Library Science. Information Resources > ZA Information resources > ZA4150 Computer Network Resources Q Science > QA Mathematics > Electronic computers. Computer science > Computer Systems > Computers > Electronic data processing > Electronic data processing--Distributed processing > Edge computing T Technology > T Technology (General) > Information Technology > Electronic computers. Computer science > Computer Systems > Computers > Electronic data processing > Electronic data processing--Distributed processing > Edge computing Q Science > Q Science (General) > Self-organizing systems. Conscious automata > Machine learning
Divisions:	School of Computing > Staff Research and Publications
Depositing User:	Tamara Malone
Date Deposited:	16 Sep 2025 14:32
Last Modified:	16 Sep 2025 14:32
URI:	https://norma.ncirl.ie/id/eprint/8858

Actions (login required)

View Item