Distributed Spectrum Management in Cognitive Radio Networks by Consensus-Based Reinforcement Learning

Stanković, Miloš

Authors: Dejan Dašić, Nemanja Ilić, Miljan Vučetić, Miroslav Perić, Marko Beko, Miloš Stanković

Journal: Sensors

Volume, no.: 21, 9

ISSN: 1424-8220

DOI: 10.3390/s21092970

Pages: 2970-2970

Link: https://www.mdpi.com/1424-8220/21/9/2970

Abstract:

In this paper, we propose a new algorithm for distributed spectrum sensing and channel selection in cognitive radio networks based on consensus. The algorithm operates within a multi-agent reinforcement learning scheme. The proposed consensus strategy, implemented over a directed, typically sparse, time-varying low-bandwidth communication network, enforces collaboration between the agents in a completely decentralized and distributed way. The motivation for the proposed approach comes directly from typical cognitive radio networks’ practical scenarios, where such a decentralized setting and distributed operation is of essential importance. Specifically, the proposed setting provides all the agents, in unknown environmental and application conditions, with viable network-wide information. Hence, a set of participating agents becomes capable of successful calculation of the optimal joint spectrum sensing and channel selection strategy even if the individual agents are not. The proposed algorithm is, by its nature, scalable and robust to node and link failures. The paper presents a detailed discussion and analysis of the algorithm’s characteristics, including the effects of denoising, the possibility of organizing coordinated actions, and the convergence rate improvement induced by the consensus scheme. The results of extensive simulations demonstrate the high effectiveness of the proposed algorithm, and that its behavior is close to the centralized scheme even in the case of sparse neighbor-based inter-node communication.

Keywords: multi-agent reinforcement learning; consensus algorithm; cognitive radio networking; joint spectrum sensing and channel selection; distributed policy evaluation; distributed Q-learning; off-policy temporal difference
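
Illustrative sketch (not the authors' algorithm): the abstract describes agents that learn locally and then mix their estimates with neighbors over a directed communication graph. The toy Python example below shows only that general idea under strong simplifying assumptions: a stateless, bandit-style channel-value estimate instead of the paper's off-policy temporal-difference scheme, a fixed directed ring instead of a time-varying network, and Gaussian reward noise. All names (`directed_ring_weights`, `consensus_step`, `run`) and all parameter values are hypothetical.

```python
import random


def directed_ring_weights(n, self_weight=0.5):
    """Row-stochastic weights for a directed ring (assumed topology):
    agent i listens only to itself and to agent i-1."""
    w = [[0.0] * n for _ in range(n)]
    for i in range(n):
        w[i][i] = self_weight
        w[i][(i - 1) % n] = 1.0 - self_weight
    return w


def consensus_step(values, weights):
    """One consensus round: each agent replaces its per-channel estimates
    with a weighted average of the estimates it receives."""
    n = len(values)
    k = len(values[0])
    return [
        [sum(weights[i][j] * values[j][c] for j in range(n)) for c in range(k)]
        for i in range(n)
    ]


def run(n_agents=5, true_values=(0.2, 0.8, 0.5), steps=2000,
        step_size=0.05, noise=0.5, seed=0):
    """Alternate a noisy local learning step with a consensus step.
    true_values are made-up mean rewards per channel."""
    rng = random.Random(seed)
    k = len(true_values)
    q = [[0.0] * k for _ in range(n_agents)]
    w = directed_ring_weights(n_agents)
    for _ in range(steps):
        # Local step: each agent samples one channel and moves its
        # estimate toward the noisy reward it observes there.
        for i in range(n_agents):
            c = rng.randrange(k)
            reward = true_values[c] + rng.gauss(0.0, noise)
            q[i][c] += step_size * (reward - q[i][c])
        # Consensus step: mix estimates along the directed ring.
        q = consensus_step(q, w)
    return q


if __name__ == "__main__":
    for agent, estimates in enumerate(run()):
        print(agent, [round(v, 2) for v in estimates])
```

In this toy setting the mixing step pools the noisy observations of all agents, which is one simple way to picture the denoising effect mentioned in the abstract; the sketch makes no claim about the convergence properties analyzed in the paper.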
