La Trobe

Gaussian Process Reinforcement Learning for Fast Opportunistic Spectrum Access

Download (5.62 MB)
journal contribution
posted on 2025-02-12, 04:13 authored by Zun Yan, Peng ChengPeng Cheng, Zhuo Chen, Yonghui Li, Branka Vucetic
Opportunistic spectrum access (OSA) is envisioned to support the spectrum demand of future-generation wireless networks. The majority of existing work assumed independent primary channels with the knowledge of network dynamics. However, the channels are usually correlated and network dynamics is unknown a-priori. This entails a great challenge on the sensing policy design for spectrum opportunity tracking, and the conventional partially observable Markov decision process (POMDP) formulation with model-based solutions are generally inapplicable. In this paper, we take a different approach, and formulate the sensing policy design as a time-series POMDP from a model-free perspective. To solve this time-series POMDP, we propose a novel Gaussian process reinforcement learning (GPRL) based solution. It achieves accurate channel selection and a fast learning rate. In essence, GP is embedded in RL as a Q-function approximator to efficiently utilize the past learning experience. A novel kernel function is first tailor designed to measure the correlation of time-series spectrum data. Then a covariance-based exploration strategy is developed to enable a proactive exploration for better policy learning. Finally, for GPRL to adapt to multichannel sensing, we propose a novel action-trimming method to reduce the computational cost. Our simulation results show that the designed sensing policy outperforms existing ones, and can obtain a near-optimal performance within a short learning phase.

Funding

The work was supported by ARC under Grant DE190100162.

History

Publication Date

2020-04-13

Journal

IEEE Transactions on Signal Processing

Volume

68

Pagination

16p. (p. 2613-2628)

Publisher

Institute of Electrical and Electronics Engineers

ISSN

1053-587X

Rights Statement

© 2020 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works.