Review Article

Application of Reinforcement Learning in Cognitive Radio Networks: Models and Algorithms

Algorithm 3

RL algorithm for joint dynamic channel selection and channel sensing [34].
Repeat
(a) Choose action
(b) Receive delayed reward
(c) Update belief
(d) Update Q-value:
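As a rough illustration of steps (a) to (d), the loop can be sketched in Python. The listing here does not give the update equations, so everything below is an assumption made for illustration and should not be read as the exact method of [34]: each channel is modelled as a two-state (idle/busy) Markov chain with known transition probabilities, the action is the index of the channel to sense and access, the delayed reward is 1 if that channel turned out to be idle, the belief is the per-channel probability of being idle, and the Q-value update is the standard tabular Q-learning rule applied to a discretized belief state. The names `run_agent`, `p_idle_stay`, `p_busy_to_idle`, and `bins` are hypothetical.

```python
import random
from collections import defaultdict


def run_agent(p_idle_stay, p_busy_to_idle, steps=2000, alpha=0.1,
              gamma=0.9, epsilon=0.1, bins=4, seed=0):
    """Illustrative sketch only: all modelling choices are assumptions."""
    rng = random.Random(seed)
    n = len(p_idle_stay)
    idle = [rng.random() < 0.5 for _ in range(n)]  # hidden true channel states
    belief = [0.5] * n                             # assumed P(channel is idle)
    q = defaultdict(lambda: [0.0] * n)             # Q[state][action]

    def discretize(b):
        # Map each belief in [0, 1] to one of `bins` buckets (assumed scheme).
        return tuple(min(int(x * bins), bins - 1) for x in b)

    state = discretize(belief)
    total_reward = 0.0
    for _ in range(steps):
        # (a) Choose action: epsilon-greedy over the channels (assumed policy).
        if rng.random() < epsilon:
            action = rng.randrange(n)
        else:
            action = max(range(n), key=lambda a: q[state][a])

        # (b) Receive delayed reward: 1 if the chosen channel was idle.
        reward = 1.0 if idle[action] else 0.0
        total_reward += reward

        # The environment moves to the next slot.
        idle = [rng.random() < (p_idle_stay[i] if idle[i] else p_busy_to_idle[i])
                for i in range(n)]

        # (c) Update belief: the sensed channel is now known exactly,
        # then every belief is propagated one step through the Markov chain.
        known = [reward if i == action else belief[i] for i in range(n)]
        belief = [b * p_idle_stay[i] + (1.0 - b) * p_busy_to_idle[i]
                  for i, b in enumerate(known)]
        next_state = discretize(belief)

        # (d) Update Q-value: standard Q-learning temporal-difference rule.
        td_target = reward + gamma * max(q[next_state])
        q[state][action] += alpha * (td_target - q[state][action])
        state = next_state

    return q, belief, total_reward / steps
```

For example, `run_agent([0.9, 0.2], [0.8, 0.1])` simulates two channels, the first of which is idle most of the time; under these assumptions the agent should learn to prefer it, and the returned average reward gives a quick sanity check.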