Exploration Entropy for Reinforcement Learning

<div>State Entropy of all states at the (a) 10th, (b) 20th, (c) 40th, (d) 60th, (e) 100th, (f) 200th, (g) 300th, (h) 800th, and (i) 1000th iteration with Softmax strategy in Maze A.</div>

Mathematical Problems in Engineering

Exploration Entropy for Reinforcement Learning

Figure 5