What is the significance of “expected reward” in a Markov Decision Process?
a) The probability of reaching a steady state
b) The return associated with following a particular policy
c) The expected transition time between states
d) The number of steps in the process