Which of the following statements describes the difference between reinforcement-based learning and the temporal difference technique?
(a) State represented by a directed graph
(b) Assignment of weightage to an action on the basis of the degree of success
(c) Computation of degree of success
(d) A priori model of the sequence of possible states
I got this question in a job interview.
My question is from the Next Generation Wireless Network topic, in the Next Generation Wireless Network section of Cognitive Radio.