Computer Networking: A Top-Down Approach (7th Edition)
Computer Networking: A Top-Down Approach (7th Edition)
7th Edition
ISBN: 9780133594140
Author: James Kurose, Keith Ross
Publisher: PEARSON
Bartleby Related Questions Icon

Related questions

Question

10

Computing theta update rule
possible (graded)
The Q-learning approximation algorithm starts with an initial parameter estimate of 0. As the tabular Q-learning, upon observing a data tuple
(s, c, R (s, c), s'), the target value y for the Q-value of (s, c) is defined as the sampled version of the Bellman operator,
Then the parameter is simply updated by taking a gradient step with respect to the squared loss
y = = R(s, c) + ymax Q (s', c', 0).
L(0) =
The negative gradient can be computed as follows:
(Enter your answer in terms of y, Q(s, c, theta), and phi(s, c).).
g (0) =
=
L (0) = 1/2 (y — Q (s, c, 0))².
expand button
Transcribed Image Text:Computing theta update rule possible (graded) The Q-learning approximation algorithm starts with an initial parameter estimate of 0. As the tabular Q-learning, upon observing a data tuple (s, c, R (s, c), s'), the target value y for the Q-value of (s, c) is defined as the sampled version of the Bellman operator, Then the parameter is simply updated by taking a gradient step with respect to the squared loss y = = R(s, c) + ymax Q (s', c', 0). L(0) = The negative gradient can be computed as follows: (Enter your answer in terms of y, Q(s, c, theta), and phi(s, c).). g (0) = = L (0) = 1/2 (y — Q (s, c, 0))².
Expert Solution
Check Mark
Knowledge Booster
Background pattern image
Recommended textbooks for you
Text book image
Computer Networking: A Top-Down Approach (7th Edi...
Computer Engineering
ISBN:9780133594140
Author:James Kurose, Keith Ross
Publisher:PEARSON
Text book image
Computer Organization and Design MIPS Edition, Fi...
Computer Engineering
ISBN:9780124077263
Author:David A. Patterson, John L. Hennessy
Publisher:Elsevier Science
Text book image
Network+ Guide to Networks (MindTap Course List)
Computer Engineering
ISBN:9781337569330
Author:Jill West, Tamara Dean, Jean Andrews
Publisher:Cengage Learning
Text book image
Concepts of Database Management
Computer Engineering
ISBN:9781337093422
Author:Joy L. Starks, Philip J. Pratt, Mary Z. Last
Publisher:Cengage Learning
Text book image
Prelude to Programming
Computer Engineering
ISBN:9780133750423
Author:VENIT, Stewart
Publisher:Pearson Education
Text book image
Sc Business Data Communications and Networking, T...
Computer Engineering
ISBN:9781119368830
Author:FITZGERALD
Publisher:WILEY