505 B
505 B
id | challengeType | videoId |
---|---|---|
5e8f2f13c4cdbe86b5c72da4 | 11 | DX7hJuaUZ7o |
--question--
--text--
What can happen if the agent does not have a good balance of taking random actions and using learned actions?
--answers--
The agent will always try to minimize its reward for the current state/action, leading to local minima.
The agent will always try to maximize its reward for the current state/action, leading to local maxima.
--video-solution--
2