---
id: 5e8f2f13c4cdbe86b5c72da4
challengeType: 11
videoId: DX7hJuaUZ7o
---

# --question--

## --text--
What can happen if the agent does not have a good balance of taking random actions and using learned actions?

## --answers--

The agent will always try to minimize its reward for the current state/action, leading to local minima.

---

The agent will always try to maximize its reward for the current state/action, leading to local maxima.

## --video-solution--
2

# --hints--

# --solutions--