freeCodeCamp/curriculum/challenges/portuguese/11-machine-learning-with-py.../tensorflow/reinforcement-learning-with...

---
id: 5e8f2f13c4cdbe86b5c72da4
title: 'Aprendizagem de reforço com Q-Learning: Parte 2'
challengeType: 11
videoId: DX7hJuaUZ7o
bilibiliIds:
  aid: 420570359
  bvid: BV1G341127zr
  cid: 409139190
dashedName: reinforcement-learning-with-q-learning-part-2
---

# --question--

## --text--

O que pode acontecer se o agente não tiver um bom equilíbrio entre realizar ações aleatórias e usar ações aprendidas?

## --answers--

O agente sempre tentará minimizar sua recompensa pelo estado/ação atual, levando ao mínimo local.

---

O agente sempre tentará maximizar sua recompensa pelo estado/ação atual, levando ao máximo local.

## --video-solution--

2
feat: enable new langs (#42491) Enable italian and portuguese 2021-06-15 07:49:18 +00:00			`---`
			`id: 5e8f2f13c4cdbe86b5c72da4`
chore(i18n,curriculum): update translations (#42969) 2021-07-22 16:01:38 +00:00			`title: 'Aprendizagem de reforço com Q-Learning: Parte 2'`
feat: enable new langs (#42491) Enable italian and portuguese 2021-06-15 07:49:18 +00:00			`challengeType: 11`
			`videoId: DX7hJuaUZ7o`
chore(i18n,curriculum): update translations (#43661) 2021-10-03 19:24:27 +00:00			`bilibiliIds:`
			`aid: 420570359`
			`bvid: BV1G341127zr`
			`cid: 409139190`
feat: enable new langs (#42491) Enable italian and portuguese 2021-06-15 07:49:18 +00:00			`dashedName: reinforcement-learning-with-q-learning-part-2`
			`---`

			`# --question--`

			`## --text--`

chore(i18n,curriculum): update translations (#42969) 2021-07-22 16:01:38 +00:00			`O que pode acontecer se o agente não tiver um bom equilíbrio entre realizar ações aleatórias e usar ações aprendidas?`
feat: enable new langs (#42491) Enable italian and portuguese 2021-06-15 07:49:18 +00:00
			`## --answers--`

chore(i18n,curriculum): update translations (#42969) 2021-07-22 16:01:38 +00:00			`O agente sempre tentará minimizar sua recompensa pelo estado/ação atual, levando ao mínimo local.`
feat: enable new langs (#42491) Enable italian and portuguese 2021-06-15 07:49:18 +00:00
			`---`

chore(i18n,curriculum): update translations (#42969) 2021-07-22 16:01:38 +00:00			`O agente sempre tentará maximizar sua recompensa pelo estado/ação atual, levando ao máximo local.`
feat: enable new langs (#42491) Enable italian and portuguese 2021-06-15 07:49:18 +00:00
			`## --video-solution--`

			`2`