---
id: 5e8f2f13c4cdbe86b5c72da5
title: 'Reinforcement Learning With Q-Learning: Example'
challengeType: 11
videoId: RBBSNta234s
dashedName: reinforcement-learning-with-q-learning-example
---

# --question--

## --text--

Fill in the blanks to complete the following Q-Learning equation:

```py
Q[__A__, __B__] = Q[__A__, __B__] + LEARNING_RATE * (reward + GAMMA * np.max(Q[__C__, :]) - Q[__A__, __B__])
```

## --answers--

A: `state`

B: `action`

C: `next_state`

---

A: `state`

B: `action`

C: `prev_state`

---

A: `state`

B: `reaction`

C: `next_state`

## --video-solution--

1