---
id: 5e8f2f13c4cdbe86b5c72da5
title: '使用 Q-Learning 进行强化学习：示例'
challengeType: 11
videoId: RBBSNta234s
dashedName: reinforcement-learning-with-q-learning-example
---

# --question--

## --text--

填空以完成以下 Q-Learning 方程：

```py
Q[__A__, __B__] = Q[__A__, __B__] + LEARNING_RATE * (reward + GAMMA * np.max(Q[__C__, :]) - Q[__A__, __B__])
```

## --answers--

A: `state`

B: `action`

C: `next_state`

---

A: `state`

B: `action`

C: `prev_state`

---

A: `state`

B: `reaction`

C: `next_state`

## --video-solution--

1