Advanced AI Alignment & Safety Scalable OversightDebateAmplification

Scalable Oversight Vocabulary

5 exercises — Learn scalable oversight vocabulary: debate, amplification, iterated amplification, and how humans supervise AI they cannot fully verify.

0 / 5 completed

1 / 5

What is the core problem that scalable oversight research tries to solve?

2 / 5

In AI debate (a scalable oversight technique), what is the key assumption?

3 / 5

A colleague explains: "The human's ability to evaluate AI outputs doesn't scale as the AI gets smarter." Which technique directly addresses this by recursively decomposing tasks?

4 / 5

What does amplification mean in the context of scalable oversight?

5 / 5

Why is scalable oversight particularly critical for superhuman AI systems?