Scalable Oversight Vocabulary

5 exercises. Learn scalable oversight vocabulary: debate, amplification, iterated amplification, and how humans supervise AI systems whose outputs they cannot fully verify.

Exercise 1 of 5
What is the core problem that scalable oversight research tries to solve?