Advanced AI Alignment & Safety CorrigibilityControlSafety

Corrigibility Vocabulary

5 exercises — Learn the vocabulary of AI corrigibility: corrigible AI, instrumental convergence, resistance to shutdown, and the tension between capability and human control.

0 / 5 completed
1 / 5
What does it mean for an AI system to be corrigible?