Advanced AI Alignment & Safety CommunicationSafety reportsRisk

Communicating AI Safety Findings — Language

5 exercises — Practice the language used in AI safety reports, capability disclosures, and risk tier communication.

0 / 5 completed

1 / 5

A safety team says: "This feature imposes an alignment tax." What do they mean?

2 / 5

A researcher discovers a jailbreak in a deployed model. Following responsible disclosure practice, they should:

3 / 5

Which sentence correctly uses dual-use in an AI context?

4 / 5

When a capability is classified as a high risk tier, this means:

5 / 5

Fill in the blank: "We identified a jailbreak vector and are implementing a ___ in the next model version."