Scenario: "The contract specifies a freshness SLA of 4 hours." What does data freshness mean as a contractual obligation?
Freshness SLA: "data is no older than X hours at any given time." Measured by: max(updated_at) vs NOW() delta. Breach triggers alert, producer notified, consumers may pause dependent processes.
Scenario: "The contract includes a referential integrity check." What is referential integrity as a data quality expectation?
Referential integrity is critical for analytics correctness — orphaned rows inflate metrics. Contract spec: "100% of customer_id values in orders must exist in customers."
Scenario: "Data quality monitoring detected an anomaly in the distribution of the status column." What does statistical anomaly detection monitor?
Distribution monitoring: track value frequency + standard deviation and alert on shifts. e.g., normally 70% 'active', 20% 'cancelled' — if 'cancelled' spikes to 50%, investigate.
Key vocab:distribution anomaly, value frequency monitoring, statistical baseline, enum distribution shift.
5 / 5
Scenario: "The contract defines a combined data quality score." How is it typically calculated?