Advanced Vocabulary #dataops#data-quality#data-engineering#observability

DataOps Vocabulary

5 exercises — Practice DataOps vocabulary in English: data quality SLAs, data contracts, data observability, lineage, schema drift, and data incidents.

Core DataOps vocabulary clusters
  • Data quality: data quality SLA, freshness, completeness, accuracy, consistency, validity, data quality score
  • Data contracts: data contract, schema registry, backward/forward compatibility, breaking change, consumer-driven contract
  • Observability: data observability, data lineage, column-level lineage, anomaly detection, data freshness, volume spike
  • Incidents: data incident, silent failure, data downtime, SLA breach, data quality alert, incident classification
  • Tools: Monte Carlo, Great Expectations, dbt tests, Apache Atlas, OpenLineage, Marquez, Soda Core
0 / 5 completed
1 / 5
A data engineering lead introduces data contracts:
"A data contract is a formal agreement between data producers and consumers about the structure and semantics of a dataset. It specifies the schema, data types, expected freshness, null rate, and SLA. If the producer makes a breaking change — like renaming a column or changing a type — they violate the contract. We version our contracts and require producers to get consumer sign-off before breaking changes."
What is a data contract and what problem does it solve?