Advanced Interview #data-quality #data-engineering #dbt #interview-prep

Data Quality Engineer Interview Questions

5 exercises — choose the best-structured answer to common data quality engineering interview questions. Focus on validation frameworks, test architecture, data contracts, lineage, and anomaly detection.

Structure for data quality interview answers

Name the framework objects: Great Expectations has Expectations, Suites, Validators, Checkpoints — use the correct terms
Distinguish test types: schema tests vs singular tests vs custom generic tests have different scopes and reuse patterns
Cover enforcement layers: data contracts need schema, runtime, and CI enforcement — not just documentation
Address tooling specifics: dbt, OpenLineage, Soda, DataHub — name the config options and integration points

0 / 5 completed

1 / 5

The interviewer asks: "Explain how Great Expectations works — how do you define expectations, validate data, and integrate validation into a data pipeline?"
Which answer best covers Great Expectations architecture?

2 / 5

The interviewer asks: "Compare dbt singular tests and generic tests — how do you write each, and when would you use custom generic tests vs schema.yml assertions?"
Which answer best covers dbt test architecture?

3 / 5

The interviewer asks: "What is a data contract, how do you specify one, and how do you enforce it in a modern data stack?"
Which answer best covers data contract architecture?

4 / 5

The interviewer asks: "Explain column-level lineage — how is it different from table-level lineage, why does it matter for impact analysis, and how do modern tools capture it?"
Which answer best covers data lineage depth?

5 / 5

The interviewer asks: "Compare Z-score, IQR, and ML-based approaches to detecting data quality anomalies in a pipeline — when would you use each?"
Which answer best covers data anomaly detection?