IntermediateVocabulary#data-science-ml#backend#developer-tools

dbt Data Transformation Vocabulary

Learn the vocabulary of writing a SQL-first data transformation that runs directly inside the warehouse.

0 / 5 completed

1 / 5

At standup, a dev mentions writing a data transformation as a SQL SELECT statement that a tool compiles and runs directly inside the warehouse, rather than extracting data out to a separate transformation engine. What tool is being described?

2 / 5

During a design review, the team wants one model's SQL to reference another model by name rather than a hardcoded table name, so dbt can automatically figure out the correct build order between them. Which capability supports this?

3 / 5

In a code review, a dev notices a dbt test is defined declaratively on a model's column, like asserting a column's values are always unique and never null, and dbt runs that test automatically as part of the build. What does this represent?

4 / 5

An incident report shows a broken downstream model silently produced incorrect numbers for a week because an upstream model's output had started containing duplicate rows, and no test existed to catch that duplication automatically. What practice would prevent this?

5 / 5

During a PR review, a teammate asks why the team writes its data transformations in dbt instead of a traditional ETL tool that extracts data out of the warehouse to transform it in a separate engine. What is the reasoning?