The nightly pipeline will ___ the cleaned data into the warehouse for analytics.
To load data into the warehouse means to write processed data to the analytics store — the L in ETL/ELT. Load is the precise term. Put in up, drop out, and send over are informal or non-idiomatic. Engineers "load the data into BigQuery," so load data into the warehouse is the correct collocation.
2 / 5
The analyst will ___ a query against the sales table to build the quarterly report.
To run a query means to execute a SQL statement against the warehouse. Run is the standard term, behind "run a query" and "query execution." Do, play, and fire out are informal. Analysts "run a query against the table," so run a query is the correct collocation.
3 / 5
To improve query performance on huge tables, the team will ___ the table by date.
To partition a table means to divide it into segments (e.g. by date) so queries scan less data. Partition is the precise term, behind "partitioning" and "partition pruning." Split up, divide out, and cut over are informal. Engineers "partition the table by date," so partition the table is the correct collocation.
4 / 5
The dashboard query will ___ the metrics by region to show totals per market.
To aggregate metrics means to combine rows into summary values (sum, average, count) grouped by a dimension. Aggregate is the precise term, behind "aggregation" and GROUP BY. Add up out, sum over, and total up are informal or imprecise. Analysts "aggregate the metrics by region," so aggregate the metrics is the correct collocation.
5 / 5
The data engineer will ___ the pipeline to run every night at 2 a.m.
To schedule a pipeline means to configure it to run automatically on a defined cadence. Schedule is the precise term, behind "scheduled job" and orchestrators like Airflow. Time up, set out, and plan over are informal. Engineers "schedule the pipeline to run nightly," so schedule the pipeline is the correct collocation.