17 articles tagged #data-engineering
All English for IT articles related to #data-engineering.
-
Data Lineage Vocabulary: How to Talk About Data Provenance and Impact Analysis
Master the English vocabulary data engineers use when discussing data lineage, provenance, impact analysis, and metadata management tools like DataHub and Atlan.
-
dbt Snapshots and SCD2: English Vocabulary for Data Engineers
Learn English vocabulary for dbt snapshots and slowly changing dimensions to speak confidently in data engineering standups and design reviews.
-
English for Data Platform Architects: Lakehouse, Medallion, and Data Contracts
Master the English vocabulary and natural design review phrases for modern data platform discussions: lakehouse, medallion architecture, data contracts, and lineage.
-
Polars vs Pandas: English for DataFrame Comparison Discussions
Learn English vocabulary and phrases for comparing Python DataFrame libraries like Polars and Pandas in code reviews and technical debates.
-
English for Data Observability Engineers
Master essential data observability vocabulary — freshness, completeness, anomaly detection, schema drift, and lineage — for confident English communication.
-
English for dbt Unit Tests and Model Contracts
Learn the English vocabulary for dbt testing: model contracts, column-level constraints, unit test YAML, given/expect patterns, and fixture data explained.
-
English for Polars Data Processing Developers
Learn the English vocabulary for Polars data engineering: LazyFrame vs DataFrame, lazy evaluation, collect, scan_csv, expression API, and streaming mode explained.
-
English Vocabulary for Data Engineering Discussions
Master essential data engineering vocabulary: medallion architecture, data lineage, schema evolution, CDC, watermarks, exactly-once semantics, and data contracts.
-
Vocabulary for Data Engineers
Essential data engineering vocabulary explained in plain English: ETL vs ELT, data lakehouse, dbt, orchestration, data lineage, data quality — with examples.
-
Vocabulary for Data Mesh Architects
Essential English vocabulary for data mesh: domain ownership, data products, federated computational governance, self-serve data platform, and interoperability standards.
-
English for Data Quality Discussions: Talking About Validation and Trust
Vocabulary and phrases for data quality discussions in English: completeness, accuracy, freshness, anomalies, and diplomatic ways to flag bad data to stakeholders.
-
Vocabulary for Stream Processing: Windowing, Watermarks, and Backpressure
Master the English of stream processing: windowing, watermarks, backpressure, exactly-once, late data, and stateful operators. Precise terms for data and platform engineers.
-
English for Data Pipeline Engineers: Vocabulary and Phrases
Data pipeline vocabulary, ETL/ELT language, orchestration, data quality, and communication patterns for data engineers.
-
Data Governance Vocabulary for Engineers
60 essential data governance terms for data engineers: data stewardship, lineage, cataloguing, access control, data quality, compliance, and metadata management.
-
How to Write a Data Contract in English
A complete guide for data engineers: what a data contract is, how to write one in English, the required sections, vocabulary, and ready-to-use templates.
-
ETL vs ELT: Explaining the Difference in Plain English
Clear explanation of ETL and ELT for data engineers — vocabulary, when to use each approach, trade-offs, and real-world phrases for technical discussions and interviews.
-
Data Engineering Vocabulary: 70 Terms Every Data Engineer Must Know
Essential data engineering vocabulary: ETL, ELT, DAGs, data pipelines, Spark, Kafka, dbt, data contracts, lakehouse architecture, and 60 more terms with examples.