5 exercises covering ETL, LLM, RAG, OLAP, NLP, and ML: the data pipeline and machine learning vocabulary every engineer encounters.
Acronyms covered in this set
ETL / ELT — data pipeline patterns (the order matters)
LLM / RAG — Large Language Model / Retrieval-Augmented Generation
OLAP / OLTP — analytical vs. transactional query workloads
NLP — Natural Language Processing
ML / GPU — Machine Learning / Graphics Processing Unit
1 / 5
A data engineer explains a pipeline: "We use an ETL process to extract data from the source, transform it, and load it into the warehouse — but recently we've been moving to ELT instead." What is the key difference between ETL and ELT?
ETL = Extract, Transform, Load. The traditional data pipeline pattern: (1) Extract data from source systems, (2) Transform it in a separate processing layer (cleaning, joining, aggregating), (3) Load the clean, structured data into the destination (data warehouse). Best when: transformation is complex, source data is sensitive (PII removed before loading), or destination storage is expensive.

ELT = Extract, Load, Transform. The modern variant: (1) Extract data from sources, (2) Load raw data directly into the cloud warehouse (BigQuery, Snowflake, Redshift), (3) Transform in-place using SQL inside the warehouse. Best when: the cloud warehouse is cheap and powerful enough to handle massive transformations, you want to preserve raw data for reprocessing, or dbt-style SQL transformations are preferred.

The shift to ELT is driven by cloud data warehouses making in-warehouse computation economical. Say: "E-T-L", "E-L-T" (letter by letter).
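A minimal sketch of the two patterns in Python. The `source` and `warehouse` objects and their methods (`fetch_orders`, `load`, `execute`) are hypothetical stand-ins for whatever connector and warehouse client you actually use:

```python
def etl(source, warehouse):
    rows = source.fetch_orders()                  # 1. Extract from the source system
    clean = [r for r in rows if r.get("amount", 0) > 0]
    for r in clean:
        r.pop("customer_ssn", None)               # 2. Transform: strip PII before loading
    warehouse.load("orders_clean", clean)         # 3. Load only clean, structured rows

def elt(source, warehouse):
    rows = source.fetch_orders()                  # 1. Extract from the source system
    warehouse.load("orders_raw", rows)            # 2. Load the raw data as-is
    warehouse.execute("""
        CREATE OR REPLACE TABLE orders_clean AS   -- 3. Transform in-place with SQL
        SELECT order_id, amount, region
        FROM orders_raw
        WHERE amount > 0
    """)
```

Note that the raw orders_raw table survives in the ELT version, so the transformation can be rerun or revised later without re-extracting.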
2 / 5
An ML engineer describes a model performance issue: "Our LLM is producing hallucinations — the RAG architecture should help anchor it to real documents." What are LLM and RAG?
LLM = Large Language Model. A type of AI model trained on massive text datasets to generate, summarise, translate, and reason about text. Examples: GPT-4, Claude, Gemini, Llama. LLMs predict the next token in a sequence and can generate fluent, contextually appropriate text. Key limitation: they can "hallucinate", generating confident-sounding but factually incorrect information, because they recall patterns from training data rather than retrieving ground truth.

RAG = Retrieval-Augmented Generation. An architectural pattern that addresses hallucination: when a user asks a question, the system first retrieves relevant documents (from a vector database, search index, or knowledge base), then provides those documents as context to the LLM alongside the question. The LLM generates its answer grounded in the retrieved documents, which dramatically reduces hallucination.

In practice: "We built a RAG pipeline over our internal docs — now the chatbot answers based on our actual runbooks." Say: "L-L-M" (letter by letter), "RAG" as a word (/ræɡ/).
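A minimal RAG loop, as a sketch only: `vector_db.search` and `llm.generate` are hypothetical placeholders for your vector store and model client, not any real library's API:

```python
def answer_with_rag(question, vector_db, llm, k=3):
    # 1. Retrieve: find the k documents most similar to the question
    docs = vector_db.search(question, top_k=k)
    context = "\n\n".join(doc.text for doc in docs)

    # 2. Augment: put the retrieved documents into the prompt
    prompt = (
        "Answer using ONLY the context below. If the context does not "
        "contain the answer, say you don't know.\n\n"
        f"Context:\n{context}\n\nQuestion: {question}"
    )

    # 3. Generate: the LLM answers grounded in the retrieved text
    return llm.generate(prompt)
```

The grounding comes from the prompt itself: the model is told to answer from the retrieved context rather than from whatever it memorised during training.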
3 / 5
A data analyst explains a query performance issue: "We run OLAP queries on the warehouse and OLTP queries on the production database — mixing them is what causes the slowdowns." What is the difference between OLAP and OLTP?
OLAP = Online Analytical Processing. Designed for complex analytical queries over large historical datasets. Optimised for read-heavy, aggregation-heavy workloads: "What were our total sales by region and product category last quarter?" OLAP systems: Google BigQuery, Snowflake, Amazon Redshift, ClickHouse, Apache Druid. Key characteristics: columnar storage (reads only the needed columns), massive parallelism, denormalised schemas (star schema, fact tables).

OLTP = Online Transaction Processing. Designed for high-frequency, low-latency transactional operations. Optimised for write-heavy workloads: processing orders, updating balances, inserting records. OLTP systems: PostgreSQL, MySQL, MongoDB, Oracle DB. Key characteristics: row storage, ACID transactions, normalised schemas, indexes for fast lookups.

Why mixing them fails: an OLAP query scanning billions of rows on a production OLTP database locks tables, slows writes, and causes latency spikes. The solution: separate the workloads and use CDC (Change Data Capture) to sync data to the warehouse. Say: "O-L-A-P", "O-L-T-P" (letter by letter).
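To make the contrast concrete, here are the two query shapes side by side (the table and column names are invented for illustration):

```python
# OLTP: a point operation that touches one row via an index,
# executed thousands of times per second
OLTP_QUERY = """
    UPDATE accounts
    SET balance = balance - 25.00
    WHERE account_id = 84213;
"""

# OLAP: an aggregation over historical data that scans millions of rows,
# executed a handful of times per day
OLAP_QUERY = """
    SELECT region, product_category, SUM(amount) AS total_sales
    FROM fact_sales                      -- star-schema fact table
    JOIN dim_product USING (product_id)  -- dimension table
    WHERE sale_date >= DATE '2024-01-01'
    GROUP BY region, product_category;
"""
```

Run the second query on the production OLTP database and it competes with every order-processing transaction for I/O and locks, which is exactly the slowdown the analyst describes.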
4 / 5
A data scientist presents results: "The NLP model achieves 94% accuracy on the classification task, but we're seeing class imbalance — the F1 score is a better metric here." What is NLP?
NLP = Natural Language Processing. The field of AI and machine learning focused on enabling computers to understand, interpret, and generate human language. Common NLP tasks: text classification (spam detection, sentiment analysis), Named Entity Recognition (NER: extracting names, dates, locations from text), machine translation (English → French), summarisation (condensing long documents), question answering, and text generation (LLMs). Classic NLP tools: spaCy, NLTK, Stanford CoreNLP. Modern NLP: transformer-based models (BERT, the GPT series, T5).

The F1 score mentioned is the harmonic mean of precision and recall, essential when classes are imbalanced (e.g. 99% legitimate emails, 1% spam: a model that predicts all legitimate gets 99% accuracy but 0% recall on spam).

Say: "N-L-P" (letter by letter). Used in many IT roles: search engines, chatbots, code completion (GitHub Copilot), documentation tools.
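The accuracy-vs-F1 point is easy to verify with a few lines of plain Python on an invented imbalanced dataset (1000 emails, 10 of them spam):

```python
y_true = [1] * 10 + [0] * 990   # 1 = spam, 0 = legitimate
y_pred = [0] * 1000             # a model that predicts "legitimate" for everything

accuracy = sum(t == p for t, p in zip(y_true, y_pred)) / len(y_true)

tp = sum(t == 1 and p == 1 for t, p in zip(y_true, y_pred))  # true positives
fp = sum(t == 0 and p == 1 for t, p in zip(y_true, y_pred))  # false positives
fn = sum(t == 1 and p == 0 for t, p in zip(y_true, y_pred))  # false negatives

precision = tp / (tp + fp) if tp + fp else 0.0
recall = tp / (tp + fn) if tp + fn else 0.0
f1 = 2 * precision * recall / (precision + recall) if precision + recall else 0.0

print(f"accuracy={accuracy:.3f}  recall={recall:.3f}  F1={f1:.3f}")
# accuracy=0.990  recall=0.000  F1=0.000
```

99% accuracy, yet the model never catches a single spam email; F1 exposes the failure that accuracy hides.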
5 / 5
A data engineer says: "We store our model embeddings in a vector DB, and the raw logs go into an OLAP warehouse — our ML pipeline runs on GPU clusters." In this context, what do ML and GPU stand for?
ML = Machine Learning. A subset of artificial intelligence where systems learn patterns from data rather than following explicitly programmed rules. ML training involves: (1) feeding labelled data (or unlabelled data for unsupervised learning), (2) the model adjusting its parameters (weights) to minimise error, (3) evaluating on held-out test data. Key ML paradigms: supervised learning (labelled data: classification, regression), unsupervised learning (no labels: clustering, dimensionality reduction), reinforcement learning (reward signals: games, robotics), self-supervised learning (LLMs pretrain on predicting next tokens).

GPU = Graphics Processing Unit. Originally built for graphics rendering, GPUs excel at parallel matrix computations — the same math used in neural network training. A data-centre GPU such as an NVIDIA A100 can train neural networks orders of magnitude faster than a general-purpose CPU. In ML infrastructure: "We need 8 A100s to fine-tune this model in a reasonable time."

Say: "M-L" (letter by letter), "G-P-U" (letter by letter). In conversation: "ML pipeline", "ML ops" (MLOps), "ML engineer vs. data scientist" (a blurry line).
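To ground step (2) of the training loop, here is a toy supervised-learning example in plain Python: fitting y = w*x + b by gradient descent on a handful of invented labelled points. In a real ML pipeline, the same parameter-update math runs as large batched matrix operations, which is precisely what GPUs parallelise well.

```python
data = [(1.0, 3.1), (2.0, 4.9), (3.0, 7.2), (4.0, 8.8)]  # labelled (x, y) pairs
w, b, lr = 0.0, 0.0, 0.01                                # parameters + learning rate

for epoch in range(2000):
    # Gradients of the mean squared error, averaged over the dataset
    grad_w = sum(2 * (w * x + b - y) * x for x, y in data) / len(data)
    grad_b = sum(2 * (w * x + b - y) for x, y in data) / len(data)
    # Step the parameters against the gradient to reduce the error
    w -= lr * grad_w
    b -= lr * grad_b

print(f"learned w={w:.2f}, b={b:.2f}")  # approaches the slope/intercept of the data
```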