5 exercises covering the vocabulary every developer working with AI needs in English: RAG architecture, hallucination, AI agents, model quantisation, and prompting techniques.
1 / 5
An ML engineer explains their system: "Instead of searching the entire document database for every query, we first convert documents to vectors using an embedding model and store them in a vector database. At query time, we embed the question and retrieve the k nearest neighbours — semantically similar documents — then pass them as context to the LLM." What architecture is described here?
RAG (Retrieval-Augmented Generation) is an architecture for giving LLMs access to external knowledge without retraining. The retrieval step finds relevant documents; the generation step uses them as context in the prompt.

Key RAG vocabulary:
- Embedding — a dense vector representation of text that captures semantic meaning; similar texts produce vectors that are close in high-dimensional space.
- Vector database — a database optimised for storing and querying vectors by similarity (Pinecone, Weaviate, Chroma, pgvector).
- Semantic search — search by meaning rather than keyword matching.
- k-NN (k-Nearest Neighbours) — retrieve the k most similar vectors to the query.
- Chunking — splitting documents into smaller pieces before embedding, to fit context window limits.
- Context window — the maximum number of tokens an LLM can process in a single call.
- Grounding — connecting LLM output to specific, verifiable source documents.

RAG vs fine-tuning: RAG is preferred when the knowledge changes frequently; fine-tuning is preferred when you need to change the model's behaviour or style.
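The retrieval step can be sketched in a few lines of Python. This is a toy illustration, not a production pipeline: the `embed()` function here is a hash-based stand-in for a real embedding model, and the "vector database" is a plain in-memory array.

```python
import numpy as np

def embed(text: str, dim: int = 64) -> np.ndarray:
    """Toy stand-in for a real embedding model: hashes words into a
    fixed-size vector. Real systems use a trained embedding model."""
    v = np.zeros(dim)
    for word in text.lower().split():
        v[hash(word) % dim] += 1.0
    return v

# Indexing: chunk documents, embed each chunk, store the vectors.
chunks = [
    "Refunds are processed within 14 days of the return request.",
    "Support is available Monday to Friday, 9:00 to 17:00.",
]
index = np.stack([embed(c) for c in chunks])            # shape: (n_chunks, dim)
index /= np.linalg.norm(index, axis=1, keepdims=True)   # normalise for cosine

def retrieve(query: str, k: int = 1) -> list[str]:
    """k-NN retrieval: return the k chunks most similar to the query."""
    q = embed(query)
    q /= np.linalg.norm(q)
    scores = index @ q                    # cosine similarity to every chunk
    top = np.argsort(scores)[::-1][:k]    # indices of the k nearest chunks
    return [chunks[i] for i in top]

# Generation step: pass the retrieved chunks to the LLM as grounded context.
query = "How long do refunds take?"
context = "\n".join(retrieve(query))
prompt = f"Answer using only this context:\n{context}\n\nQuestion: {query}"
```

Swapping the toy `embed()` for a real embedding model and the array for a vector database gives the architecture the engineer describes.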
2 / 5
A product manager reads a model evaluation report: "The model hallucinated in 3% of responses — confidently stating facts that were not present in the source documents and could not be verified." What does hallucination mean in the context of LLMs?
Hallucination in LLMs is when the model generates content that is factually incorrect, fabricated, or not grounded in any real source — often with high confidence and plausible language. LLMs don't "know" facts the way a database does; they predict statistically likely next tokens. When the training data has conflicting or sparse information about a topic, the model may confabulate rather than say "I don't know."

Why hallucinations happen: LLMs optimise for fluency and coherence, not factual accuracy; they generalise from patterns; they have no mechanism to verify real-world facts.

Categories:
- Factual hallucination — wrong facts stated confidently (e.g., citing a paper that doesn't exist).
- Instruction hallucination — failing to follow constraints while claiming to.
- Context hallucination — contradicting information provided in the prompt.

Mitigation strategies: RAG (grounding in source documents), self-consistency checks, output verification pipelines, Constitutional AI, RLHF.

In conversation: "Never use this model for medical or legal information without a human-in-the-loop review — hallucination rates are too high for high-stakes outputs."
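One of the mitigation ideas above, checking that an answer is grounded in the source documents, can be illustrated with a deliberately crude sketch. This uses word overlap as a proxy; real verification pipelines use entailment models or LLM judges, and the threshold here is an arbitrary assumption.

```python
def ungrounded_sentences(answer: str, sources: list[str],
                         threshold: float = 0.5) -> list[str]:
    """Flag answer sentences whose content words mostly do not appear
    in the sources. A crude proxy for grounding, for illustration only."""
    source_words = set(" ".join(sources).lower().split())
    flagged = []
    for sentence in answer.split("."):
        words = [w for w in sentence.lower().split() if len(w) > 3]
        if not words:
            continue
        grounded = sum(w in source_words for w in words) / len(words)
        if grounded < threshold:    # weak overlap: possible hallucination
            flagged.append(sentence.strip())
    return flagged

sources = ["The report was published in March 2024 by the audit team."]
answer = "The report was published in March 2024. It won a national award."
print(ungrounded_sentences(answer, sources))
# -> ['It won a national award']
```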
3 / 5
An AI engineer writes in their architecture doc: "The agent uses a ReAct loop — it reasons about the task, selects a tool, observes the result, and repeats until it reaches a final answer or hits the max iteration limit." What is an AI agent in this context?
An AI agent is an LLM-powered system that can autonomously plan, use tools, and iterate across multiple steps to complete a goal. Unlike a single prompt → response interaction, an agent operates in a loop. ReAct (Reason + Act) is a prompting strategy where the model alternates between reasoning about the next action and executing it, using the observation as input to the next reasoning step.

Agent vocabulary:
- Tool use — the model calls external functions (web search, code execution, database queries, API calls).
- Function calling / tool calling — a structured way to invoke external tools from within the model.
- Planning — decomposing a task into sub-steps.
- Memory — short-term (context window) vs long-term (vector DB, external store).
- Orchestration — managing multiple agents or steps (frameworks: LangChain, LlamaIndex, AutoGen, CrewAI).
- Max iterations — a safety limit to prevent infinite loops.
- System prompt — instructions that define the agent's persona, capabilities, and constraints.

In conversation: "We moved from a single-shot prompt to an agent because the task required searching the web, running code, and synthesising results from multiple sources."
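A minimal sketch of the ReAct loop from the architecture doc, assuming a hypothetical `call_llm()` (any chat-completion API) and two illustrative stub tools; the `ACTION:`/`FINAL:` format is an assumption made up for this example, not a standard.

```python
def call_llm(prompt: str) -> str:
    """Hypothetical LLM call -- replace with any chat-completion API."""
    raise NotImplementedError

TOOLS = {
    "search": lambda q: f"(web results for {q!r})",  # illustrative stub
    "calculate": lambda expr: str(eval(expr)),       # toy calculator only
}

def run_agent(task: str, max_iterations: int = 5) -> str:
    history = f"Task: {task}\n"
    for _ in range(max_iterations):      # safety limit against infinite loops
        # Reason: ask the model for its next step in a fixed format.
        step = call_llm(
            history + "Reply with 'ACTION: <tool> <input>' or 'FINAL: <answer>'."
        )
        if step.startswith("FINAL:"):
            return step.removeprefix("FINAL:").strip()
        # Act: parse and execute the chosen tool.
        _, tool, tool_input = step.split(maxsplit=2)
        observation = TOOLS[tool](tool_input)
        # Observe: feed the result back into the next reasoning step.
        history += f"{step}\nObservation: {observation}\n"
    return "Stopped: hit the max iteration limit."
```

Production frameworks (LangChain, AutoGen, etc.) add structured function calling, error handling, and memory, but the reason–act–observe loop is the core.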
4 / 5
A team discusses model deployment cost: "The 70B model gives better results but inference is too expensive at scale. We're exploring quantisation — going from FP16 to INT4 — to run it on a single GPU." What does quantisation mean?
Quantisation is the process of representing model weights using fewer bits, reducing memory and computational requirements at some accuracy trade-off.

Precision levels:
- FP32 — 32-bit floating point (training standard).
- FP16 / BF16 — 16-bit (common for inference).
- INT8 — 8-bit integer (~4× smaller than FP32).
- INT4 — 4-bit (aggressive, but surprisingly good quality with methods like GPTQ, AWQ).

Common quantisation tools:
- GPTQ — post-training quantisation for transformers.
- AWQ (Activation-aware Weight Quantisation) — preserves accuracy better at INT4.
- llama.cpp / GGUF format — enables running quantised models on CPU/Mac.
- bitsandbytes — 8-bit/4-bit quantisation library for PyTorch.

Why it matters: a 70B model in FP16 requires ~140 GB of VRAM for the weights alone; in INT4 it requires ~35 GB, fitting on a single high-end GPU.

Related terms:
- VRAM — GPU memory.
- Batch size — number of inputs processed in parallel.
- Throughput — tokens generated per second.
- Latency — response delay, often measured as time to first token.

In conversation: "We quantised the model to 4-bit and got 15% accuracy degradation on our benchmark — acceptable for our use case but not for medical applications."
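The memory arithmetic and the core idea of quantisation both fit in a short sketch. The second half is a minimal symmetric INT8 example for illustration; real methods like GPTQ and AWQ add calibration data, per-group scales, and error correction.

```python
import numpy as np

# Memory arithmetic from above: bytes per parameter x parameter count.
params = 70e9
for name, bytes_per_param in [("FP32", 4), ("FP16", 2), ("INT8", 1), ("INT4", 0.5)]:
    print(f"{name}: ~{params * bytes_per_param / 1e9:.0f} GB of weights")
# FP32: ~280 GB, FP16: ~140 GB, INT8: ~70 GB, INT4: ~35 GB

# Minimal symmetric INT8 quantisation of one weight tensor.
w = np.random.randn(4, 4).astype(np.float32)
scale = np.abs(w).max() / 127                  # map the largest weight to +/-127
w_int8 = np.round(w / scale).astype(np.int8)   # store 1 byte per weight
w_dequant = w_int8.astype(np.float32) * scale  # approximate reconstruction
print("max quantisation error:", np.abs(w - w_dequant).max())
```

The reconstruction error is the accuracy trade-off: fewer bits per weight means a coarser grid and a larger gap between `w` and `w_dequant`.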
5 / 5
A developer describes their prompting approach: "I use chain-of-thought prompting — I add 'think step by step' to complex reasoning tasks. The model's performance on multi-step problems improved significantly compared to direct-answer prompts." What is chain-of-thought (CoT) prompting?
Chain-of-Thought (CoT) prompting is a technique where you instruct the LLM to reason through a problem step by step before giving a final answer. This dramatically improves performance on arithmetic, logical reasoning, and multi-step tasks. The phrase "Let's think step by step" (Kojima et al., 2022) is the canonical zero-shot CoT trigger.

Prompting techniques vocabulary:
- Zero-shot prompting — asking the model without any examples.
- Few-shot prompting — providing 2–5 examples in the prompt.
- Zero-shot CoT — "think step by step" without examples.
- Few-shot CoT — examples that include reasoning steps.
- Self-consistency — generate multiple CoT solutions and take the majority answer.
- Tree of Thoughts (ToT) — explore multiple reasoning paths and evaluate them.
- System prompt — instructions at the start of the context that shape the model's behaviour.
- Temperature — controls randomness: 0 = deterministic (greedy), 1+ = creative/varied.
- Top-p / nucleus sampling — an alternative randomness control.

Why CoT works: it forces the model to "use" intermediate computations in its context window rather than jumping directly to an answer, reducing errors in complex reasoning.
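Zero-shot CoT and self-consistency can be sketched together, again assuming a hypothetical `call_llm()` for any chat-completion API; the "Answer: <value>" final line is a formatting convention invented for this example.

```python
from collections import Counter

def call_llm(prompt: str, temperature: float = 0.0) -> str:
    """Hypothetical LLM call -- replace with any chat-completion API."""
    raise NotImplementedError

question = "A train covers 60 km in 40 minutes. How far does it travel in 2 hours?"

direct_prompt = question                               # direct-answer prompt
cot_prompt = question + "\nLet's think step by step."  # zero-shot CoT trigger

def self_consistency(prompt: str, n: int = 5) -> str:
    """Self-consistency: sample n CoT solutions at temperature > 0 and
    return the majority answer. Assumes each response ends with a line
    like 'Answer: <value>'."""
    finals = [call_llm(prompt, temperature=0.8).strip().splitlines()[-1]
              for _ in range(n)]
    return Counter(finals).most_common(1)[0][0]
```

The only difference between the two prompts is the trigger phrase, which is what makes zero-shot CoT so cheap to try on any multi-step task.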