Vocabulary for LLM Fine-Tuning: 20 Terms Every ML Engineer Should Know
Learn the essential English vocabulary of large language model fine-tuning — LoRA, catastrophic forgetting, instruction tuning, RLHF, and more.
Fine-tuning a large language model involves a distinct vocabulary from general machine learning, drawing on concepts specific to adapting pretrained models efficiently and safely. Whether you’re customizing a model for a domain-specific task or discussing training runs with a research team, this vocabulary lets you communicate precisely about what’s actually happening under the hood.
Core Fine-Tuning Concepts
1. Fine-tuning
The process of continuing to train a pretrained model on a smaller, task-specific dataset, adjusting its weights to specialize its behavior without training from scratch.
Usage: “We fine-tuned the base model on 50,000 support tickets so it better matches our team’s tone and terminology.”
2. Full fine-tuning
Updating all of a model’s parameters during fine-tuning, as opposed to parameter-efficient methods that only update a small subset.
Usage: “Full fine-tuning gave the best accuracy, but it needed four A100s and several hours — LoRA got us 95% of the way there in a fraction of the time.”
3. LoRA (Low-Rank Adaptation)
A parameter-efficient fine-tuning technique that freezes the original model weights and injects small, trainable low-rank matrices into specific layers, dramatically reducing the number of parameters that need updating.
Usage: “We use LoRA adapters so we can maintain a separate lightweight fine-tune per customer without duplicating the entire base model.”
4. QLoRA
A variant of LoRA that combines it with quantization of the base model’s weights, allowing fine-tuning of very large models on consumer-grade hardware with limited memory.
Usage: “QLoRA let us fine-tune a 70-billion-parameter model on a single GPU by quantizing the frozen weights to 4-bit precision.”
5. Instruction tuning
Fine-tuning a model on a dataset of instruction-response pairs so it learns to follow natural language instructions rather than just predicting the next token in arbitrary text.
Usage: “After instruction tuning, the model reliably follows formatting requests like ‘respond only in valid JSON,’ which the base model ignored inconsistently.”
6. RLHF (Reinforcement Learning from Human Feedback)
A training technique where a model is further refined using a reward signal derived from human preference judgments between candidate outputs, typically used to align model behavior with human expectations.
Usage: “RLHF reduced the rate of overly verbose responses, since human raters consistently preferred the more concise candidate answers.”
7. DPO (Direct Preference Optimization)
An alternative to RLHF that optimizes a model directly on preference data without needing to train a separate reward model, simplifying the alignment pipeline.
Usage: “We switched from RLHF to DPO for the last alignment pass — it cut our training pipeline complexity significantly with comparable results.”
Data and Evaluation Vocabulary
8. Catastrophic forgetting
A phenomenon where fine-tuning a model on a new task causes it to lose previously learned capabilities, because the weight updates overwrite knowledge relevant to other tasks.
Usage: “After fine-tuning heavily on customer support transcripts, the model got noticeably worse at general reasoning — a textbook case of catastrophic forgetting.”
9. Overfitting
When a fine-tuned model memorizes patterns specific to the training data rather than learning generalizable behavior, causing poor performance on new, unseen inputs.
Usage: “The eval loss kept improving while validation performance plateaued and then got worse — we were overfitting after epoch three.”
10. Gold dataset / gold labels
A high-quality, carefully verified dataset used as the reference standard for training or evaluation, as opposed to noisier, auto-generated data.
Usage: “We built a gold dataset of 500 hand-reviewed examples to evaluate whether the fine-tune actually improved on the cases that matter most.”
11. Synthetic data
Training data generated by a model (often a larger or more capable one) rather than collected from real-world sources, used to augment or bootstrap a training set.
Usage: “We generated synthetic data by having a larger model produce variations of our seed examples, which tripled our effective training set size.”
12. Held-out set
A portion of data deliberately excluded from training so it can be used to evaluate the model’s performance on genuinely unseen examples.
Usage: “We’re seeing a big gap between training accuracy and the held-out set — that’s a clear overfitting signal.”
13. Perplexity
A metric measuring how well a language model predicts a sample of text — lower perplexity generally indicates the model finds the text more predictable given its training.
Usage: “Perplexity on domain-specific text dropped significantly after fine-tuning, which tracks with the qualitative improvement we saw in outputs.”
Deployment and Adaptation Vocabulary
14. Adapter
A small, separately trained module (as used in LoRA-style methods) that can be attached to or detached from a frozen base model to change its behavior without altering the base weights.
Usage: “We can hot-swap adapters at inference time to serve different customer-specific fine-tunes from the same base model deployment.”
15. Checkpoint
A saved snapshot of a model’s weights at a particular point during training, allowing training to resume, be evaluated, or be rolled back to a specific state.
Usage: “We rolled back to the checkpoint from epoch two after noticing the later checkpoints had degraded on the held-out set.”
16. Distillation
Training a smaller “student” model to mimic the outputs or internal representations of a larger “teacher” model, aiming to retain much of its capability at a fraction of the size and cost.
Usage: “We distilled the large fine-tuned model into a much smaller student model so it could run cost-effectively in production.”
17. Context window
The maximum number of tokens a model can process as input (and sometimes output) in a single call, which limits how much information can be provided at inference time.
Usage: “Fine-tuning didn’t extend the context window — for that we’d need a model variant specifically trained or adapted for longer contexts.”
18. Prompt template / few-shot examples
A structured format, often including example input-output pairs, used to steer a model’s behavior at inference time, sometimes as an alternative to fine-tuning for smaller behavior changes.
Usage: “Before committing to a fine-tune, we tried a few-shot prompt template with three examples — it closed most of the gap without any training at all.”
19. Alignment
The broader effort to make a model’s behavior match human intentions and values, encompassing techniques like RLHF, DPO, and careful instruction tuning.
Usage: “This alignment pass specifically targeted reducing confidently wrong answers, which our users flagged as the most frustrating failure mode.”
20. Reward hacking
A failure mode in RLHF-style training where a model learns to exploit weaknesses in the reward signal to score well without actually improving in the intended way.
Usage: “The model learned to pad responses with hedging phrases that the reward model rated highly — classic reward hacking, and we had to retrain the reward model to penalize it.”
Key Takeaways
- LoRA and QLoRA are the standard parameter-efficient methods to know when discussing cost-effective fine-tuning approaches.
- Catastrophic forgetting and overfitting are the two failure modes to check for first when a fine-tune underperforms on general tasks.
- RLHF and DPO are both alignment techniques, but DPO simplifies the pipeline by skipping a separate reward model — know which one a team is using before joining a discussion.
- Distinguish clearly between a gold dataset (verified, high quality) and synthetic data (model-generated) when discussing training data provenance.
- Adapters and checkpoints are the operational vocabulary for managing multiple fine-tunes and rolling back safely — use them precisely in deployment discussions.
Knowing this vocabulary precisely lets you participate confidently in fine-tuning design discussions, read papers and documentation faster, and avoid the kind of vague language that leads to real confusion about what a training run actually did.