AdvancedVocabulary#ai-llm#data-science-ml#developer-tools

Low-Rank Adaptation (LoRA) Vocabulary

Build fluency in the vocabulary of fine-tuning a model through small, injected low-rank matrices.

0 / 5 completed

1 / 5

At standup, a dev mentions fine-tuning a large pretrained model by injecting a pair of small trainable low-rank matrices alongside its frozen original weights, instead of updating every one of the model's parameters. What is this technique called?

2 / 5

During a design review, the team wants to control how expressive the injected low-rank matrices are, trading off adaptation capacity against the number of trainable parameters. Which capability supports this?

3 / 5

In a code review, a dev notices the team can either merge a LoRA adapter's weights directly into the frozen base model for deployment, or keep it as a separate, swappable module loaded alongside the shared base model. What does this represent?

4 / 5

An incident report shows the team's storage costs grew enormously because every new fine-tuned variant was trained as a full fine-tune of the entire model rather than as a small, separately stored low-rank adapter. What practice would prevent this?

5 / 5

During a PR review, a teammate asks why the team fine-tunes with LoRA's small, injected low-rank matrices instead of just fully fine-tuning every one of the model's own parameters directly. What is the reasoning?