All English for IT articles related to #finetuning.
Learn the English vocabulary of TRL and RLHF for fine-tuning large language models: PPO trainer, reward model, DPO, ORPO, SFT, and chat templates.