LLM-Assisted and Machine Translation in Localization
5 exercises — 5 exercises practising MT post-editing vocabulary, quality estimation, LLM-assisted translation workflows, and MQM scoring.
0 / 5 completed
Quick reference: machine translation in localization
- MTPE — Machine Translation Post-Editing — human translator corrects MT output; full MTPE = thorough correction, light MTPE = minor fixes
- QE score — Quality Estimation — automated assessment of MT output quality without a reference translation; predicts post-editing effort
- DNT list — Do Not Translate — product names, brand terms, and technical identifiers that MT should preserve unchanged
1 / 5
A localization manager proposes switching from human translation to Neural Machine Translation (NMT) for all content. A localization engineer pushes back. Which concern is most valid?
NMT quality is highly content- and language-pair-dependent — "good enough for most content" is not good enough for all content.
NMT performs well for: high-frequency UI strings (short, in-context, high TM leverage), informational content with simple structure, and high-resource language pairs (en→fr, en→de, en→ja). NMT struggles with: creative marketing copy (tone and cultural adaptation), legal text (precise terminology, jurisdiction-specific terms), domain-specific jargon (new product features, technical specifications), and low-resource language pairs (en→Swahili, en→Kazakh). The industry standard today is risk stratification: MT + light MTPE for UI strings, MT + full MTPE for support content, human translation for marketing and legal. No localization engineer should recommend replacing all human translation with NMT without a quality assessment by category.
Key vocabulary:
• NMT (Neural Machine Translation) — deep learning-based MT; current industry standard (DeepL, Google Translate NMT, ModernMT)
• content risk stratification — categorising content by quality risk to determine appropriate MT+human mix
• MT hallucination — MT generating plausible-sounding but factually incorrect translations for domain terms
NMT performs well for: high-frequency UI strings (short, in-context, high TM leverage), informational content with simple structure, and high-resource language pairs (en→fr, en→de, en→ja). NMT struggles with: creative marketing copy (tone and cultural adaptation), legal text (precise terminology, jurisdiction-specific terms), domain-specific jargon (new product features, technical specifications), and low-resource language pairs (en→Swahili, en→Kazakh). The industry standard today is risk stratification: MT + light MTPE for UI strings, MT + full MTPE for support content, human translation for marketing and legal. No localization engineer should recommend replacing all human translation with NMT without a quality assessment by category.
Key vocabulary:
• NMT (Neural Machine Translation) — deep learning-based MT; current industry standard (DeepL, Google Translate NMT, ModernMT)
• content risk stratification — categorising content by quality risk to determine appropriate MT+human mix
• MT hallucination — MT generating plausible-sounding but factually incorrect translations for domain terms