AdvancedVocabulary#ai-llm#data-science-ml#developer-tools

Mixture-of-Experts (MoE) Vocabulary

Build fluency in the vocabulary of a model built from sparsely activated expert sub-networks.

0 / 5 completed
1 / 5
At standup, a dev mentions a model built from many separate 'expert' sub-networks, where only a small subset is activated for any given input rather than the whole model running every time. What is this architecture called?