Data Science & ML

Model Serving

/ˈmɒdəl ˈsɜːvɪŋ/

Definition

Deploying a trained ML model as an API endpoint so that applications can request predictions, either in real time or in batches.

Example in context

"The recommendation model is wrapped in a FastAPI server — the product service calls /predict with user context."
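
The pattern in that example can be sketched in a few lines. The sketch below swaps FastAPI for Python's standard-library `http.server` so that it runs without extra dependencies; a framework such as FastAPI would replace the handler class with a decorated function. The `score` function, the catalog contents, and the request and response shapes are all hypothetical stand-ins, since the example does not say what the real model or payload looks like.

```python
import http.client
import json
import threading
from http.server import BaseHTTPRequestHandler, ThreadingHTTPServer


def score(user_context: dict) -> list:
    """Hypothetical stand-in for a trained recommendation model."""
    catalog = {"books": ["novel", "atlas"], "music": ["vinyl", "headphones"]}
    return catalog.get(user_context.get("interest"), ["gift card"])


class PredictHandler(BaseHTTPRequestHandler):
    """Exposes the model at POST /predict (assumed JSON in, JSON out)."""

    def do_POST(self):
        if self.path != "/predict":
            self.send_error(404, "unknown endpoint")
            return
        length = int(self.headers.get("Content-Length", 0))
        try:
            user_context = json.loads(self.rfile.read(length))
        except ValueError:
            user_context = None
        if not isinstance(user_context, dict):
            self.send_error(400, "body must be a JSON object")
            return
        body = json.dumps({"recommendations": score(user_context)}).encode()
        self.send_response(200)
        self.send_header("Content-Type", "application/json")
        self.send_header("Content-Length", str(len(body)))
        self.end_headers()
        self.wfile.write(body)

    def log_message(self, format, *args):
        pass  # keep the demo output quiet


def call_predict(port: int, user_context: dict) -> dict:
    """What a client such as the product service would do."""
    conn = http.client.HTTPConnection("127.0.0.1", port, timeout=5)
    try:
        conn.request(
            "POST",
            "/predict",
            body=json.dumps(user_context).encode(),
            headers={"Content-Type": "application/json"},
        )
        return json.loads(conn.getresponse().read())
    finally:
        conn.close()


# Serve on a free local port in a background thread, make one call, shut down.
server = ThreadingHTTPServer(("127.0.0.1", 0), PredictHandler)
threading.Thread(target=server.serve_forever, daemon=True).start()
result = call_predict(server.server_port, {"interest": "books"})
print(result)  # {'recommendations': ['novel', 'atlas']}
server.shutdown()
server.server_close()
```

The model is loaded once inside the server process and the caller only sees an HTTP contract, so the model can be retrained or replaced without changing the product service. A batch variant would typically accept a list of contexts in one request and return a list of predictions.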
