BeginnerVocabulary#Ollama#local LLM#Modelfile#REST API

Ollama Local Model Serving Exercises

Ollama simplifies running large language models locally with automatic model management and a REST API. These exercises cover pulling and running models, creating custom Modelfiles, the streaming REST API, monitoring loaded models, and automatic GPU memory management.

0 / 5 completed

1 / 5

A developer runs ollama run llama3.2 for the first time. What happens before the interactive session starts?

2 / 5

What is the purpose of a Modelfile in Ollama?

3 / 5

A developer calls POST /api/generate on Ollama's REST API with "stream": false. What changes about the response?

4 / 5

Which Ollama CLI command shows the currently running models and their memory usage?

5 / 5

How does Ollama determine how many GPU layers to use when loading a model?