English for LlamaIndex Developers
Learn the English vocabulary for LlamaIndex: document loaders, indices, retrievers, and query engines for building RAG applications.
LlamaIndex conversations combine data-pipeline vocabulary with retrieval-augmented-generation terms, so a developer new to RAG needs both the general concepts and LlamaIndex’s specific naming for each pipeline stage.
Key Vocabulary
Document loader — a component that pulls raw content from a source, like a PDF, a website, or a database, and converts it into LlamaIndex’s internal Document objects for indexing.
“Swap the generic file loader for the PDF-specific one — it’s preserving the table structure much better than plain text extraction.”
Node — a chunk of a document, split according to a chosen strategy, that becomes the actual unit stored and retrieved from an index. “These nodes are too large — the retriever keeps pulling back whole sections when we only need the one relevant paragraph.”
Index — a data structure, such as a vector store index or a summary index, that organizes nodes so they can be efficiently searched at query time. “We’re using a vector store index for semantic search, but a summary index would be a better fit for the ‘give me an overview’ queries.”
Retriever — the component responsible for fetching the most relevant nodes from an index given a query, before those nodes are passed to the language model for synthesis. “The retriever’s returning irrelevant nodes — try tuning the similarity threshold or switching to a hybrid retriever before touching the prompt.”
Query engine — the end-to-end pipeline that takes a natural-language question, retrieves relevant nodes, and synthesizes a final answer using an LLM.
“Wrap the retriever and the response synthesizer into a single query engine so the API surface is just .query() for consumers.”
Common Phrases
- “Is the document loader preserving structure, or are we losing table and heading information at ingestion?”
- “Are these nodes chunked at the right size, or is the retriever pulling back too much context?”
- “Which index type fits this use case — vector search, or something more summary-oriented?”
- “Is the retriever the bottleneck here, or is the language model failing to synthesize a good answer from good context?”
- “Should we expose this as a query engine, or does the consumer need lower-level access to the retriever?”
Example Sentences
Debugging a retrieval quality issue: “The answers are vague because the nodes are too coarse — re-chunk with a smaller node size so the retriever can surface the specific passage instead of an entire chapter.”
Explaining an architecture choice: “We picked a vector store index for the FAQ search but kept a separate summary index for the ‘explain this whole document’ feature — one index type doesn’t fit both use cases.”
Reviewing a pull request: “This query engine skips reranking entirely — add a reranker step before synthesis, or we’re relying on raw similarity scores alone.”
Professional Tips
- Distinguish document from node precisely — a document is the whole source, a node is the chunked retrieval unit, and confusing them makes chunking discussions unclear.
- Name the index type explicitly when discussing performance — “the index is slow” is vague; “the vector store index” or “the summary index” tells the team exactly what to investigate.
- Say retriever rather than “search” when diagnosing RAG quality issues — it isolates whether the problem is retrieval or generation.
- Use query engine as the term for the full pipeline when describing the system to non-RAG-specialist teammates — it’s the natural unit of abstraction to expose.
Practice Exercise
- Explain the difference between a document and a node in LlamaIndex’s pipeline.
- Describe what a retriever does and how it differs from a full query engine.
- Write a sentence explaining why chunk size affects retrieval quality.