Redis Vector Search: English for Engineers Building AI Applications
Learn the English terminology for Redis Vector Search — VSS, HNSW index, FT.CREATE, vector fields, KNN queries, hybrid search, and Redis Stack.
Vector search has moved from research papers into production AI applications, and Redis has become one of the most popular choices for storing and querying embeddings at low latency. Engineers building semantic search, recommendation engines, or retrieval-augmented generation (RAG) pipelines need to discuss index types, query strategies, and schema design with precision. This guide covers the English vocabulary that comes up in design reviews, pull request comments, and team discussions when Redis Vector Search is on the stack.
Key Vocabulary
VSS (Vector Similarity Search) — the Redis capability, part of the RediSearch module, that allows storing high-dimensional vectors alongside regular data fields and querying them by similarity rather than exact match. “We chose VSS over a dedicated vector database because we were already running Redis in production and didn’t want to introduce another stateful service.”
HNSW index — Hierarchical Navigable Small World, an approximate nearest-neighbour algorithm that trades a small amount of recall accuracy for dramatically faster query performance at scale. “We switched from a FLAT index to an HNSW index once the embedding count exceeded half a million — query latency dropped from 80 ms to under 5 ms.”
FT.CREATE — the Redis command used to create a search index, including the schema definition for text, numeric, tag, and vector fields.
“The schema in FT.CREATE defines a VECTOR field with the HNSW algorithm and sets the dimension count to 1536 to match the OpenAI Ada embeddings.”
Vector field — a field type within a RediSearch index schema that stores a fixed-dimension floating-point array representing an embedding.
“Each product document has a description_embedding vector field populated by passing the product description through the embedding model at index time.”
KNN query — a K-nearest-neighbour query that returns the K most similar vectors to a given query vector, ranked by cosine or Euclidean distance. “The search endpoint runs a KNN query with K set to 10, then re-ranks the results using a lightweight cross-encoder before returning the top three to the client.”
Hybrid search — a query that combines a vector similarity score with traditional filter conditions (tag, numeric range, full-text) so that results must satisfy both semantic relevance and structured criteria. “We implemented hybrid search so users can say ‘find products similar to this description but only in the footwear category and under £50’ — the pre-filter happens before the KNN stage.”
Redis Stack — the distribution of Redis that bundles core Redis with a set of modules including RediSearch (for vector and full-text search), RedisJSON, RedisTimeSeries, and RedisBloom.
“Make sure the Docker image you pull is redis/redis-stack, not plain redis — VSS requires the RediSearch module that only ships with Redis Stack.”
Embedding dimension — the number of floating-point values in a vector, which must match exactly between the index schema and every vector stored or queried against it. “We hit a dimension mismatch error in staging because the embedding model was updated from 768 to 1024 dimensions but the index schema wasn’t recreated.”
Useful Phrases
- “I’ll run
FT.CREATEwith a VECTOR field using the HNSW algorithm, cosine distance metric, and a dimension of 1536 — then we can start ingesting embeddings.” - “The pre-filter in the hybrid query is reducing the candidate set before the KNN step, which keeps recall high without scanning the full index.”
- “We’re storing the raw JSON document with
JSON.SETand keeping the vector in a separate hash field indexed by RediSearch — that way we can update the embedding without rewriting the whole document.” - “At a million vectors, FLAT search becomes too slow for real-time use; HNSW with
EF_RUNTIMEtuned to 200 gives us the right balance of speed and recall for our use case.” - “The
FT.SEARCHcommand returns both the distance score and the document fields — we normalise the cosine distance to a similarity percentage before displaying it in the UI.”
Common Mistakes
Saying “search by vector” instead of “query by similarity”. Non-native speakers sometimes describe a KNN query as “searching by the vector,” which sounds awkward to English-speaking engineers. The standard phrasing is “query by similarity,” “run a similarity search,” or “perform a nearest-neighbour lookup.” Saying “I’m querying by cosine similarity” is clear and natural.
Mixing up “index” as a noun and a verb without the right preposition. Engineers sometimes say “we index the embeddings to Redis” when the correct phrasing is “we index the embeddings in Redis” or “we store and index the embeddings using RediSearch.” The index itself is in (or on) the database, not to it. This distinction matters in written documentation and design documents.
Confusing “dimension” and “dimensionality”. Both words are correct, but dimension is used as a countable noun in practical Redis conversations: “the vector has 1536 dimensions” or “the embedding dimension is 1536.” Dimensionality is more common in academic writing (“the curse of dimensionality”). In daily engineering English, prefer the simpler dimensions or embedding dimension for clarity.
Mastering this vocabulary lets you participate fully in architecture discussions about AI search systems — from initial schema design through to performance tuning and production incident analysis.