Knowledge Management English for Engineering Teams

Explore the English vocabulary engineering teams use to document knowledge — runbooks, postmortems, ADRs, and more.

Introduction

Engineering teams accumulate knowledge constantly — in code, in conversations, in incident responses, and in architectural decisions. The problem is that most of this knowledge lives in people’s heads rather than in systems, and when those people leave or move to other teams, the knowledge goes with them. Effective knowledge management is the practice of making team knowledge durable, discoverable, and useful. This post covers the eight terms that form the foundation of knowledge management in modern engineering organizations.

Knowledge Management Vocabulary

Institutional knowledge — The accumulated understanding, context, and expertise that exists within an organization, particularly the informal, undocumented knowledge that long-tenured employees carry. Institutional knowledge is valuable precisely because it is hard to acquire and easy to lose.

“When the founding engineer left last year, we realized how much institutional knowledge had never been documented — entire classes of edge cases in the payment flow existed only in her memory.”

Runbook — A documented set of procedures for operating, maintaining, or troubleshooting a system or service. Runbooks are intended to be followed during incidents or routine operations, enabling engineers to respond effectively even when they are unfamiliar with a system.

“Every service we run in production must have a runbook that covers the top five failure modes — when an on-call engineer gets paged at 2am, they should be able to diagnose and resolve the most common issues without hunting for context.”

Postmortem — A structured retrospective analysis conducted after a production incident, outage, or significant failure. A good postmortem identifies what happened, the root cause, the contributing factors, and specific action items to prevent recurrence. Most engineering cultures adopt blameless postmortems to encourage honest analysis.

“After the database failover that caused 45 minutes of downtime, we ran a blameless postmortem and identified three systemic issues — the follow-up action items from that document drove our infrastructure roadmap for the next quarter.”

Decision log — A record of significant decisions made during a project or sprint, including the context, the options considered, and the rationale for the chosen path. A decision log prevents teams from relitigating old decisions and gives future engineers visibility into why things are the way they are.

“We keep a decision log in Notion for each major project — when a new engineer joins and asks why we chose GraphQL over REST for the data API, the answer is already documented with the original reasoning.”

Architectural decision record — Commonly called an ADR, this is a short, structured document that captures a significant architectural decision. Unlike a decision log entry, an ADR is typically written before or during the decision to ensure that all options are properly considered and the rationale is clearly articulated.

“I wrote an ADR before we migrated our job queue from Redis to a dedicated message broker — three months later, when the performance issues we anticipated actually appeared, the document made it easy to explain why we had made that call.”

Tribal knowledge — A specific type of institutional knowledge held by a small group or tribe within a larger organization. It refers to specialized understanding that is not written down and only accessible by being part of a particular team or community of practice.

“The deployment process for the legacy monolith is pure tribal knowledge — only two engineers know the full sequence of manual steps required, and we need to document and automate it before either of them moves to a new role.”

Knowledge base — A centralized repository of documented information, guides, FAQs, runbooks, and reference material that team members can search and access independently. A well-maintained knowledge base reduces reliance on tribal knowledge and decreases the number of repetitive questions that disrupt senior engineers.

“We rebuilt our internal knowledge base in Confluence last quarter and categorized everything by service domain — onboarding time for new engineers dropped from six weeks to three because answers to common questions were actually findable.”

Single source of truth — The principle that every piece of important information should be stored in exactly one authoritative location, with all other references pointing back to that source. It prevents the confusion and errors that arise when multiple copies of the same information exist and diverge over time.

“Our infrastructure configuration must have a single source of truth in Terraform — if the same resource is defined in both Terraform and a manual script, we will eventually have a discrepancy that causes an incident.”

Why Knowledge Management Is an Engineering Problem

Many engineering teams treat knowledge management as a documentation problem — something that should happen after the real work is done. This framing is wrong. Documentation debt compounds just like technical debt. The longer a team waits to capture institutional knowledge, the more expensive it becomes to recover it, and the more risk the team carries from key-person dependencies.

The most effective engineering organizations treat knowledge management as a continuous practice rather than a periodic cleanup. They write runbooks before systems go live, not after the first incident. They write postmortems within 48 hours of an incident, while memory is fresh. They maintain ADRs as part of the design process, not as an afterthought.

Getting Started

If your team does not have a knowledge management practice, start with two things: a runbook for your most critical service and a postmortem template that your team agrees to use after every significant incident. These two artifacts alone will begin to make your team’s knowledge more durable and your on-call rotations less stressful. From there, building a decision log and a lightweight knowledge base becomes a natural extension of the same discipline.