Question 1

What is database normalisation and why does it matter in schema design?

Accepted Answer

Normalisation is the process of organising a relational database to reduce data redundancy and improve data integrity. It proceeds through normal forms (1NF, 2NF, 3NF, BCNF), each eliminating a specific type of anomaly — insertion, update, or deletion. A well-normalised schema ensures that every fact is stored in exactly one place, making updates consistent and queries predictable.

Question 2

What vocabulary do engineers use when discussing Entity-Relationship (ER) diagrams?

Accepted Answer

Common ER vocabulary includes entities (the objects being modelled), attributes (their properties), and relationships (associations between entities). Cardinality terms — one-to-one, one-to-many, many-to-many — describe how instances relate. Engineers also use primary key, foreign key, composite key, identifying relationship, and weak entity.

Question 3

How do engineers talk about indexing strategy in schema design reviews?

Accepted Answer

Indexing discussions centre on selectivity, cardinality, and query access patterns. Engineers ask what columns appear in WHERE clauses and JOIN conditions, and consider B-tree indexes for range queries, hash indexes for equality lookups, and composite indexes for multi-column predicates. Key terms include covering index, partial index, index scan vs. sequential scan, and write amplification.

Question 4

What is denormalisation and when is it appropriate?

Accepted Answer

Denormalisation intentionally introduces redundancy to improve read performance. It is appropriate when normalised queries require expensive joins and read load greatly exceeds write load. Techniques include storing pre-aggregated counts, duplicating frequently joined columns, or materialising views.

Question 5

What language is used for schema migration conversations?

Accepted Answer

Schema migration vocabulary covers migration scripts, up/down migrations, zero-downtime migrations, and the expand-contract pattern. Engineers discuss additive migrations versus breaking migrations, and use phrases like backfill the data in batches and coordinate the deploy with the migration rollout.

Question 6

How do you explain the difference between relational and NoSQL schema design?

Accepted Answer

Relational schemas enforce structure upfront via DDL and normalise data into tables with SQL and strong consistency. NoSQL schema design is schema-on-read, allowing heterogeneous records in documents, key-value pairs, or wide-column stores. The choice depends on access patterns, consistency requirements, and scale.

Question 7

What does cardinality mean in database indexing vs. ER modelling?

Accepted Answer

In ER modelling, cardinality describes the count relationship between entity instances. In indexing, cardinality refers to the number of distinct values in an indexed column: high-cardinality columns are good index candidates, while low-cardinality columns provide little selectivity.

Question 8

What is a composite primary key and when should you use one?

Accepted Answer

A composite primary key uses two or more columns together to uniquely identify a row. It is natural for junction tables in many-to-many relationships. Engineers use composite keys when no single column is a sufficient natural identifier and adding a surrogate key would be redundant.

Question 9

How do engineers describe referential integrity and constraint enforcement?

Accepted Answer

Referential integrity means every foreign key value must correspond to an existing primary key in the parent table. Engineers enforce it with FOREIGN KEY constraints and discuss ON DELETE CASCADE, ON DELETE RESTRICT, and ON DELETE SET NULL behaviours.

Question 10

What is the expand-contract pattern in zero-downtime schema migrations?

Accepted Answer

The expand-contract pattern breaks a breaking schema change into three phases: Expand (add new structure alongside old), Migrate (backfill existing data), and Contract (remove old structure). This allows continuous deployment without downtime or table locks.

Database Schema Design Language Exercises

Frequently Asked Questions