AdvancedVocabulary#software-architecture#developer-tools#backend

NUMA Awareness Vocabulary

Build fluency in the vocabulary of keeping a thread's memory local to the CPU socket that's actually using it.

0 / 5 completed

1 / 5

At standup, a dev mentions that on a multi-socket server, each CPU can access memory physically attached to its own socket faster than memory attached to a different socket, and writing code that accounts for this difference is called what?

2 / 5

During a design review, the team pins each worker thread to a specific CPU socket and allocates that thread's memory from the same socket, specifically to avoid the extra latency of reaching memory attached to a different socket. Which capability does this provide?

3 / 5

In a code review, a dev notices a performance-sensitive service on multi-socket hardware lets the operating system freely migrate threads between sockets and allocates memory with no regard for which socket will actually run the thread using it. What does this represent?

4 / 5

An incident report shows a performance-sensitive service's memory-access latency was inconsistent and sometimes much higher than expected on multi-socket hardware, because the operating system freely migrated threads between sockets and memory was allocated with no locality consideration. What practice would prevent this?

5 / 5

During a PR review, a teammate asks why the team pins threads to specific sockets instead of just letting the operating system's scheduler freely balance load across every socket however it sees fit. What is the reasoning?