
Backpropagation Vocabulary

Learn the vocabulary of computing every parameter's gradient in one backward pass through a neural network.


1 / 5

At standup, a dev mentions computing the gradient of a neural network's loss with respect to every parameter by applying the chain rule backward from the output layer to the input layer, reusing intermediate derivatives along the way. What is this algorithm called?
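For reference, the procedure described in this question can be sketched on a tiny scalar network. This is a minimal illustration under stated assumptions, not a production implementation: the network shape (one tanh hidden unit, squared-error loss), the function name `forward_backward`, and the sample values are all hypothetical.

```python
import math

def forward_backward(x, t, w1, b1, w2, b2):
    """One forward pass, then one backward pass over a toy network (hypothetical)."""
    # Forward pass: keep intermediates so the backward pass can reuse them.
    z = w1 * x + b1
    h = math.tanh(z)
    y = w2 * h + b2
    loss = 0.5 * (y - t) ** 2

    # Backward pass: apply the chain rule from the output toward the input.
    dy = y - t                  # d(loss)/dy
    dw2 = dy * h                # reuses dy
    db2 = dy
    dh = dy * w2                # derivative flowing into the hidden unit
    dz = dh * (1.0 - h * h)     # tanh'(z) = 1 - tanh(z)^2, reuses h
    dw1 = dz * x                # reuses dz
    db1 = dz
    return loss, {"w1": dw1, "b1": db1, "w2": dw2, "b2": db2}

# Assumed sample values, chosen only for illustration.
loss, grads = forward_backward(x=0.5, t=1.0, w1=0.8, b1=0.1, w2=-1.2, b2=0.3)
```

Every gradient comes out of a single backward sweep because each step reuses a derivative computed one step earlier (`dy`, then `dh`, then `dz`).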

2 / 5

During a design review, the team relies on backpropagation to train a deep network with many layers, specifically because reusing each layer's intermediate derivatives during the backward pass avoids recomputing a full forward pass for every single parameter. Which capability does this provide?

3 / 5

In a code review, a dev notices that a neural-network training feature estimates each parameter's gradient by perturbing that one parameter slightly and re-running a full forward pass to see how the loss changes, instead of reusing intermediate derivatives in one backward pass. What does this represent?
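The perturb-and-rerun pattern described here might look roughly like the sketch below. The toy network, the names `loss_fn` and `finite_difference_grad`, the step size `eps`, and the sample values are assumptions made for illustration only.

```python
import math

def loss_fn(params, x, t):
    """Full forward pass of a toy network (hypothetical): returns the loss only."""
    w1, b1, w2, b2 = params
    h = math.tanh(w1 * x + b1)
    y = w2 * h + b2
    return 0.5 * (y - t) ** 2

def finite_difference_grad(params, x, t, eps=1e-6):
    """Estimate each gradient entry by nudging one parameter and re-running the forward pass."""
    base = loss_fn(params, x, t)
    grads = []
    for i in range(len(params)):
        bumped = list(params)
        bumped[i] += eps                      # perturb exactly one parameter
        grads.append((loss_fn(bumped, x, t) - base) / eps)
    return grads

# Assumed sample values, chosen only for illustration.
params = [0.8, 0.1, -1.2, 0.3]
approx = finite_difference_grad(params, x=0.5, t=1.0)
```

Each loop iteration pays for a complete forward pass and shares nothing with the other iterations, and the result is only an approximation whose accuracy depends on `eps`.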

4 / 5

An incident report shows a training job's cost scaled directly with the number of parameters in the network, because it estimated each parameter's gradient with a separate perturb-and-rerun forward pass instead of one backward pass reusing intermediate derivatives. What practice would prevent this?
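To make the cost scaling in this scenario concrete, the sketch below counts forward passes for both approaches on a linear model with `n` parameters. The model, the helper names (`make_counted_loss`, `grad_by_perturbation`, `grad_by_backward_pass`), and the data are hypothetical, and the single-layer model is a deliberate simplification of a real network.

```python
def make_counted_loss(x, t):
    """Return a loss function plus a counter of how many forward passes it has run."""
    calls = {"forward": 0}

    def loss(w):
        calls["forward"] += 1
        y = sum(wi * xi for wi, xi in zip(w, x))
        return 0.5 * (y - t) ** 2

    return loss, calls

def grad_by_perturbation(loss, w, eps=1e-6):
    """One baseline forward pass plus one extra forward pass per parameter."""
    base = loss(w)
    grads = []
    for i in range(len(w)):
        bumped = list(w)
        bumped[i] += eps
        grads.append((loss(bumped) - base) / eps)
    return grads

def grad_by_backward_pass(w, x, t):
    """One forward pass, then one backward pass that reuses the shared derivative."""
    y = sum(wi * xi for wi, xi in zip(w, x))   # single forward pass
    dy = y - t                                 # d(loss)/dy, computed once
    return [dy * xi for xi in x]               # reused for every parameter

n = 1000
x = [0.001 * (i + 1) for i in range(n)]
w = [0.01] * n
t = 1.0

loss, calls = make_counted_loss(x, t)
fd = grad_by_perturbation(loss, w)
bp = grad_by_backward_pass(w, x, t)
print(calls["forward"])  # 1001: the baseline pass plus one pass per parameter
```

The perturbation route needs `n + 1` forward passes, so its cost grows with the parameter count, while the backward-pass route does a fixed amount of work per parameter after a single forward pass.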

5 / 5

During a PR review, a teammate asks why the team trains with backpropagation instead of estimating gradients by numerical finite differences, given that finite differences are simpler to implement. What is the reasoning?