IOAI ML Notes Mathematics FundamentalsCalculus

Gradient Descent

Math fundamentals note for IOAI ML preparation.

16 February 2026

Syllabus Map

Study map: Syllabus Study Map

Core Idea

Iterative method that updates parameters in the negative gradient direction.

Update Rule

\theta_{t+1}=\theta_t-\eta\nabla_\theta L(\theta_t)

Practical Notes

Learning rate controls stability-speed tradeoff.

Too large diverges; too small is slow.

Mini-batch noise can aid exploration.

Moderate stochasticity can escape poor regions.

Why This Matters for ML

Gradient descent and its variants are the core training engines for most ML models.
Update-rule behavior determines speed, stability, and final model performance.
Batch vs mini-batch tradeoffs affect noise, generalization, and compute efficiency.
Practical tuning of optimizers is impossible without gradient-descent fundamentals.

← Back to Blog