Optimization Nuggets: Stochastic Polyak Step-size, Part 2

Faster rates under strong convexity

⊕
By Fabian Pedregosa and Fabian Schaipp.

Category: optimization
#SGD #proofs

Sun 19 November 2023

This blog post discusses the convergence rate of the Stochastic Gradient Descent with Stochastic Polyak Step-size (SGD-SPS) algorithm for minimizing a finite sum objective. Building upon the proof of the previous post, we show that the convergence rate can be improved to O(1/t) under the additional assumption that …

Optimization Nuggets: Stochastic Polyak Step-size

A simple step-size tuner with optimal rates

⊕
By Fabian Pedregosa.

Category: optimization
#SGD #proofs

Fri 29 September 2023

The stochastic Polyak step-size (SPS) is a practical variant of the Polyak step-size for stochastic optimization. In this blog post, we'll discuss the algorithm and provide a simple analysis for convex objectives with bounded gradients.

Optimization Nuggets: Exponential Convergence of SGD

⊕
By Fabian Pedregosa.

Category: optimization
#SGD #proofs #nuggets

Wed 15 December 2021

This is the first of a series of blog posts on short and beautiful proofs in optimization (let me know what you think in the comments!). For this first post in the series I'll show that stochastic gradient descent (SGD) converges exponentially fast to a neighborhood of the solution.