AdamW
Published on: August 5, 2021
![AdamW](/articles/adamw-explained/adamw.png)
Table of Content
AdamW is a stochastic optimization method that modifies the typical implementation of weight decay in Adam to combat Adam's known convergence problems by decoupling the weight decay from the gradient updates.