Gradient Descent Optimization
Could anyone familiar with gradient descent optimization (G.D.O.) techniques help me with this? Some of the techniques applied in gradient descent optimization are:

1. Adam
2. AdaMax
3. Adagrad
4. Nesterov momentum
5. RMSprop
6. L1 and L2 regularization

My questions:

1. Which of these algorithms can be used together?
2. Could you share links to resources for further reading?
3. Can I apply them to linear regression? I have looked at scikit-learn's implementation, and its documentation does not mention them. (There is a small sketch of what I mean at the end of this question.)

Thanks in advance.
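To make questions 1 and 3 concrete, here is a minimal sketch of the kind of thing I mean: Adam combined with an L2 penalty, applied to ordinary linear regression in plain NumPy. Everything below (the synthetic data, hyperparameters, and variable names) is just my own illustration, not taken from scikit-learn:

```python
import numpy as np

# Synthetic linear-regression data: y = X @ w_true + noise
rng = np.random.default_rng(0)
X = rng.normal(size=(200, 3))
w_true = np.array([2.0, -1.0, 0.5])
y = X @ w_true + 0.1 * rng.normal(size=200)

# Adam hyperparameters (the defaults suggested in the Adam paper)
alpha, beta1, beta2, eps = 0.01, 0.9, 0.999, 1e-8
lam = 0.01  # L2 regularization strength (my arbitrary choice)

w = np.zeros(3)  # weights to learn
m = np.zeros(3)  # first-moment (mean of gradients) estimate
v = np.zeros(3)  # second-moment (mean of squared gradients) estimate

for t in range(1, 2001):
    # Gradient of mean squared error plus the L2 penalty term
    grad = 2 * X.T @ (X @ w - y) / len(y) + 2 * lam * w
    # Adam update: exponential moving averages with bias correction
    m = beta1 * m + (1 - beta1) * grad
    v = beta2 * v + (1 - beta2) * grad**2
    m_hat = m / (1 - beta1**t)
    v_hat = v / (1 - beta2**t)
    w -= alpha * m_hat / (np.sqrt(v_hat) + eps)

print(w)  # should end up close to w_true
```

If I understand correctly, the two combine naturally here because the L2 penalty only adds a term to the gradient, while Adam only decides how that gradient is applied to the weights.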
2 Answers
José Ailton Batista da Silva, thanks for the links.