+ 3

Gradient Descent Optimization

Anyone familiar with the G.D.O. techniques help me in this... 1. Adam 2. AdaMax 3. Adagrad 4. Nesterov momentum 5. RMSprop 6. L1 and L2 regularization These are some of the techniques applied in G.D.O. 1. Which one of these algorithms can be used together? 2. Please help with links to resources for further reading. 3. Can I apply them in Linear Regression? I have seen scikit learn implementation not talking about them in its documentation. Thanks in advance

gradients optimization machinelearning

18th Oct 2019, 3:53 AM

Dan Rhamba

2 odpowiedzi

+ 1

Some links: https://ruder.io/optimizing-gradient-descent/ https://arxiv.org/pdf/1609.04747.pdf https://www.kdnuggets.com/2019/06/gradient-descent-algorithms-cheat-sheet.html https://github.com/harshraj11584/Paper-Implementation-Overview-Gradient-Descent-Optimization-Sebastian-Ruder

4th Nov 2019, 3:29 AM

José Ailton Batista da Silva

+ 2

José Ailton Batista da Silva thanks for the links

4th Nov 2019, 3:39 PM

Dan Rhamba

Popularne dzisiaj

1 Votes

Does anyone have the solution for this challenge?

1 Votes

How would you solve the part of the C# Intermediate code project that requires operator overloading?

0 Votes

Why does coding take so long to learn

0 Votes

Solved# Survey data format in coding for data

1 Votes

Solved Ai generated practice the last question

0 Votes

How to add unordered lists in HTML.

0 Votes

What is the use of .kt classes in the React Native project

0 Votes

Why is it so hard to get a job as a junior dev? Or just wanting to do an internship.

1 Votes

Kernel in Jupyter

1 Votes