
Gradient Descent Optimization

Anyone familiar with gradient descent optimization (G.D.O.) techniques, please help me with this. These are some of the techniques applied in G.D.O.:

1. Adam
2. AdaMax
3. Adagrad
4. Nesterov momentum
5. RMSprop
6. L1 and L2 regularization

My questions:

1. Which of these algorithms can be used together?
2. Please share links to resources for further reading.
3. Can I apply them to linear regression? I have looked at the scikit-learn implementation, and its documentation does not mention them. (A rough sketch of what I mean is below.)

Thanks in advance
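To make question 3 concrete, here is a minimal NumPy sketch of what I have in mind: plain linear regression fitted with the Adam update rule. The toy data, learning rate, and iteration count are placeholder assumptions of mine, not anything taken from scikit-learn:

```python
import numpy as np

# Hypothetical toy data: y = 3*x + 2 plus a little noise.
rng = np.random.default_rng(0)
X = rng.uniform(-1, 1, size=(200, 1))
y = 3 * X[:, 0] + 2 + rng.normal(0, 0.1, size=200)

# Append a bias column so the intercept is learned as a weight.
Xb = np.hstack([X, np.ones((X.shape[0], 1))])
w = np.zeros(Xb.shape[1])

# Adam hyperparameters (the common defaults from the original paper).
lr, beta1, beta2, eps = 0.01, 0.9, 0.999, 1e-8
m = np.zeros_like(w)  # first-moment (mean of gradients) estimate
v = np.zeros_like(w)  # second-moment (mean of squared gradients) estimate

for t in range(1, 2001):
    # Gradient of the mean squared error with respect to the weights.
    grad = 2 * Xb.T @ (Xb @ w - y) / len(y)
    # Update the exponential moving averages of the moments.
    m = beta1 * m + (1 - beta1) * grad
    v = beta2 * v + (1 - beta2) * grad ** 2
    # Bias-correct the moments, then take the Adam step.
    m_hat = m / (1 - beta1 ** t)
    v_hat = v / (1 - beta2 ** t)
    w -= lr * m_hat / (np.sqrt(v_hat) + eps)

print(w)  # should end up close to [3.0, 2.0]
```

For comparison, scikit-learn's SGDRegressor does expose L1/L2 regularization through its penalty parameter (e.g. SGDRegressor(penalty='l2')), but as far as I can tell it does not offer Adam or RMSprop for linear models.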

18th Oct 2019, 3:53 AM
Dan Rhamba
2 Answers