Which SGD algorithm should you use?

You are building an Azure Machine Learning workflow by using Azure Machine Learning Studio. You create an Azure notebook that supports the Microsoft Cognitive Toolkit.
You need to ensure that the stochastic gradient descent (SGD) configuration maximizes the samples per second and supports parallel modeling that is managed by a parameter server.
Which SGD algorithm should you use?

A. DataParallelASGD
B. DataParallelSGD
C. ModelAveragingSGD
D. BlockMomentumSGD

Ruchita

If training has to be managed by a parameter server, then the answer is DataParallelASGD.

rai

DataParallelASGD.

https://docs.microsoft.com/en-us/cognitive-toolkit/multiple-gpus-and-machines#5-data-parallel-training-with-1-bit-sgd
See section 8, "Data-Parallel Training with Parameter Server".
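
To illustrate why a parameter server points to the asynchronous variant, here is a minimal toy sketch in plain Python. It is not CNTK code and does not use CNTK's API: `ParameterServer`, `worker`, and `train_async` are hypothetical names made up for this example, and the model is a one-weight linear fit. The idea it shows is only that each worker pulls the shared weight, computes a gradient on its own data shard, and pushes the update without waiting for the other workers.

```python
import random
import threading


class ParameterServer:
    """Hypothetical toy server: holds one shared weight, accepts pushes at any time."""

    def __init__(self, w=0.0, lr=0.05):
        self.w = w
        self.lr = lr
        self._lock = threading.Lock()

    def pull(self):
        with self._lock:
            return self.w

    def push(self, grad):
        # The update is applied immediately; there is no barrier across workers.
        with self._lock:
            self.w -= self.lr * grad


def worker(server, shard, epochs):
    """Run SGD over one data shard, talking only to the server."""
    for _ in range(epochs):
        for x, y in shard:
            w = server.pull()               # may be stale by the time we push
            grad = 2.0 * (w * x - y) * x    # d/dw of (w*x - y)^2
            server.push(grad)


def train_async(num_workers=4, epochs=20, seed=0):
    """Fit y = 3x with several workers that update the server asynchronously."""
    rng = random.Random(seed)
    data = [(x, 3.0 * x) for x in (rng.uniform(-1.0, 1.0) for _ in range(200))]
    shards = [data[i::num_workers] for i in range(num_workers)]  # data parallelism
    server = ParameterServer()
    threads = [
        threading.Thread(target=worker, args=(server, shard, epochs))
        for shard in shards
    ]
    for t in threads:
        t.start()
    for t in threads:
        t.join()
    return server.w


if __name__ == "__main__":
    print(train_async())  # should end up close to the true weight 3.0
```

Because no worker waits at a synchronization point, throughput is not limited by the slowest worker, which is the property the question describes as maximizing samples per second. The trade-off is that gradients may be computed from slightly stale weights.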
