You are building an Azure Machine Learning workflow by using Azure Machine Learning Studio.You create an Azure notebook that supports the Microsoft Cognitive Toolkit.
You need to ensure that the stochastic gradient descent (SGD) configuration maximizes the samples per
second and supports parallel modeling that is managed by a parameter server.
Which SGD algorithm should you use?
A.
DataParallelASGD
B.
DataParallelSGD
C.
ModelAveragingSGD
D.
BlockMomentumSGD
If it is to be managed by parameter server, then it is DataParallelASGD
DataParallelASGD.
https://docs.microsoft.com/en-us/cognitive-toolkit/multiple-gpus-and-machines#5-data-parallel-training-with-1-bit-sgd
8 Data-Parallel Training with Parameter Server
More new 70-774 Questions: https://drive.google.com/drive/folders/1WVXCup_qKNm0iitL4rKQ_hsgZd6M_dQD?usp=sharing