Note: This question is part of a series of questions that use the same or similar answer choices. An answer
choice may be correct for more than one question in the series. Each question is independent of the other
questions in this series. Information and details provided in a question apply only to that question.
You need to use only one percent of an Apache Hive data table by conducting random sampling by groups.
Which module should you use?
A. Execute Python Script
B. Tune Model Hyperparameters
C. Normalize Data
D. Select Columns in Dataset
E. Import Data
F. Edit Metadata
G. Clip Values
H. Clean Missing Data
Explanation:
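https://docs.microsoft.com/en-us/azure/machine-learning/team-data-science-process/sample-data-hive
The linked article describes how to down-sample data stored in Hive tables with Hive queries, including random sampling by groups. In Azure Machine Learning Studio (classic), the Import Data module can use a Hive query as its data source, so the sampling query can be run as the data is read in.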
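Random sampling by groups in Hive can be sketched as a query that assigns one random key per group and keeps every row of the groups whose key falls at or below the sample rate. The Python helper below only builds such a query string. It is a minimal sketch, not the article's exact text: the function name, the table name `trips`, and the grouping column `medallion` are hypothetical, and the result is roughly (not exactly) one percent of the groups rather than one percent of the rows.

```python
import re

# Conservative check so only plain identifiers are interpolated into the query.
_IDENTIFIER = re.compile(r"^[A-Za-z_][A-Za-z0-9_.]*$")


def build_group_sample_query(table: str, group_col: str, sample_rate: float) -> str:
    """Return a HiveQL string that keeps all rows of a random subset of groups.

    Hypothetical helper for illustration; table and column names are assumptions.
    """
    if not 0.0 < sample_rate <= 1.0:
        raise ValueError("sample_rate must be in (0, 1]")
    for name in (table, group_col):
        if not _IDENTIFIER.match(name):
            raise ValueError(f"unsafe identifier: {name!r}")
    return (
        f"SELECT b.*\n"
        f"FROM {table} b\n"
        f"JOIN (\n"
        f"    SELECT {group_col}\n"
        f"    FROM (\n"
        # One random key per group: rand() is evaluated once per grouped row.
        f"        SELECT {group_col}, rand() AS samplekey\n"
        f"        FROM {table}\n"
        f"        GROUP BY {group_col}\n"
        f"    ) c\n"
        # Keep only the groups whose random key is within the sample rate.
        f"    WHERE c.samplekey <= {sample_rate}\n"
        f") a\n"
        f"ON b.{group_col} = a.{group_col}"
    )


if __name__ == "__main__":
    # Sample about one percent of the groups in the hypothetical table.
    print(build_group_sample_query("trips", "medallion", 0.01))
```

The printed query is the kind of text that would be supplied as the Hive query when configuring the data source, assuming the table and column names are replaced with real ones.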
Clip Values: Used to identify and optionally replace data values that are above or below a specified threshold. This is useful when you want to remove outliers or replace them with a mean, a constant, or another substitute value.