You have an Azure Machine Learning pipeline that contains a Split Data module. The Split Data module outputs to a Train Model module and a Score Model module. What is the function of the Split Data module?
A.
scaling numeric variables so that they are within a consistent numeric range
B.
creating training and validation datasets
C.
diverting records that have missing data
D.
selecting columns that must be included in the model
Validation is part of testing. After you train your model and test it, you can then evaluate it by providing new cases that were not present in the initial data entries or let it out in the wild to interact and collect input that can be used then for retraining (this is also called active learning)
B is the answer.
https://learn.microsoft.com/en-us/azure/machine-learning/component-reference/split-data?view=azureml-api-2
Use the Split Data component to divide a dataset into two distinct sets.
This component is useful when you need to separate data into training and testing sets. You can also customize the way that data is divided. Some options support randomization of data. Others are tailored for a certain data type or model type.
A voting comment increases the vote count for the chosen answer by one.
Upvoting a comment with a selected answer will also increase the vote count towards that answer by one.
So if you see a comment that you already agree with, you can upvote it instead of posting a new comment.
Clodia
Highly Voted 1 year, 1 month agoxRiot007
4 months agordemontis
Highly Voted 9 months agozellck
Most Recent 8 months, 2 weeks agoMurtuza
11 months ago