exam questions

Exam Certified Machine Learning Associate All Questions

View all questions & answers for the Certified Machine Learning Associate exam

Exam Certified Machine Learning Associate topic 1 question 19 discussion

Actual exam question from Databricks's Certified Machine Learning Associate
Question #: 19
Topic #: 1
[All Certified Machine Learning Associate Questions]

Which of the Spark operations can be used to randomly split a Spark DataFrame into a training DataFrame and a test DataFrame for downstream use?

  • A. TrainValidationSplit
  • B. DataFrame.where
  • C. CrossValidator
  • D. TrainValidationSplitModel
  • E. DataFrame.randomSplit
Show Suggested Answer Hide Answer
Suggested Answer: E 🗳️

Comments

Chosen Answer:
This is a voting comment (?). It is better to Upvote an existing comment if you don't have anything to add.
Switch to a voting comment New
oliver29
1 month, 4 weeks ago
Selected Answer: E
DataFrame.randomSplit is specifically designed to randomly split a Spark DataFrame into multiple subsets based on specified proportions.
upvoted 1 times
...
Community vote distribution
A (35%)
C (25%)
B (20%)
Other
Most Voted
A voting comment increases the vote count for the chosen answer by one.

Upvoting a comment with a selected answer will also increase the vote count towards that answer by one. So if you see a comment that you already agree with, you can upvote it instead of posting a new comment.

SaveCancel
Loading ...
exam
Someone Bought Contributor Access for:
SY0-701
London, 1 minute ago