Welcome to ExamTopics
ExamTopics Logo
- Expert Verified, Online, Free.
exam questions

Exam AWS Certified Machine Learning - Specialty All Questions

View all questions & answers for the AWS Certified Machine Learning - Specialty exam

Exam AWS Certified Machine Learning - Specialty topic 1 question 357 discussion

A data scientist needs to create a model for predictive maintenance. The model will be based on historical data to identify rare anomalies in the data.

The historical data is stored in an Amazon S3 bucket. The data scientist needs to use Amazon SageMaker Data Wrangler to ingest the data. The data scientist also needs to perform exploratory data analysis (EDA) to understand the statistical properties of the data.

Which solution will meet these requirements with the LEAST amount of compute resources?

  • A. Import the data by using the None option.
  • B. Import the data by using the Stratified option.
  • C. Import the data by using the First K option. Infer the value of K from domain knowledge.
  • D. Import the data by using the Randomized option. Infer the random size from domain knowledge.
Show Suggested Answer Hide Answer
Suggested Answer: C 🗳️

Comments

Chosen Answer:
This is a voting comment (?) , you can switch to a simple comment.
Switch to a voting comment New
7f1fe73
2 weeks ago
Selected Answer: D
D. Import the data by using the Randomized option. Infer the random size from domain knowledge: This option selects a random sample of the data. Pros: It provides a representative sample of the entire dataset while using fewer compute resources than importing all data. Cons: There's a small chance of missing some rare anomalies, but this risk can be mitigated by choosing an appropriate sample size based on domain knowledge.
upvoted 1 times
...
MultiCloudIronMan
2 weeks, 1 day ago
Selected Answer: C
Why Option C? Efficiency: Importing a subset of the data using the First K option minimizes compute resources while still providing a representative sample for exploratory data analysis (EDA). Domain Knowledge: Leveraging domain knowledge to determine the value of K ensures that the subset is relevant and sufficient for meaningful analysis.
upvoted 2 times
...
Community vote distribution
A (35%)
C (25%)
B (20%)
Other
Most Voted
A voting comment increases the vote count for the chosen answer by one.

Upvoting a comment with a selected answer will also increase the vote count towards that answer by one. So if you see a comment that you already agree with, you can upvote it instead of posting a new comment.

SaveCancel
Loading ...