Welcome to ExamTopics
ExamTopics Logo
- Expert Verified, Online, Free.
exam questions

Exam AWS Certified Machine Learning - Specialty All Questions

View all questions & answers for the AWS Certified Machine Learning - Specialty exam

Exam AWS Certified Machine Learning - Specialty topic 1 question 305 discussion

A company wants to forecast the daily price of newly launched products based on 3 years of data for older product prices, sales, and rebates. The time-series data has irregular timestamps and is missing some values.

Data scientist must build a dataset to replace the missing values. The data scientist needs a solution that resamples the data daily and exports the data for further modeling.

Which solution will meet these requirements with the LEAST implementation effort?

  • A. Use Amazon EMR Serverless with PySpark.
  • B. Use AWS Glue DataBrew.
  • C. Use Amazon SageMaker Studio Data Wrangler.
  • D. Use Amazon SageMaker Studio Notebook with Pandas.
Show Suggested Answer Hide Answer
Suggested Answer: C 🗳️

Comments

Chosen Answer:
This is a voting comment (?) , you can switch to a simple comment.
Switch to a voting comment New
GS_77
2 months, 3 weeks ago
Selected Answer: B
While SageMaker Data Wrangler (option C) is also a strong contender, DataBrew is slightly easier to use and requires even less implementation effort, especially for users who may not be as familiar with the SageMaker ecosystem.
upvoted 1 times
spinatram
1 week, 5 days ago
C is the right one I think. For B, you should to feed this data to sagemaker which brings more operational effort than Data Wrangler
upvoted 1 times
...
...
vkbajoria
7 months, 3 weeks ago
Selected Answer: C
Data Wrangler is better for ML work. Brew can be used as well
upvoted 1 times
...
AIWave
8 months, 1 week ago
Selected Answer: C
Data wrangler supports tight integration with Sagemaker and is better suited for this scenario since resampled data is used in further modelling. AWS Glue DataBrew is a data preparation service more for general purpose use.
upvoted 1 times
...
Adzz
8 months, 3 weeks ago
Selected Answer: C
Best for Data Wrangler
upvoted 1 times
...
akdavsan
8 months, 3 weeks ago
Selected Answer: C
This is exactly what Data Wrangler is for
upvoted 1 times
...
kyuhuck
9 months, 1 week ago
Selected Answer: C
Answer: C Explanation: Amazon SageMaker Studio Data Wrangler is a visual data preparation tool that enables users to clean and normalize data without writing any code. Using Data Wrangler, the data scientist can easily import the time-series data from various sources, such as Amazon S3, Amazon Athena, or Amazon Redshift. Data Wrangler can automatically generate data insights and quality reports, which can help identify and fix missing values, outliers, and anomalies in the data. Data Wrangler also provides over 250 built-in transformations, such as resampling, interpolation, aggregation, and filtering, which can be applied to the data with a point-and-click interface. Data Wrangler can also export the prepared data to different destinations, such as Amazon S3, Amazon SageMaker Feature Store, or Amazon SageMaker Pipelines, for further modeling and analysis. D
upvoted 1 times
...
Community vote distribution
A (35%)
C (25%)
B (20%)
Other
Most Voted
A voting comment increases the vote count for the chosen answer by one.

Upvoting a comment with a selected answer will also increase the vote count towards that answer by one. So if you see a comment that you already agree with, you can upvote it instead of posting a new comment.

SaveCancel
Loading ...