Welcome to ExamTopics
ExamTopics Logo
- Expert Verified, Online, Free.
exam questions

Exam AWS Certified Machine Learning - Specialty All Questions

View all questions & answers for the AWS Certified Machine Learning - Specialty exam

Exam AWS Certified Machine Learning - Specialty topic 1 question 311 discussion

A data engineer wants to perform exploratory data analysis (EDA) on a petabyte of data. The data engineer does not want to manage compute resources and wants to pay only for queries that are run. The data engineer must write the analysis by using Python from a Jupyter notebook.

Which solution will meet these requirements?

  • A. Use Apache Spark from within Amazon Athena.
  • B. Use Apache Spark from within Amazon SageMaker.
  • C. Use Apache Spark from within an Amazon EMR cluster.
  • D. Use Apache Spark through an integration with Amazon Redshift.
Show Suggested Answer Hide Answer
Suggested Answer: B 🗳️

Comments

Chosen Answer:
This is a voting comment (?) , you can switch to a simple comment.
Switch to a voting comment New
spinatram
1 week, 5 days ago
B - SageMaker Athena and redshift does not support Apache spark scripts EMR requires managing infra
upvoted 1 times
...
MJSY
1 month, 1 week ago
Selected Answer: B
SageMaker provide serverless, spark cluster notebook, pay as you go if you remember to close it when you finished your work.
upvoted 2 times
...
Shivanshub
2 months, 2 weeks ago
Selected Answer: B
Amazon Athena does not natively support running Python code directly. Amazon Athena is primarily a serverless, interactive query service that allows you to analyze data in Amazon S3 using standard SQL. Use Apache Spark from within Amazon SageMaker. Amazon SageMaker allows you to run Jupyter notebooks and provides managed Apache Spark integration, which means you don't need to manage the underlying compute resources yourself. You can also use SageMaker to perform the analysis and pay only for the resources you consume during the execution of your queries.
upvoted 2 times
...
eicresv2
4 months ago
Selected Answer: A
A and not B also because of paying for queries that you run. Notebooks will continue to run and cost money
upvoted 2 times
...
pandkast
4 months, 3 weeks ago
Selected Answer: A
https://docs.aws.amazon.com/athena/latest/ug/notebooks-spark-working-with-notebooks.html
upvoted 2 times
...
rav009
5 months, 3 weeks ago
Selected Answer: A
https://docs.aws.amazon.com/athena/latest/ug/notebooks-spark-editor.html
upvoted 2 times
...
ddaanndann
7 months, 2 weeks ago
Correct Answer: A https://docs.aws.amazon.com/athena/latest/ug/notebooks-spark.html
upvoted 2 times
...
JonSno
7 months, 2 weeks ago
It's A - Using Apache Spark on Amazon Athena https://aws-sdk-pandas.readthedocs.io/en/3.2.1/tutorials/041%20-%20Apache%20Spark%20on%20Amazon%20Athena.html
upvoted 1 times
...
vkbajoria
7 months, 3 weeks ago
Selected Answer: B
Serverless, Python, and Notebook are key elements for making the decision. It's B
upvoted 2 times
vkbajoria
7 months, 2 weeks ago
I changed my mind, Athena supports spark. It's A
upvoted 1 times
...
...
rav009
8 months ago
Selected Answer: A
https://docs.amazonaws.cn/en_us/athena/latest/ug/notebooks-spark-getting-started.html
upvoted 4 times
...
shivamgulati13
8 months ago
Just thinking out loud, how can it be not Redshift as well? The question also mentions pay for queries, and handle petabyte of data. Spark is an integration possible with Amazon Redshift, and Redshift has serverless version too. https://aws.amazon.com/blogs/aws/new-amazon-redshift-integration-with-apache-spark/
upvoted 1 times
...
AIWave
8 months, 1 week ago
Selected Answer: B
A: No - Athena does not support python code B: Yes - Sagemaker is serverless and SageMaker Processing allows you to run Spark jobs from a Jupyter notebook using Python. You only pay for resources used during processing jobs. C: No - involves managing the EMR cluster. You pay for running EC2 instances whether in use or not. D: No - Redshift can't run spark jobs and no native support for python/Jupiter notebooks
upvoted 4 times
...
Community vote distribution
A (35%)
C (25%)
B (20%)
Other
Most Voted
A voting comment increases the vote count for the chosen answer by one.

Upvoting a comment with a selected answer will also increase the vote count towards that answer by one. So if you see a comment that you already agree with, you can upvote it instead of posting a new comment.

SaveCancel
Loading ...