Exam AWS Certified Data Analytics - Specialty topic 1 question 74 discussion

A marketing company is using Amazon EMR clusters for its workloads. The company manually installs third-party libraries on the clusters by logging in to the master nodes. A data analyst needs to create an automated solution to replace the manual process.
Which options can fulfill these requirements? (Choose two.)

  • A. Place the required installation scripts in Amazon S3 and execute them using custom bootstrap actions.
  • B. Place the required installation scripts in Amazon S3 and execute them through Apache Spark in Amazon EMR.
  • C. Install the required third-party libraries in the existing EMR master node. Create an AMI out of that master node and use that custom AMI to re-create the EMR cluster.
  • D. Use an Amazon DynamoDB table to store the list of required applications. Trigger an AWS Lambda function with DynamoDB Streams to install the software.
  • E. Launch an Amazon EC2 instance with Amazon Linux and install the required third-party libraries on the instance. Create an AMI and use that AMI to create the EMR cluster.
Suggested Answer: AE
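For illustration, option A might look roughly like the sketch below. It only builds the keyword arguments that boto3's EMR `run_job_flow` call accepts, so nothing is launched. The bucket, script path, cluster name, release label, instance types, and the default role names are all assumptions made for the example, not values from the question.

```python
def build_cluster_request(script_s3_path, script_args=None):
    """Build keyword arguments for boto3's EMR run_job_flow call.

    The install script is referenced from S3 as a custom bootstrap action.
    All names and sizes below are placeholders (assumptions).
    """
    return {
        "Name": "analytics-cluster",        # hypothetical cluster name
        "ReleaseLabel": "emr-6.15.0",       # assumed release label
        "Applications": [{"Name": "Spark"}],
        "Instances": {
            "InstanceGroups": [
                {"Name": "primary", "InstanceRole": "MASTER",
                 "InstanceType": "m5.xlarge", "InstanceCount": 1},
                {"Name": "core", "InstanceRole": "CORE",
                 "InstanceType": "m5.xlarge", "InstanceCount": 2},
            ],
            "KeepJobFlowAliveWhenNoSteps": True,
        },
        # The custom bootstrap action: a script stored in S3 plus its arguments.
        "BootstrapActions": [
            {
                "Name": "install-third-party-libs",
                "ScriptBootstrapAction": {
                    "Path": script_s3_path,
                    "Args": list(script_args or []),
                },
            }
        ],
        "JobFlowRole": "EMR_EC2_DefaultRole",   # assumed default instance profile
        "ServiceRole": "EMR_DefaultRole",       # assumed default service role
    }


# Hypothetical script location; replace with your own bucket and key.
request = build_cluster_request(
    "s3://example-bucket/bootstrap/install_libs.sh", ["--with-extras"]
)

# To actually launch (requires boto3 and AWS credentials):
#   import boto3
#   boto3.client("emr").run_job_flow(**request)
```

Because bootstrap actions run on every node in the cluster, including nodes added later by a resize, the libraries end up on the core and task nodes as well as the master node.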

Comments

ramozo
Highly Voted 3 years, 7 months ago
I will choose A and E. https://aws.amazon.com/about-aws/whats-new/2017/07/amazon-emr-now-supports-launching-clusters-with-custom-amazon-linux-amis/ https://docs.aws.amazon.com/de_de/emr/latest/ManagementGuide/emr-plan-bootstrap.html
upvoted 25 times
testtaker3434
3 years, 7 months ago
I have doubts about this one... The documentation says you can use E as an option to avoid A. https://docs.aws.amazon.com/emr/latest/ManagementGuide/emr-custom-ami.html
upvoted 3 times
testtaker3434
3 years, 7 months ago
It's A and E: you do one or the other, but you shouldn't do both (as if one were step 1 and the other step 2).
upvoted 4 times
...
...
awssp12345
3 years, 6 months ago
Agreed!
upvoted 1 times
...
...
Paitan
Highly Voted 3 years, 6 months ago
A and E.
upvoted 8 times
...
roymunson
Most Recent 1 year, 5 months ago
AE: I'll pass the test.
upvoted 3 times
...
LocalHero
1 year, 5 months ago
I think the existing cluster must not be changed; that is dangerous. Installing the software on a new EC2 instance is safer, so I think A and E are correct.
upvoted 2 times
...
confuzz
1 year, 9 months ago
AE Custom AMIs created from the base EMR AMI are not supported and will lead to application provisioning errors upon cluster startup. https://medium.com/@amberrunnels/creating-a-custom-ami-on-amazon-emr-a60ddeb7821b
upvoted 2 times
...
Debi_mishra
1 year, 11 months ago
A is very obvious. Between C and E, I prefer E, as creating an AMI from a master node may produce a bulky AMI with a lot of redundant Hadoop libraries that can instead be installed during the bootstrap process.
upvoted 3 times
...
pk349
1 year, 12 months ago
AE: I passed the test
upvoted 1 times
...
VijiTu
2 years, 4 months ago
AE https://docs.aws.amazon.com/emr/latest/ManagementGuide/emr-custom-ami.html#emr-custom-ami-preconfigure
upvoted 1 times
...
nharaz
2 years, 6 months ago
A and E, according to Stephane Maarek's Udemy course.
A: You can use a bootstrap action to install additional software or customize the configuration of the EMR cluster instances. Bootstrap actions are scripts that run on the cluster after Amazon EMR launches the instance using the Amazon Linux Amazon Machine Image (AMI). Bootstrap actions run before Amazon EMR installs the applications that you specify when you create the cluster and before cluster nodes begin processing data.
E: You can create Amazon EMR clusters that have custom Amazon Machine Images (AMIs) running Amazon Linux. You can create the AMI from an EC2 instance running Amazon Linux. Make sure that you have installed all the required third-party libraries on this EC2 instance. This allows you to preload additional software on your AMI and use these AMIs to launch your EMR clusters.
upvoted 1 times
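As a rough sketch of option E: once the libraries are installed on a plain Amazon Linux EC2 instance, the instance can be imaged and the resulting AMI ID passed to EMR. The helpers below take the boto3 clients as parameters and use the `create_image` and `run_job_flow` calls with the `CustomAmiId` parameter. The function names, cluster name, release label, instance types, and role names are hypothetical choices for this example.

```python
def create_ami_from_instance(ec2_client, instance_id, ami_name):
    """Create an AMI from an EC2 instance that already has the libraries installed."""
    response = ec2_client.create_image(InstanceId=instance_id, Name=ami_name)
    return response["ImageId"]


def launch_cluster_with_custom_ami(emr_client, ami_id):
    """Launch an EMR cluster whose nodes boot from the given custom AMI."""
    response = emr_client.run_job_flow(
        Name="analytics-cluster-custom-ami",   # hypothetical cluster name
        ReleaseLabel="emr-6.15.0",             # assumed release label
        CustomAmiId=ami_id,                    # the preloaded AMI
        Instances={
            "MasterInstanceType": "m5.xlarge",  # assumed instance types
            "SlaveInstanceType": "m5.xlarge",
            "InstanceCount": 3,
            "KeepJobFlowAliveWhenNoSteps": True,
        },
        JobFlowRole="EMR_EC2_DefaultRole",     # assumed default instance profile
        ServiceRole="EMR_DefaultRole",         # assumed default service role
    )
    return response["JobFlowId"]


# Usage with real clients (requires boto3 and AWS credentials):
#   import boto3
#   ami_id = create_ami_from_instance(boto3.client("ec2"), "i-...", "emr-libs-ami")
#   # A real script should wait until the image is available before launching.
#   cluster_id = launch_cluster_with_custom_ami(boto3.client("emr"), ami_id)
```

Passing the clients in keeps the two steps easy to try against stubs before pointing them at a real account.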
...
JHJHJHJHJ
2 years, 7 months ago
AC Confirmed by paid dumps
upvoted 1 times
JoellaLi
2 years, 6 months ago
Could you give the reason for C?
upvoted 1 times
sly_tail
2 years, 1 month ago
C instead of E because it's absolutely redundant and inefficient to launch another instance when you already have the same master node.
upvoted 1 times
...
...
...
Sanmeda
2 years, 7 months ago
Answer A & E
upvoted 1 times
...
rocky48
2 years, 9 months ago
Selected Answer: AE
A and E.
upvoted 1 times
...
Bik000
2 years, 11 months ago
Selected Answer: AE
Answer is A & E
upvoted 1 times
...
certificationJunkie
2 years, 11 months ago
A and E are the right answers. How would installing libraries on the master nodes resolve anything? Computation happens on the data nodes (slaves), and all required packages should be installed there.
upvoted 3 times
Ryo0w0o
2 years, 5 months ago
But the question says the company manually installs third-party libraries on the clusters "by logging in to the master nodes". Does that mean they log in there but install the libraries on the slave nodes?
upvoted 2 times
...
...
MWL
3 years ago
Selected Answer: AC
Although most others chose A and E, I think C is right instead of E. For E: launching an EC2 instance and installing this software is not easy for EMR. I installed Hadoop and Spark once, and it took me a lot of time. If I want to build a Hadoop/Hive/Spark environment to be used as AWS EMR, it will take a lot of effort. With C, I can log in to the master node over SSH, install the library, and use the master node's EC2 instance to create a custom AMI. It wastes the previous EMR cluster, but if I wanted to set up Hadoop/Spark/Hive clusters on EC2, I would still need several instances to prepare the AMI. So I vote for AC.
upvoted 2 times
...
RSSRAO
3 years, 2 months ago
Selected Answer: AE
A and E is correct
upvoted 2 times
...