Exam AWS Certified Data Engineer - Associate DEA-C01 topic 1 question 26 discussion

A company is planning to use a provisioned Amazon EMR cluster that runs Apache Spark jobs to perform big data analysis. The company requires high reliability. A big data team must follow best practices for running cost-optimized and long-running workloads on Amazon EMR. The team must find a solution that will maintain the company's current level of performance.
Which combination of resources will meet these requirements MOST cost-effectively? (Choose two.)

  • A. Use Hadoop Distributed File System (HDFS) as a persistent data store.
  • B. Use Amazon S3 as a persistent data store.
  • C. Use x86-based instances for core nodes and task nodes.
  • D. Use Graviton instances for core nodes and task nodes.
  • E. Use Spot Instances for all primary nodes.
Suggested Answer: BD
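
For illustration, the suggested combination (S3 as the persistent store, Graviton instances for core and task nodes) could be written as a cluster definition along these lines. This is a minimal sketch, not something taken from the question: the cluster name, release label, instance sizes, node counts, and bucket name are all assumptions, and the dictionary only follows the shape of the boto3 EMR `run_job_flow` request without calling AWS.

```python
def build_emr_cluster_config(log_bucket: str) -> dict:
    """Build a dict shaped like the arguments of boto3's EMR run_job_flow.

    Assumptions (not from the question): cluster name, release label,
    instance type, instance counts, and the bucket name supplied by the caller.
    """
    graviton_type = "m6g.xlarge"  # m6g is a Graviton2-based instance family

    def group(name: str, role: str, count: int) -> dict:
        # Every group is On-Demand here; Spot for the primary node
        # (option E) would conflict with the reliability requirement.
        return {
            "Name": name,
            "InstanceRole": role,
            "Market": "ON_DEMAND",
            "InstanceType": graviton_type,
            "InstanceCount": count,
        }

    return {
        "Name": "spark-analysis",        # hypothetical cluster name
        "ReleaseLabel": "emr-6.15.0",    # assumed release
        "Applications": [{"Name": "Spark"}],
        # Logs are written to S3 so they outlive the cluster.
        "LogUri": f"s3://{log_bucket}/emr-logs/",
        "Instances": {
            "KeepJobFlowAliveWhenNoSteps": True,  # long-running cluster
            "InstanceGroups": [
                group("primary", "MASTER", 1),
                group("core", "CORE", 2),   # assumed count
                group("task", "TASK", 2),   # assumed count
            ],
        },
    }
```

A real request would also need IAM roles (`JobFlowRole`, `ServiceRole`), which are left out here, and the Spark jobs themselves would read and write `s3://` paths rather than HDFS paths for anything that must persist.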

Comments

[Removed]
Highly Voted 9 months ago
Selected Answer: BD
HDFS is not recommended for persistent storage because once a cluster is terminated, all HDFS data is lost. Also, long-running workloads can fill the disk space quickly. Thus, S3 is the best option since it's highly available, durable, and scalable. AWS Graviton-based instances cost up to 20% less than comparable x86-based Amazon EC2 instances: https://aws.amazon.com/ec2/graviton/
upvoted 9 times
BartoszGolebiowski24
8 months, 1 week ago
That is true if you are using instance storage, but you can use EBS instead. HDFS on EBS performs better than S3, and performance is the keyword in the question, so EBS > S3. I would rather select AD.
upvoted 1 times
...
...
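
The "up to 20% less" figure quoted in the top comment can be turned into a rough estimate. The sketch below is only arithmetic under stated assumptions: the x86 hourly price and the node count are hypothetical, and 20% is treated as the best case rather than a guaranteed discount.

```python
HOURS_PER_MONTH = 730  # common approximation used in monthly cost estimates


def monthly_cost(hourly_price: float, node_count: int) -> float:
    """Cost of running node_count instances around the clock for one month."""
    return hourly_price * node_count * HOURS_PER_MONTH


def graviton_savings(x86_hourly_price: float, node_count: int,
                     discount: float = 0.20) -> tuple[float, float]:
    """Return (x86 monthly cost, estimated Graviton monthly cost).

    discount is the assumed price reduction; 0.20 is the "up to" figure,
    so real savings may be smaller.
    """
    x86 = monthly_cost(x86_hourly_price, node_count)
    graviton = x86 * (1.0 - discount)
    return x86, graviton


# Hypothetical inputs: 10 core/task nodes at $0.20 per hour on x86.
x86_total, graviton_total = graviton_savings(0.20, 10)
print(f"x86: ${x86_total:.2f}/month, Graviton: ${graviton_total:.2f}/month")
```

For a long-running cluster the difference accumulates every hour, which is why the instance family matters more here than it would for a short-lived job.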
sam_pre
Most Recent 3 weeks, 1 day ago
Selected Answer: BD
Cost-effective + high reliability > S3. Graviton > low cost.
upvoted 1 times
...
ttpro1995
3 months, 3 weeks ago
Selected Answer: BD
Rule of thumb: pick the AWS in-house solution provided for that service. Graviton is the AWS processor, and EMRFS on S3 is likewise the in-house storage option.
upvoted 1 times
...
pypelyncar
4 months, 1 week ago
Selected Answer: BD
S3, no question. As for Graviton:
Cost-effectiveness: Graviton instances are ARM-based instances specifically designed for cloud workloads. They offer significant cost savings compared to x86-based instances while delivering comparable or better performance for many Apache Spark workloads.
Performance: Graviton instances are optimized for Spark workloads and can deliver the same level of performance as x86-based instances in many cases. Additionally, EMR offers performance-optimized versions of Spark built for Graviton instances.
upvoted 3 times
...
okechi
6 months, 1 week ago
My answer is BE
upvoted 1 times
chris_spencer
6 months ago
E is incorrect. Spot Instances do not provide the high reliability the company requires.
upvoted 4 times
...
...
certplan
7 months ago
A: AWS recommends using Amazon S3 as a persistent data store for Amazon EMR due to its scalability, durability, and cost-effectiveness. Storing data in HDFS would require managing and maintaining additional infrastructure, which may incur higher costs in terms of storage, management, and scalability compared to using Amazon S3. AWS documentation emphasizes the benefits of integrating Amazon EMR with Amazon S3 for cost optimization and efficiency.
D: While Graviton instances may offer cost savings in certain scenarios, they might not always be the most cost-effective option depending on the specific workload requirements and availability of compatible software. x86-based instances are more commonly supported by a broader range of software and frameworks, which could result in better performance and compatibility in some cases. Additionally, AWS documentation on instance types and pricing can provide insights into the cost-effectiveness of Graviton instances compared to x86-based instances.
upvoted 2 times
...
GiorgioGss
7 months, 1 week ago
Selected Answer: BD
B and D.
upvoted 3 times
nyaopoko
6 months, 2 weeks ago
Yes, BD is the answer.
upvoted 1 times
...
...