exam questions

Exam AWS Certified Solutions Architect - Professional All Questions

View all questions & answers for the AWS Certified Solutions Architect - Professional exam

Exam AWS Certified Solutions Architect - Professional topic 1 question 836 discussion

A company is using an Amazon EMR cluster to run its big data jobs. The cluster's jobs are invoked by AWS Step Functions Express Workflows that consume various Amazon Simple Queue Service (Amazon SQS) queues. The workload of this solution is variable and unpredictable. Amazon CloudWatch metrics show that the cluster's peak utilization is only 25% at times and that the cluster sits idle the rest of the time.
A solutions architect must optimize the costs of the cluster without negatively impacting the time it takes to run the various jobs.
What is the MOST cost-effective solution that meets these requirements?

  • A. Modify the EMR cluster by turning on automatic scaling of the core nodes and task nodes with a custom policy that is based on cluster utilization. Purchase Reserved Instance capacity to cover the master node.
  • B. Modify the EMR cluster to use an instance fleet of Dedicated On-Demand Instances for the master node and core nodes, and to use Spot Instances for the task nodes. Define target capacity for each node type to cover the load.
  • C. Purchase Reserved Instances for the master node and core nodes. Terminate all existing task nodes in the EMR cluster.
  • D. Modify the EMR cluster to use capacity-optimized Spot Instances and a diversified task fleet. Define target capacity for each node type with a mix of On- Demand Instances and Spot Instances.
Show Suggested Answer Hide Answer
Suggested Answer: D 🗳️

Comments

Chosen Answer:
This is a voting comment (?). It is better to Upvote an existing comment if you don't have anything to add.
Switch to a voting comment New
nyshaa
Highly Voted 2 years, 3 months ago
Selected Answer: A
A. https://docs.aws.amazon.com/emr/latest/ManagementGuide/emr-automatic-scaling.html
upvoted 10 times
...
evargasbrz
Most Recent 1 year, 11 months ago
Selected Answer: A
I'll go with A. B -> is more expensive than A. D-> can negatively impacting the time it takes to run the various jobs.
upvoted 1 times
...
due
2 years ago
Selected Answer: D
cluster's peak utilization is only 25% at times and that the cluster sits idle the rest of the time need to optimized
upvoted 2 times
...
fdoxxx
2 years, 1 month ago
The key part of the question is "without negatively impacting the time it takes to run the various jobs." So Spot Instance for master node seems not to be perfect choice. I am going for B. https://docs.aws.amazon.com/emr/latest/ManagementGuide/emr-plan-instances-guidelines.html#emr-dev-master-instance-group-spot:~:text=label%2Dexpression%3A%20%27CORE%27-,Master%20node%20on%20a%20Spot%20Instance,are%20running%20the%20entire%20cluster%20(all%20instance%20groups)%20as%20Spot%20Instances.,-Core%20nodes%20on
upvoted 1 times
fdoxxx
2 years, 1 month ago
changing to A - RI for master node is better than On-Demand
upvoted 2 times
...
...
AwsBRFan
2 years, 1 month ago
Selected Answer: D
https://aws.amazon.com/blogs/big-data/optimizing-amazon-emr-for-resilience-and-cost-with-capacity-optimized-spot-instances/
upvoted 4 times
...
skywalker
2 years, 1 month ago
A https://docs.aws.amazon.com/emr/latest/ManagementGuide/emr-scale-on-demand.html
upvoted 1 times
...
Ni_yot
2 years, 3 months ago
I also thinks its A. Automatic scaling with a custom policy in Amazon EMR release versions 4.0 and later allows you to programmatically scale out and scale in core nodes and task nodes based on a CloudWatch metric and other parameters that you specify in a scaling policy. Automatic scaling with a custom policy is available with the instance groups configuration and is not available when you use instance fleets. For more information about instance groups and instance fleets, see Create a cluster with instance fleets or uniform instance groups.
upvoted 3 times
...
asfsdfsdf
2 years, 3 months ago
Selected Answer: B
B for me
upvoted 3 times
...
Harithareddynn
2 years, 3 months ago
Selected Answer: B
Between B and D, You cannot use spot for master hence B
upvoted 1 times
...
cen007
2 years, 4 months ago
Selected Answer: B
Reserved Or Dedicated for master node, spot for task nodes
upvoted 1 times
...
Enigmaaaaaa
2 years, 4 months ago
Selected Answer: D
Will go with D due to the below: https://aws.amazon.com/blogs/big-data/strategies-for-reducing-your-amazon-emr-costs/ https://aws.amazon.com/blogs/big-data/optimizing-amazon-emr-for-resilience-and-cost-with-capacity-optimized-spot-instances/ The correct is answer is a combining both A and D since the question asks for "minimizing the time required to perform numerous workloads" it has to be D. https://aws.amazon.com/blogs/big-data/best-practices-for-resizing-and-automatic-scaling-in-amazon-emr/
upvoted 3 times
Enigmaaaaaa
2 years, 4 months ago
Changing to A. You cant define mix of on-demand and spot to master node type
upvoted 3 times
...
...
wannaaws
2 years, 5 months ago
Selected Answer: D
https://docs.aws.amazon.com/emr/latest/ManagementGuide/emr-instance-fleet.html
upvoted 2 times
...
Bigbearcn
2 years, 7 months ago
Selected Answer: D
D is ok. https://aws.amazon.com/blogs/big-data/optimizing-amazon-emr-for-resilience-and-cost-with-capacity-optimized-spot-instances/
upvoted 3 times
Bigbearcn
2 years, 7 months ago
A auto scaling is also workable. but auto scaling need instance group that each node type (master, core, or task) consists of the same instance type and the same purchasing option for instances:On-Demand or Spot. To optimizing cost, D is better.
upvoted 3 times
...
pixepe
2 years, 3 months ago
Seems D is correct based on link https://aws.amazon.com/blogs/big-data/optimizing-amazon-emr-for-resilience-and-cost-with-capacity-optimized-spot-instances/
upvoted 1 times
pixepe
2 years, 2 months ago
Correction: B makes more sense; as task nodes are Spot instances (and remember that task nodes are stateless) D is incorrect, because master node can either be OnDemand or Spot instance, But NOT a MIX of it - Requirement (Define target capacity for each node type with a mix of On- Demand Instances and Spot Instances. Most Voted)
upvoted 1 times
pixepe
2 years, 1 month ago
Correcting it to A The workload of this solution is variable and unpredictable => We need auto-scaling. If they have mentioned EMR managed scaling for B, than only B would have been better. In B there is no mention of scaling, So answer is A
upvoted 1 times
...
...
...
...
Community vote distribution
A (35%)
C (25%)
B (20%)
Other
Most Voted
A voting comment increases the vote count for the chosen answer by one.

Upvoting a comment with a selected answer will also increase the vote count towards that answer by one. So if you see a comment that you already agree with, you can upvote it instead of posting a new comment.

SaveCancel
Loading ...