Exam AWS Certified Solutions Architect - Professional All Questions

View all questions & answers for the AWS Certified Solutions Architect - Professional exam

Exam AWS Certified Solutions Architect - Professional topic 1 question 475 discussion

Exam question from Amazon's AWS Certified Solutions Architect - Professional

Question #: 475
Topic #: 1

A company has a High Performance Computing (HPC) cluster in its on-premises data center, which runs thousands of jobs in parallel for one week every month, processing petabytes of images. The images are stored on a network file server, which is replicated to a disaster recovery site. The on-premises data center has reached capacity and has started to spread the jobs out over the course of the month in order to better utilize the cluster, causing a delay in the job completion.
The company has asked its Solutions Architect to design a cost-effective solution on AWS to scale beyond the current capacity of 5,000 cores and 10 petabytes of data. The solution must require the least amount of management overhead and maintain the current level of durability.
Which solution will meet the company's requirements?

A. Create a container in the Amazon Elastic Container Registry with the executable file for the job. Use Amazon ECS with Spot Fleet in Auto Scaling groups. Store the raw data in Amazon EBS SC1 volumes and write the output to Amazon S3.
B. Create an Amazon EMR cluster with a combination of On-Demand and Reserved Instance task nodes that will use Spark to pull data from Amazon S3. Use Amazon DynamoDB to maintain a list of jobs that need to be processed by the Amazon EMR cluster.
C. Store the raw data in Amazon S3, and use AWS Batch with Managed Compute Environments to create Spot Fleets. Submit jobs to AWS Batch Job Queues to pull down objects from Amazon S3 onto Amazon EBS volumes for temporary storage to be processed, and then write the results back to Amazon S3.
D. Submit the list of jobs to be processed to an Amazon SQS queue. Create a diversified cluster of Amazon EC2 worker instances using Spot Fleet that will automatically scale based on the queue depth. Use Amazon EFS to store all the data, sharing it across all instances in the cluster.

Suggested Answer: C

by tkanmani76 at Dec. 31, 2021, 1:33 a.m.

Disclaimers:

- ExamTopics website is not related to, affiliated with, endorsed or authorized by Amazon.
- Trademarks, certification & product names are used for reference only and belong to Amazon.

Comments

3a632a3

1 year, 3 months ago

Selected Answer: B

B is the most cost-effective option: EMR uses S3 as its file system (EMRFS), 10 PB of data costs about $200k there, and it uses Spot Instances for the task nodes. C may or may not be doable, because the question doesn't specify how much data each job requires, and EBS is limited to 64 TB per volume (provided the OS supports that much). At $1.287M for 10 PB of EBS storage alone, plus the IOPS cost for io2, it looks like a more costly solution. D is technically more feasible than C but more costly, at $3M for 10 PB in EFS alone.

upvoted 1 time

...
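The storage comparison in the comment above can be reproduced with a few lines of arithmetic. This is only a rough sketch: the per-GB monthly prices below are illustrative assumptions (not quoted from the thread and not current AWS pricing), a petabyte is taken as 1,000,000 GB, and the names `ASSUMED_PRICE_PER_GB_MONTH`, `monthly_storage_cost`, and `compare_storage` are hypothetical helpers.

```python
# Rough monthly storage cost comparison for a fixed data volume.
# All prices are ASSUMED placeholder values in USD per GB-month;
# check the current AWS pricing pages before relying on them.
GB_PER_PB = 1_000_000  # decimal petabyte, an assumption for simplicity

ASSUMED_PRICE_PER_GB_MONTH = {
    "s3_standard": 0.021,   # assumed
    "ebs_io2": 0.125,       # assumed, storage only (no IOPS charge)
    "efs_standard": 0.30,   # assumed
}


def monthly_storage_cost(petabytes: float, price_per_gb_month: float) -> float:
    """Return the monthly cost in USD of storing `petabytes` at a flat price."""
    return petabytes * GB_PER_PB * price_per_gb_month


def compare_storage(petabytes: float, prices: dict) -> dict:
    """Map each storage option to its monthly cost for the given volume."""
    return {
        name: monthly_storage_cost(petabytes, price)
        for name, price in prices.items()
    }


if __name__ == "__main__":
    costs = compare_storage(10, ASSUMED_PRICE_PER_GB_MONTH)
    # Print cheapest first.
    for name, cost in sorted(costs.items(), key=lambda item: item[1]):
        print(f"{name}: ${cost:,.0f} per month")
```

Under these assumed prices the ordering matches the comment: S3 is cheapest by a wide margin, then EBS, then EFS. Note that in option C the EBS volumes only hold the objects a job is currently working on, so the full 10 PB would not necessarily sit on EBS at once.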

hobokabobo

2 years, 4 months ago

Selected Answer: C

A: I stopped reading after the word "container". B: I like the idea of using Spark, and it can indeed pull from S3, but what is EMR for if the data is in S3 and processed client-side on EC2? Either the data goes into HDFS on the EMR cluster, or Spark needs to run server-side on EMR (which only makes sense if the data is in EMR), not with the data in S3. Clumsy at best. C: Yes. This is batch processing, and this is a perfect setup for batch processing that works, and it says that they move the data to S3. D: EFS? For petabytes of data? Clumsy at best.

upvoted 1 time

...
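For readers unfamiliar with option C, here is a minimal sketch of what a managed AWS Batch compute environment on Spot capacity might look like as a request to boto3's `create_compute_environment`. The helper name `spot_compute_environment_request`, the environment name, the ARNs, the subnet and security group IDs, and the 5,000 vCPU ceiling are all placeholder assumptions. The function only builds the request dictionary, so it runs without AWS credentials.

```python
def spot_compute_environment_request(
    name,
    subnets,
    security_group_ids,
    instance_role_arn,
    service_role_arn,
    max_vcpus=5000,
):
    """Build keyword arguments for boto3's batch.create_compute_environment.

    Every value passed in is a placeholder supplied by the caller; nothing
    here is taken from the exam question except the idea of a managed
    compute environment backed by Spot capacity.
    """
    return {
        "computeEnvironmentName": name,
        "type": "MANAGED",      # AWS Batch manages the instances
        "state": "ENABLED",
        "computeResources": {
            "type": "SPOT",     # use Spot capacity
            "allocationStrategy": "SPOT_CAPACITY_OPTIMIZED",
            "minvCpus": 0,      # scale to zero between monthly runs
            "maxvCpus": max_vcpus,
            "instanceTypes": ["optimal"],
            "subnets": list(subnets),
            "securityGroupIds": list(security_group_ids),
            "instanceRole": instance_role_arn,
        },
        "serviceRole": service_role_arn,
    }


if __name__ == "__main__":
    request = spot_compute_environment_request(
        name="hpc-image-jobs",                      # hypothetical
        subnets=["subnet-EXAMPLE"],                 # hypothetical
        security_group_ids=["sg-EXAMPLE"],          # hypothetical
        instance_role_arn="arn:aws:iam::111122223333:instance-profile/EXAMPLE",
        service_role_arn="arn:aws:iam::111122223333:role/EXAMPLE",
    )
    # To actually create it (requires boto3 and AWS credentials):
    #   import boto3
    #   boto3.client("batch").create_compute_environment(**request)
    print(request["computeResources"]["type"])
```

Setting `minvCpus` to 0 is one way to reflect the one-week-per-month usage pattern, since the environment can then shrink to nothing when the job queue is empty.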

superuser784

2 years, 5 months ago

Selected Answer: C

AWS Batch is more suitable for this case; Amazon EMR is mainly for big data and ML processing.

upvoted 1 time

...

ToanVN1988

2 years, 6 months ago

Selected Answer: C

For cost-effectiveness, C is the best choice.

upvoted 3 times

...

aqiao

2 years, 8 months ago

Selected Answer: C

EMR is not meant for HPC scenarios.

upvoted 3 times

...

TechX

2 years, 9 months ago

Selected Answer: B

It's B. The question says: "The solution must be as low-maintenance as possible while maintaining the existing degree of durability." With B, we can maintain durability and also save cost with a combination of On-Demand and Reserved Instances. The other choices use Spot Instances, which do not satisfy the requirement.

upvoted 2 times

...

jj22222

3 years ago

Selected Answer: C

C is more cost-effective than B.

upvoted 4 times

...

gorodetsky

3 years, 1 month ago

Selected Answer: B

The answer is B. A, C, and D all use Spot Instances, and there is no guarantee of getting a cluster with more than 5,000 cores on Spot Instances.

upvoted 4 times

wassb

2 years, 6 months ago

B is not cost-effective, because you would pay for Reserved Instances all month while using them only one week every month.

upvoted 3 times

...
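The objection to Reserved Instances above comes down to utilization, and a short calculation makes it concrete. This sketch assumes a 30-day month (720 hours), one week of use (168 hours), and that a Reserved Instance is billed for every hour of its term whether or not it is used. The discount figure in the usage example is hypothetical.

```python
def utilization(hours_used: float, hours_in_period: float) -> float:
    """Fraction of the billing period in which the instances do work."""
    return hours_used / hours_in_period


def break_even_discount(hours_used: float, hours_in_period: float) -> float:
    """Discount versus On-Demand that a Reserved Instance must exceed.

    On-Demand cost for the period is rate * hours_used.
    Reserved cost for the period is rate * (1 - discount) * hours_in_period.
    Reserved is cheaper only when (1 - discount) < hours_used / hours_in_period.
    """
    return 1.0 - utilization(hours_used, hours_in_period)


def reserved_is_cheaper(
    discount: float, hours_used: float, hours_in_period: float
) -> bool:
    """True if a Reserved Instance at `discount` beats paying On-Demand."""
    return discount > break_even_discount(hours_used, hours_in_period)


if __name__ == "__main__":
    week, month = 7 * 24, 30 * 24  # 168 and 720 hours (assumed 30-day month)
    print(f"utilization: {utilization(week, month):.1%}")
    print(f"break-even discount: {break_even_discount(week, month):.1%}")
    # 0.40 is a hypothetical discount, not a quoted AWS figure.
    print("reserved cheaper at 40% discount:", reserved_is_cheaper(0.40, week, month))
```

With roughly 23% utilization, a reservation would need a discount of more than about 77% just to match On-Demand pricing for the same week of work.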

HellGate

3 years, 2 months ago

My answer is B. EBS does not seem like a proper storage choice in C.

upvoted 4 times

Jonfernz

2 years, 6 months ago

That's how HPC/batch processing works. You'd have a fleet of EC2 instances processing data stored in a single file store.

upvoted 2 times

...

pititcu667

3 years, 3 months ago

Selected Answer: C

I would go for C.

upvoted 3 times

...

pititcu667

3 years, 3 months ago

I would go for C.

upvoted 2 times

...

peddyua

3 years, 3 months ago

C. See https://aws.amazon.com/blogs/industries/building-a-scalable-image-processing-pipeline-for-image-based-transcriptomics/ and https://docs.aws.amazon.com/wellarchitected/latest/high-performance-computing-lens/batch-based-architecture.html

upvoted 3 times

...

tkanmani76

3 years, 3 months ago

C looks more cost-effective than B.

upvoted 3 times

lavy

3 years, 2 months ago

They are not asking for cost efficiency; they are looking for durability and less maintenance.

upvoted 2 times

wahlbergusa

3 years, 2 months ago

"...with developing a cost-effective solution on AWS..."

upvoted 6 times

...
