Exam AWS Certified Solutions Architect - Professional topic 1 question 475 discussion

A company has a High Performance Computing (HPC) cluster in its on-premises data center, which runs thousands of jobs in parallel for one week every month, processing petabytes of images. The images are stored on a network file server, which is replicated to a disaster recovery site. The on-premises data center has reached capacity and has started to spread the jobs out over the course of the month in order to better utilize the cluster, causing a delay in the job completion.
The company has asked its Solutions Architect to design a cost-effective solution on AWS to scale beyond the current capacity of 5,000 cores and 10 petabytes of data. The solution must require the least amount of management overhead and maintain the current level of durability.
Which solution will meet the company's requirements?

  • A. Create a container in the Amazon Elastic Container Registry with the executable file for the job. Use Amazon ECS with Spot Fleet in Auto Scaling groups. Store the raw data in Amazon EBS SC1 volumes and write the output to Amazon S3.
  • B. Create an Amazon EMR cluster with a combination of On Demand and Reserved Instance Task Nodes that will use Spark to pull data from Amazon S3. Use Amazon DynamoDB to maintain a list of jobs that need to be processed by the Amazon EMR cluster.
  • C. Store the raw data in Amazon S3, and use AWS Batch with Managed Compute Environments to create Spot Fleets. Submit jobs to AWS Batch Job Queues to pull down objects from Amazon S3 onto Amazon EBS volumes for temporary storage to be processed, and then write the results back to Amazon S3.
  • D. Submit the list of jobs to be processed to an Amazon SQS to queue the jobs that need to be processed. Create a diversified cluster of Amazon EC2 worker instances using Spot Fleet that will automatically scale based on the queue depth. Use Amazon EFS to store all the data sharing it across all instances in the cluster.
Suggested Answer: C
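For readers who want to picture option C, here is a minimal sketch of the two request payloads it implies: a managed AWS Batch compute environment backed by Spot capacity, and a job queue that points at it. The dictionaries follow the parameter names of boto3's `create_compute_environment` and `create_job_queue` calls for the `batch` client. Every resource name, subnet ID, security group ID, and role below is a hypothetical placeholder, and the 5,000 vCPU ceiling is only an assumption taken from the "5,000 cores" in the question.

```python
# Sketch of option C. All names and IDs are hypothetical placeholders.

def build_compute_environment(max_vcpus: int) -> dict:
    """Keyword arguments shaped for boto3 batch.create_compute_environment."""
    return {
        "computeEnvironmentName": "hpc-images-spot",  # hypothetical
        "type": "MANAGED",              # AWS Batch manages the instances
        "state": "ENABLED",
        "computeResources": {
            "type": "SPOT",             # run on Spot capacity
            "allocationStrategy": "SPOT_CAPACITY_OPTIMIZED",
            "minvCpus": 0,              # scale to zero between monthly runs
            "maxvCpus": max_vcpus,
            "instanceTypes": ["optimal"],
            "subnets": ["subnet-EXAMPLE"],        # hypothetical
            "securityGroupIds": ["sg-EXAMPLE"],   # hypothetical
            "instanceRole": "ecsInstanceRole",    # hypothetical
        },
    }


def build_job_queue(compute_environment_name: str) -> dict:
    """Keyword arguments shaped for boto3 batch.create_job_queue."""
    return {
        "jobQueueName": "hpc-images-queue",  # hypothetical
        "state": "ENABLED",
        "priority": 1,
        "computeEnvironmentOrder": [
            {"order": 1, "computeEnvironment": compute_environment_name},
        ],
    }


if __name__ == "__main__":
    env = build_compute_environment(max_vcpus=5000)
    queue = build_job_queue(env["computeEnvironmentName"])
    # With credentials configured, these could be passed to boto3:
    #   batch = boto3.client("batch")
    #   batch.create_compute_environment(**env)
    #   batch.create_job_queue(**queue)
    print(env["computeResources"]["type"], queue["jobQueueName"])
```

Setting `minvCpus` to 0 is what lets the environment shrink to nothing outside the one busy week, which is the cost argument several comments make for C.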

Comments

3a632a3
1 year, 3 months ago
Selected Answer: B
B is the most cost-effective, because EMR uses S3 as its file system (EMRFS). Storing 10 PB of data in S3 costs about $200k, and it is using Spot Instances for the task nodes. C may or may not be doable: it doesn't specify how much data each job requires, and EBS is limited to 64 TB per volume, provided the OS supports that much. Considering that 10 PB of EBS storage alone is $1.287M plus the IOPS cost for io2, it seems like a more costly solution. D is technically more feasible than C but more costly, at $3M for 10 PB in EFS alone.
upvoted 1 times
...
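The storage figures in the comment above can be roughly reproduced with simple arithmetic. The sketch below is not from the thread: the per-GB-month rates are assumptions, chosen as illustrative list prices, and current AWS pricing should be checked before relying on them. It counts storage only (no IOPS, request, or transfer charges) and uses decimal petabytes, so the EBS result lands near, not exactly on, the $1.287M quoted.

```python
# Back-of-the-envelope monthly storage cost for a given number of petabytes.
# ASSUMPTION: these USD per GB-month rates are illustrative, not authoritative.
ASSUMED_RATE_PER_GB_MONTH = {
    "S3 Standard": 0.021,
    "EBS io2": 0.125,
    "EFS Standard": 0.30,
}

GB_PER_PB = 1_000_000  # decimal units (1 PB = 10^6 GB)


def monthly_storage_cost(petabytes: float, rate_per_gb_month: float) -> float:
    """Storage-only cost in USD per month."""
    return petabytes * GB_PER_PB * rate_per_gb_month


def cost_table(petabytes: float) -> dict:
    """Monthly cost per storage service under the assumed rates."""
    return {
        name: monthly_storage_cost(petabytes, rate)
        for name, rate in ASSUMED_RATE_PER_GB_MONTH.items()
    }


if __name__ == "__main__":
    for name, cost in cost_table(10).items():
        print(f"{name:<13} ${cost:>12,.0f} per month")
```

Note that option C uses EBS only as temporary working storage for the objects each job pulls down, so the full 10 PB EBS figure applies only if the entire data set were kept on EBS.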
hobokabobo
2 years, 4 months ago
Selected Answer: C
A: Stopped reading after the word "container". B: I like the idea of using Spark, and it can indeed pull from S3, but what is the EMR cluster for if the data is in S3 and processed client-side on an EC2 instance? Either the data goes into HDFS on the EMR cluster, or Spark needs to run server-side in the EMR cluster (which only makes sense if the data is in the EMR cluster, not in S3). Clumsy at best. C: Yes, this is batch processing, and this is a perfect setup for batch processing that works; it also says that they move the data to S3. D: EFS? For petabytes of data? Clumsy at best.
upvoted 1 times
...
superuser784
2 years, 5 months ago
Selected Answer: C
AWS Batch is more suitable for this case; Amazon EMR is mainly for big data and ML processing.
upvoted 1 times
...
ToanVN1988
2 years, 6 months ago
Selected Answer: C
For cost-effectiveness, C is the best choice.
upvoted 3 times
...
aqiao
2 years, 8 months ago
Selected Answer: C
EMR is not meant for HPC scenarios.
upvoted 3 times
...
TechX
2 years, 9 months ago
Selected Answer: B
It's B. The question said that "The solution must be as low-maintenance as possible while maintaining the existing degree of durability". With B, we can maintain durability and also save cost with a combination of On-Demand and Reserved Instances. The other choices use Spot Instances, which do not satisfy the requirement.
upvoted 2 times
...
jj22222
3 years ago
Selected Answer: C
C is more cost-effective than B.
upvoted 4 times
...
gorodetsky
3 years, 1 month ago
Selected Answer: B
The answer is B. A, C, and D use Spot Instances, and there is no guarantee of getting a cluster with more than 5,000 cores on Spot Instances.
upvoted 4 times
wassb
2 years, 6 months ago
B is not cost-effective, because the Reserved Instances would be used only one week every month.
upvoted 3 times
...
...
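The reply's point about Reserved Instances can be made concrete with a small break-even calculation. This is a sketch under stated assumptions, not a pricing claim: it assumes the cluster is busy for one full week (168 hours) per month, uses an average month of 730 hours, and relies on the fact that a Reserved Instance is billed for every hour of its term whether or not it is used. No actual discount rates are assumed.

```python
# Break-even discount for an always-billed commitment vs. paying on demand
# only for the hours actually used.
HOURS_PER_WEEK = 7 * 24        # 168 busy hours, per the question's one week
AVG_HOURS_PER_MONTH = 730      # ASSUMPTION: average month length in hours


def utilization(busy_hours: float, total_hours: float = AVG_HOURS_PER_MONTH) -> float:
    """Fraction of billed hours in which the instances do useful work."""
    return busy_hours / total_hours


def break_even_discount(util: float) -> float:
    """Discount off the on-demand rate at which both options cost the same.

    On demand:  price * busy_hours
    Reserved:   price * (1 - discount) * total_hours
    Equal when  1 - discount == busy_hours / total_hours.
    """
    return 1.0 - util


if __name__ == "__main__":
    util = utilization(HOURS_PER_WEEK)
    print(f"utilization: {util:.1%}")
    print(f"break-even discount: {break_even_discount(util):.1%}")
```

Under these assumptions the reserved capacity sits idle for roughly three quarters of the month, so it only pays off if its discount exceeds the break-even value printed above.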
HellGate
3 years, 2 months ago
My answer is B. EBS does not seem to be a proper storage choice in C.
upvoted 4 times
Jonfernz
2 years, 6 months ago
That's how HPC/batch processing works. You'd have a fleet of EC2 instances processing data stored in a single file store.
upvoted 2 times
...
...
pititcu667
3 years, 3 months ago
Selected Answer: C
I would go for C.
upvoted 3 times
...
pititcu667
3 years, 3 months ago
I would go for C.
upvoted 2 times
...
peddyua
3 years, 3 months ago
C. See https://aws.amazon.com/blogs/industries/building-a-scalable-image-processing-pipeline-for-image-based-transcriptomics/ and https://docs.aws.amazon.com/wellarchitected/latest/high-performance-computing-lens/batch-based-architecture.html
upvoted 3 times
...
tkanmani76
3 years, 3 months ago
C looks more cost-effective than B.
upvoted 3 times
lavy
3 years, 2 months ago
They are not asking for cost efficiency; the question is looking for durability and less maintenance.
upvoted 2 times
wahlbergusa
3 years, 2 months ago
"...with developing a cost-effective solution on AWS..."
upvoted 6 times
...
...
...