exam questions

Exam AWS Certified Data Analytics - Specialty All Questions

View all questions & answers for the AWS Certified Data Analytics - Specialty exam

Exam AWS Certified Data Analytics - Specialty topic 1 question 137 discussion

A data analyst runs a large number of data manipulation language (DML) queries by using Amazon Athena with the JDBC driver. Recently, a query failed after it ran for 30 minutes. The query returned the following message: java.sql.SQLException: Query timeout
The data analyst does not immediately need the query results. However, the data analyst needs a long-term solution for this problem.
Which solution will meet these requirements?

  • A. Split the query into smaller queries to search smaller subsets of data
  • B. In the settings for Athena, adjust the DML query timeout limit
  • C. In the Service Quotas console, request an increase for the DML query timeout
  • D. Save the tables as compressed .csv files
Show Suggested Answer Hide Answer
Suggested Answer: C 🗳️

Comments

Chosen Answer:
This is a voting comment (?). It is better to Upvote an existing comment if you don't have anything to add.
Switch to a voting comment New
f4bi4n
Highly Voted 2 years, 11 months ago
Not sure if C fulfills the "long term solution" part of the question. In the worst case, he needs to adjust it every time the data grows and the query takes again longer. What about A?
upvoted 10 times
...
MLCL
Highly Voted 1 year, 8 months ago
Selected Answer: C
Right answer is A but exam answer is C, they want you to know about service quotas.
upvoted 5 times
...
LocalHero
Most Recent 1 year, 5 months ago
Default Athena’s max query time out minute is 30min. so we needs to up Service Quoata.
upvoted 1 times
...
rlnd2000
1 year, 6 months ago
Selected Answer: A
I believe the best choice is A. From the given information, we aren't clear on how much we can extend the timeout—would 60 minutes suffice? It's uncertain. Increasing timeouts without a clear rationale isn't a recommended approach, as there are costs associated with prolonged query times, both in terms of resources and expenses. Operational Best Practices: Increasing timeouts without understanding the actual need can be considered a "band-aid" solution. It might temporarily solve the symptom (i.e., timeout) but not the underlying problem causing the delay.:)
upvoted 1 times
...
Mirandaali
1 year, 11 months ago
Selected Answer: A
C is not a sustainable solution. So A is correct
upvoted 3 times
...
pk349
1 year, 11 months ago
A: I passed the test
upvoted 2 times
...
rsn
2 years ago
Selected Answer: A
A seems to be the long term fix here. I go with A
upvoted 2 times
...
CleverMonkey092
2 years, 1 month ago
why not D?
upvoted 1 times
...
AwsNewPeople
2 years, 1 month ago
Selected Answer: C
Answer is C: By default, the maximum timeout limit for a single query execution in Athena is 30 minutes. If the query requires more than 30 minutes to complete, it can be terminated with a "Query timeout" error. Splitting the query into smaller queries, as mentioned in option A, can work but may be time-consuming and not a scalable solution for large datasets. Requesting an increase in the DML query timeout via the Service Quotas console, as mentioned in option C, can provide a more scalable and long-term solution to the problem. Option D, saving tables as compressed .csv files, is not a solution to the problem described in the scenario. It will not address the query timeout issue and does not provide a long-term solution for the problem.
upvoted 4 times
...
rsn
2 years, 1 month ago
Selected Answer: A
I will go with A as that appears to be the long term fix. C talks about increasing the timeout but there is no mention of increase by what extent
upvoted 1 times
...
aara98
2 years, 1 month ago
The 30-minute DDL query limit is a soft limit, and an increase can be requested in the Service Quotas console.
upvoted 1 times
...
Merrick
2 years, 2 months ago
C https://docs.aws.amazon.com/athena/latest/ug/service-limits.html
upvoted 1 times
...
jazzok
2 years, 6 months ago
For A, as there’re a large number of queries that need to be split, the effort is huge. Some queries may still need a longer time to complete, and it’s OK for the data analyst to wait for them. So increasing the timeout limit is the long-term solution. I go with C.
upvoted 3 times
dinodragon
2 years, 6 months ago
It says "a query failed" so effort is not to split many queries but one. Since "data analyst" is running the query, splitting query is also possible. Not sure between A or C but cannot rule out A.
upvoted 2 times
...
...
rocky48
2 years, 9 months ago
Selected Answer: C
Selected Answer: C
upvoted 2 times
...
certificationJunkie
2 years, 11 months ago
A looks correct option. timeout could occur if the spool size of a query is too much an spice is not able to generate output in specified time. Increasing timeout is not a long term solution as other queries may continue to timeout with such large data sets. So better to limit the scope by splitting the queries for specific limited dataset even if that means running query multiple times.
upvoted 4 times
...
jrheen
2 years, 12 months ago
Answer - C
upvoted 1 times
...
astalavista1
3 years ago
Selected Answer: C
C - Per amazon doc - https://docs.aws.amazon.com/athena/latest/ug/service-limits.html
upvoted 3 times
astalavista1
3 years ago
The quota page specifically shows 30 mins max for DML if an increase isn't requested. https://docs.aws.amazon.com/general/latest/gr/athena.html#amazon-athena-limits
upvoted 2 times
...
...
Community vote distribution
A (35%)
C (25%)
B (20%)
Other
Most Voted
A voting comment increases the vote count for the chosen answer by one.

Upvoting a comment with a selected answer will also increase the vote count towards that answer by one. So if you see a comment that you already agree with, you can upvote it instead of posting a new comment.

SaveCancel
Loading ...
exam
Someone Bought Contributor Access for:
SY0-701
London, 1 minute ago