exam questions

Exam AWS Certified Data Analytics - Specialty All Questions

View all questions & answers for the AWS Certified Data Analytics - Specialty exam

Exam AWS Certified Data Analytics - Specialty topic 1 question 50 discussion

A company wants to improve user satisfaction for its smart home system by adding more features to its recommendation engine. Each sensor asynchronously pushes its nested JSON data into Amazon Kinesis Data Streams using the Kinesis Producer Library (KPL) in Java. Statistics from a set of failed sensors showed that, when a sensor is malfunctioning, its recorded data is not always sent to the cloud.
The company needs a solution that offers near-real-time analytics on the data from the most updated sensors.
Which solution enables the company to meet these requirements?

  • A. Set the RecordMaxBufferedTime property of the KPL to "גˆ’1" to disable the buffering on the sensor side. Use Kinesis Data Analytics to enrich the data based on a company-developed anomaly detection SQL script. Push the enriched data to a fleet of Kinesis data streams and enable the data transformation feature to flatten the JSON file. Instantiate a dense storage Amazon Redshift cluster and use it as the destination for the Kinesis Data Firehose delivery stream.
  • B. Update the sensors code to use the PutRecord/PutRecords call from the Kinesis Data Streams API with the AWS SDK for Java. Use Kinesis Data Analytics to enrich the data based on a company-developed anomaly detection SQL script. Direct the output of KDA application to a Kinesis Data Firehose delivery stream, enable the data transformation feature to flatten the JSON file, and set the Kinesis Data Firehose destination to an Amazon OpenSearch Service (Amazon Elasticsearch Service) cluster.
  • C. Set the RecordMaxBufferedTime property of the KPL to "0" to disable the buffering on the sensor side. Connect for each stream a dedicated Kinesis Data Firehose delivery stream and enable the data transformation feature to flatten the JSON file before sending it to an Amazon S3 bucket. Load the S3 data into an Amazon Redshift cluster.
  • D. Update the sensors code to use the PutRecord/PutRecords call from the Kinesis Data Streams API with the AWS SDK for Java. Use AWS Glue to fetch and process data from the stream using the Kinesis Client Library (KCL). Instantiate an Amazon Elasticsearch Service cluster and use AWS Lambda to directly push data into it.
Show Suggested Answer Hide Answer
Suggested Answer: B 🗳️

Comments

Chosen Answer:
This is a voting comment (?). It is better to Upvote an existing comment if you don't have anything to add.
Switch to a voting comment New
Priyanka_01
Highly Voted 3 years, 7 months ago
B? https://aws.amazon.com/blogs/big-data/perform-near-real-time-analytics-on-streaming-data-with-amazon-kinesis-and-amazon-elasticsearch-service/
upvoted 21 times
GCPereira
1 year, 3 months ago
agreed
upvoted 1 times
...
...
awssp12345
Highly Voted 3 years, 7 months ago
When Not to Use the KPL : The KPL can incur an additional processing delay of up to RecordMaxBufferedTime within the library (user-configurable). Larger values of RecordMaxBufferedTime results in higher packing efficiencies and better performance. Applications that cannot tolerate this additional delay may need to use the AWS SDK directly. For more information about using the AWS SDK with Kinesis Data Streams, see Developing Producers Using the Amazon Kinesis Data Streams API with the AWS SDK for Java. For more information about RecordMaxBufferedTime and other user-configurable properties of the KPL, see Configuring the Kinesis Producer Library. https://docs.aws.amazon.com/streams/latest/dev/developing-producers-with-kpl.html
upvoted 13 times
awssp12345
3 years, 6 months ago
I agree the answer is B.
upvoted 14 times
...
...
pk349
Most Recent 1 year, 11 months ago
B: I passed the test
upvoted 2 times
...
cloudlearnerhere
2 years, 5 months ago
Selected Answer: B
Correct answer is B. Using PutRecord/PutRecords would send the data synchronously to Kinesis Data Streams. Kinesis Data Analytics can be detect anomalies and the data can be pushed to Kinesis Data Firehose with transformation to Elasticsearch for analysis. Option A is wrong as Kinesis data streams does not provide data transformation feature. Option C is wrong as copying data to S3 and loading to Redshift would not make it near-real time. Option D is wrong as using Glue is ideal for batch jobs and not for near-real time analytics.
upvoted 7 times
...
dushmantha
2 years, 8 months ago
Selected Answer: B
Low latency retrival can be achieved with DynamoDB, Redis and OpenSearch. I guess that would be enough to select the answer.
upvoted 1 times
dushmantha
2 years, 8 months ago
D can be eleminated coz, KCL can't read from SDK producer
upvoted 1 times
...
...
rocky48
2 years, 9 months ago
Selected Answer: B
Answer - B
upvoted 1 times
...
treeli
2 years, 10 months ago
Selected Answer: B
near realtime should be opensearch
upvoted 1 times
...
Bik000
2 years, 11 months ago
Selected Answer: B
Answer is B
upvoted 1 times
...
jrheen
2 years, 11 months ago
Answer - B
upvoted 1 times
...
rb39
3 years, 1 month ago
B - near-realtime analytics keyword, only ES can provide that from the set of options
upvoted 2 times
...
aws2019
3 years, 5 months ago
B it is
upvoted 1 times
...
Donell
3 years, 5 months ago
Answer B
upvoted 1 times
...
AjithkumarSL
3 years, 5 months ago
Looks like Answer is B.. https://docs.aws.amazon.com/streams/latest/dev/developing-producers-with-kpl.html When Not to Use the KPL The KPL can incur an additional processing delay of up to RecordMaxBufferedTime within the library (user-configurable). Larger values of RecordMaxBufferedTime results in higher packing efficiencies and better performance. Applications that cannot tolerate this additional delay may need to use the AWS SDK directly.
upvoted 1 times
...
umatrilok
3 years, 5 months ago
B is the answer
upvoted 1 times
...
lostsoul07
3 years, 5 months ago
B is the right answer
upvoted 2 times
...
gtourkas
3 years, 6 months ago
The only thing about B is why transform to CSV since putting to Elastic Search ?
upvoted 1 times
...
freaky
3 years, 6 months ago
Answer B A and C dropped because RedShift is not meant for "near-real-time" analysis. Also, it would require some kind on Visualization on top of it to do the analysis. Dropped D because KDS in itself cannot transform data.
upvoted 2 times
...
Community vote distribution
A (35%)
C (25%)
B (20%)
Other
Most Voted
A voting comment increases the vote count for the chosen answer by one.

Upvoting a comment with a selected answer will also increase the vote count towards that answer by one. So if you see a comment that you already agree with, you can upvote it instead of posting a new comment.

SaveCancel
Loading ...
exam
Someone Bought Contributor Access for:
SY0-701
London, 1 minute ago