exam questions

Exam AWS Certified Machine Learning - Specialty All Questions

View all questions & answers for the AWS Certified Machine Learning - Specialty exam

Exam AWS Certified Machine Learning - Specialty topic 1 question 263 discussion

A pharmaceutical company performs periodic audits of clinical trial sites to quickly resolve critical findings. The company stores audit documents in text format. Auditors have requested help from a data science team to quickly analyze the documents. The auditors need to discover the 10 main topics within the documents to prioritize and distribute the review work among the auditing team members. Documents that describe adverse events must receive the highest priority.

A data scientist will use statistical modeling to discover abstract topics and to provide a list of the top words for each category to help the auditors assess the relevance of the topic.

Which algorithms are best suited to this scenario? (Choose two.)

  • A. Latent Dirichlet allocation (LDA)
  • B. Random forest classifier
  • C. Neural topic modeling (NTM)
  • D. Linear support vector machine
  • E. Linear regression
Show Suggested Answer Hide Answer
Suggested Answer: AC 🗳️

Comments

Chosen Answer:
This is a voting comment (?). It is better to Upvote an existing comment if you don't have anything to add.
Switch to a voting comment New
seifskl
Highly Voted 6 months ago
Selected Answer: AC
A. LDA is designed to discover abstract topics in a collection of documents. It is commonly used for topic modeling and is one of the most popular techniques for extracting topics from text data. C. NTM is also used in topic modeling. It uses deep learning to discover topics in a collection of documents, and it can produce similar results to LDA but potentially with better accuracy due to its neural network foundation. Incorrect choices: B. Random forest classifier is a classification algorithm. It is better suited for classification tasks based on labeled data. D. SVM is also a classification algorithm. It works well for binary classification problems. E. Linear regression is a regression algorithm used to predict continuous values. It's not suitable for topic modeling.
upvoted 5 times
...
endeesa
Most Recent 5 months ago
Selected Answer: AC
LDA and NTM are the only applicable options for topic modelling here
upvoted 1 times
...
loict
7 months, 2 weeks ago
Selected Answer: AC
A. YES B. NO - for classification C. YES D. NO - for classification E. NO - for classification
upvoted 2 times
...
Mickey321
8 months, 1 week ago
Selected Answer: AC
both unsupervised learning algorithms that can discover abstract topics in a collection of text documents . These algorithms can help the data scientist to analyze the audit documents and provide a list of the top words for each category to help the auditors assess the relevance of the topic. LDA and NTM are different from other algorithms that are not suitable for this scenario, such as:
upvoted 1 times
...
awsarchitect5
9 months ago
Selected Answer: AC
AC for topics
upvoted 1 times
...
worldboss
9 months, 4 weeks ago
Although you can use both the Amazon SageMaker NTM and LDA algorithms for topic modeling, they are distinct algorithms and can be expected to produce different results on the same input data. A and C https://docs.aws.amazon.com/sagemaker/latest/dg/ntm.html https://docs.aws.amazon.com/sagemaker/latest/dg/lda.html
upvoted 4 times
...
Community vote distribution
A (35%)
C (25%)
B (20%)
Other
Most Voted
A voting comment increases the vote count for the chosen answer by one.

Upvoting a comment with a selected answer will also increase the vote count towards that answer by one. So if you see a comment that you already agree with, you can upvote it instead of posting a new comment.

SaveCancel
Loading ...
exam
Someone Bought Contributor Access for:
SY0-701
London, 1 minute ago