exam questions

Exam AI-900 All Questions

View all questions & answers for the AI-900 exam

Exam AI-900 topic 1 question 122 discussion

Actual exam question from Microsoft's AI-900
Question #: 122
Topic #: 1
[All AI-900 Questions]

You have an AI solution that provides users with the ability to control smart devices by using verbal commands.
Which two types of natural language processing (NLP) workloads does the solution use? Each correct answer presents part of the solution.
NOTE: Each correct selection is worth one point.

  • A. text-to-speech
  • B. key phrase extraction
  • C. speech-to-text
  • D. language modeling
  • E. translation
Show Suggested Answer Hide Answer
Suggested Answer: CD 🗳️

Comments

Chosen Answer:
This is a voting comment (?). It is better to Upvote an existing comment if you don't have anything to add.
Switch to a voting comment New
ThariCD
Highly Voted 1 year, 12 months ago
Selected Answer: CD
This should be speech-to-text and language modeling. You need to use language modeling to determine the intent of the utterance and to perform an action based on that intent.
upvoted 16 times
...
fguglia
Highly Voted 1 year, 10 months ago
Selected Answer: CD
For me the answer is Speech to Text and Language Modeling
upvoted 10 times
...
QueenShi
Most Recent 1 month ago
Selected Answer: BC
Not a huge fan of this question. It is missing an option that allows for intent recognition like Conversation language understanding or LUIS.
upvoted 1 times
...
argb30
1 month, 4 weeks ago
Selected Answer: BC
Completely agree with VintageLady explanation
upvoted 1 times
...
M2000F007fubar
2 months ago
Selected Answer: CD
answer is C D
upvoted 1 times
...
VintageLady
4 months, 3 weeks ago
Language modeling is based on prediction; it uses the preceding words in a sentence to determine context and "predict" the next word (e.g., autofill), so I don't think it's as useful here (your Alexa does not need to predict what you're asking, only to process the command you give and respond appropriately, so extracting key phrases in what you say to her is more important).
upvoted 4 times
...
AplUSAndmINUS
5 months, 3 weeks ago
Language modeling is too broad of a term to apply here. Though this is part of the process, the system is actually looking for phrases to help it understand what the user wants it to do here. "Turn on", "lights", "close garage door" all require the AI to extract those phrases from what the user is saying, which is key phrase detection. You also need speech-to-text to translate the user's spoken words into text the language model can understand. It's more specific and better answers the question.
upvoted 4 times
...
tsummey
6 months ago
Selected Answer: CD
While it’s true that C. Speech-to-Text is used to interpret what is spoken and convert it into text, and B. Key Phrase Extraction can process the text to identify the main points, these two alone might not be sufficient for a complete AI solution that controls smart devices using verbal commands. it doesn’t necessarily understand the context or the specific actions that need to be taken based on those key phrases. For example, in a command like “Turn on the living room lights”, key phrase extraction might identify “turn on”, “living room”, and “lights” as key phrases, but it doesn’t inherently understand that “turn on” is an action that needs to be applied to the “living room lights”. That's my reason why language modeling is a better answer than key phrase extraction.
upvoted 3 times
...
Alex_W
6 months, 3 weeks ago
Selected Answer: BC
STT plus key phrase extraction perfectly fit the job.
upvoted 2 times
...
sujitwarrier11
7 months, 1 week ago
Selected Answer: BC
I believe the given answer is correct. We dont need chat GPT like functionality here. We just need to know what action needs to be performed on what device. Key phrase extraction is perfect for the job.
upvoted 2 times
...
Scott123
9 months, 4 weeks ago
It should ne AD: Certainly! The AI solution for controlling smart devices via verbal commands utilizes two key types of Natural Language Processing (NLP) workloads: Text-to-Speech (TTS): TTS is a critical component that converts written text into spoken language. It enables the system to communicate with users by generating human-like speech from textual input1. Speech-to-Text (STT): STT, also known as Automatic Speech Recognition (ASR), performs the opposite function. It transcribes spoken language into written text, allowing the system to understand and process verbal commands2
upvoted 2 times
...
TDAC
11 months, 1 week ago
The answer is correct. We all agree speech-to-text is correct. Key Phrase Detection is also correct. Here is why: Key phrase extraction is a technique that identifies the most important phrases in a given text. eg: "Turn the light on", or "What is today's date?" Language modeling is a technique that is used to predict the probability of a sequence of words in a given language. It is used to generate text that is similar to the input text. For example, given the text “The cat sat on the”, a language model would predict that the next word is “mat” with a higher probability than “car”. Since we are talking about a smart device, the answer will be key phrase detection. Upvote me it makes sense to you.
upvoted 6 times
Mehe323
11 months, 1 week ago
I disagree. According to Microsoft, key phrase extraction lists the main concepts from unstructured text. In this case there is no unstructured text. https://learn.microsoft.com/en-us/azure/search/cognitive-search-skill-keyphrases
upvoted 3 times
Alex_W
6 months, 3 weeks ago
Of course there is: extracted from speech-to-text.
upvoted 1 times
...
...
...
1StepGrow
1 year, 2 months ago
Thank you for the valuable information on the blog.I am not an expert in blog writing, but I am reading your content slightly, increasing my confidence in how to give the information properly. Your presentation was also good, and I understood the information easily. For more information Please visit the 1stepGrow website or data science course. https://1stepgrow.com/advance-data-science-and-artificial-intelligence-course/
upvoted 1 times
...
rdemontis
1 year, 7 months ago
Selected Answer: CD
Considering the scenario described where the goal is to control smart devices using voice commands, the most appropriate choice would be to use speech-to-text conversion as the first step in the process and then apply language modeling to generate consistent and meaningful responses or actions based on the commands recognized in the produced text. This could allow the AI to understand the users' intent and respond appropriately. Key phrase extraction could also work but is more complex because an additional layer would have to be added that understands user intent based on the combination of keywords extracted. But it would become complex and probably less efficient as well. Language modeling solves this problem natively.
upvoted 3 times
...
master_yoda
1 year, 8 months ago
Selected Answer: BC
These two types of NLP workloads are key phrase extraction and speech-to-text. Key phrase extraction is used to quickly identify the main concepts in text while speech-to-text is used to convert spoken words into written text. The key here is control smart devices, not a human conversation.
upvoted 3 times
...
XtraWest
1 year, 8 months ago
Selected Answer: CD
C. Speech to text D. Natural language understanding (NLU)
upvoted 4 times
...
Rosviul
1 year, 9 months ago
it should be C-D... as per Language modeling: Identify key terms and phrases, understand sentiments, and build conversational interfaces into applications.
upvoted 3 times
...
Community vote distribution
A (35%)
C (25%)
B (20%)
Other
Most Voted
A voting comment increases the vote count for the chosen answer by one.

Upvoting a comment with a selected answer will also increase the vote count towards that answer by one. So if you see a comment that you already agree with, you can upvote it instead of posting a new comment.

SaveCancel
Loading ...
exam
Someone Bought Contributor Access for:
SY0-701
London, 1 minute ago