Exam AI-900 All Questions

View all questions & answers for the AI-900 exam

Exam AI-900 topic 1 question 122 discussion

Actual exam question from Microsoft's AI-900

Question #: 122
Topic #: 1

You have an AI solution that provides users with the ability to control smart devices by using verbal commands.
Which two types of natural language processing (NLP) workloads does the solution use? Each correct answer presents part of the solution.
NOTE: Each correct selection is worth one point.

A. text-to-speech
B. key phrase extraction
C. speech-to-text
D. language modeling
E. translation

Show Suggested Answer

Suggested Answer: CD 🗳️

by Ous01 at Dec. 6, 2022, 3:59 p.m.

Comments

Submit Cancel

ThariCD

Highly Voted 2 years, 4 months ago

Selected Answer: CD

This should be speech-to-text and language modeling. You need to use language modeling to determine the intent of the utterance and to perform an action based on that intent.

upvoted 16 times

...

fguglia

Highly Voted 2 years, 2 months ago

Selected Answer: CD

For me the answer is Speech to Text and Language Modeling

upvoted 10 times

...

Moon

Most Recent 3 months, 1 week ago

Selected Answer: BC

Deepseek answer is (C & D). ChatGPT and Gimni answer is A & C. My answer is: B & C.

upvoted 1 times

...

mmmmmnm

3 months, 2 weeks ago

Selected Answer: CD

The question only said "by using verbal commands". That means the Key phrase extraction doesn't need to be used.

upvoted 1 times

...

QueenShi

5 months, 1 week ago

Selected Answer: BC

Not a huge fan of this question. It is missing an option that allows for intent recognition like Conversation language understanding or LUIS.

upvoted 2 times

...

argb30

6 months ago

Selected Answer: BC

Completely agree with VintageLady explanation

upvoted 1 times

...

M2000F007fubar

6 months, 1 week ago

Selected Answer: CD

answer is C D

upvoted 1 times

...

VintageLady

9 months ago

Language modeling is based on prediction; it uses the preceding words in a sentence to determine context and "predict" the next word (e.g., autofill), so I don't think it's as useful here (your Alexa does not need to predict what you're asking, only to process the command you give and respond appropriately, so extracting key phrases in what you say to her is more important).

upvoted 5 times

...

AplUSAndmINUS

10 months ago

Language modeling is too broad of a term to apply here. Though this is part of the process, the system is actually looking for phrases to help it understand what the user wants it to do here. "Turn on", "lights", "close garage door" all require the AI to extract those phrases from what the user is saying, which is key phrase detection. You also need speech-to-text to translate the user's spoken words into text the language model can understand. It's more specific and better answers the question.

upvoted 4 times

...

tsummey

10 months, 1 week ago

Selected Answer: CD

While it’s true that C. Speech-to-Text is used to interpret what is spoken and convert it into text, and B. Key Phrase Extraction can process the text to identify the main points, these two alone might not be sufficient for a complete AI solution that controls smart devices using verbal commands. it doesn’t necessarily understand the context or the specific actions that need to be taken based on those key phrases. For example, in a command like “Turn on the living room lights”, key phrase extraction might identify “turn on”, “living room”, and “lights” as key phrases, but it doesn’t inherently understand that “turn on” is an action that needs to be applied to the “living room lights”. That's my reason why language modeling is a better answer than key phrase extraction.

upvoted 3 times

...

Alex_W

11 months ago

Selected Answer: BC

STT plus key phrase extraction perfectly fit the job.

upvoted 2 times

...

sujitwarrier11

11 months, 2 weeks ago

Selected Answer: BC

I believe the given answer is correct. We dont need chat GPT like functionality here. We just need to know what action needs to be performed on what device. Key phrase extraction is perfect for the job.

upvoted 2 times

...

Scott123

1 year, 2 months ago

It should ne AD: Certainly! The AI solution for controlling smart devices via verbal commands utilizes two key types of Natural Language Processing (NLP) workloads: Text-to-Speech (TTS): TTS is a critical component that converts written text into spoken language. It enables the system to communicate with users by generating human-like speech from textual input1. Speech-to-Text (STT): STT, also known as Automatic Speech Recognition (ASR), performs the opposite function. It transcribes spoken language into written text, allowing the system to understand and process verbal commands2

upvoted 2 times

...

TDAC

1 year, 3 months ago

The answer is correct. We all agree speech-to-text is correct. Key Phrase Detection is also correct. Here is why: Key phrase extraction is a technique that identifies the most important phrases in a given text. eg: "Turn the light on", or "What is today's date?" Language modeling is a technique that is used to predict the probability of a sequence of words in a given language. It is used to generate text that is similar to the input text. For example, given the text “The cat sat on the”, a language model would predict that the next word is “mat” with a higher probability than “car”. Since we are talking about a smart device, the answer will be key phrase detection. Upvote me it makes sense to you.

upvoted 6 times

Mehe323

1 year, 3 months ago

I disagree. According to Microsoft, key phrase extraction lists the main concepts from unstructured text. In this case there is no unstructured text. https://learn.microsoft.com/en-us/azure/search/cognitive-search-skill-keyphrases

upvoted 3 times

Alex_W

11 months ago

Of course there is: extracted from speech-to-text.

upvoted 1 times

...

1StepGrow

1 year, 6 months ago

Thank you for the valuable information on the blog.I am not an expert in blog writing, but I am reading your content slightly, increasing my confidence in how to give the information properly. Your presentation was also good, and I understood the information easily. For more information Please visit the 1stepGrow website or data science course. https://1stepgrow.com/advance-data-science-and-artificial-intelligence-course/

upvoted 1 times

...

rdemontis

1 year, 11 months ago

Selected Answer: CD

Considering the scenario described where the goal is to control smart devices using voice commands, the most appropriate choice would be to use speech-to-text conversion as the first step in the process and then apply language modeling to generate consistent and meaningful responses or actions based on the commands recognized in the produced text. This could allow the AI to understand the users' intent and respond appropriately. Key phrase extraction could also work but is more complex because an additional layer would have to be added that understands user intent based on the combination of keywords extracted. But it would become complex and probably less efficient as well. Language modeling solves this problem natively.

upvoted 3 times

...

master_yoda

2 years ago

Selected Answer: BC

These two types of NLP workloads are key phrase extraction and speech-to-text. Key phrase extraction is used to quickly identify the main concepts in text while speech-to-text is used to convert spoken words into written text. The key here is control smart devices, not a human conversation.

upvoted 3 times

...

Load full discussion...

Exam AI-900 All Questions

View all questions & answers for the AI-900 exam

Exam AI-900 topic 1 question 122 discussion

Comments

ThariCD

fguglia

Moon

mmmmmnm

QueenShi

argb30

M2000F007fubar

VintageLady

AplUSAndmINUS

tsummey

Alex_W

sujitwarrier11

Scott123

TDAC

Mehe323

Alex_W

1StepGrow

rdemontis

master_yoda

SY0-701