Exam AI-102 topic 4 question 3 discussion

Actual exam question from Microsoft's AI-102
Question #: 3
Topic #: 4

You have an existing Azure Cognitive Search service.
You have an Azure Blob storage account that contains millions of scanned documents stored as images and PDFs.
You need to make the scanned documents available to search as quickly as possible.
What should you do?

  • A. Split the data into multiple blob containers. Create a Cognitive Search service for each container. Within each indexer definition, schedule the same runtime execution pattern.
  • B. Split the data into multiple blob containers. Create an indexer for each container. Increase the search units. Within each indexer definition, schedule a sequential execution pattern.
  • C. Create a Cognitive Search service for each type of document.
  • D. Split the data into multiple virtual folders. Create an indexer for each folder. Increase the search units. Within each indexer definition, schedule the same runtime execution pattern.
Suggested Answer: D 🗳️

Comments

syupwsh
6 days, 12 hours ago
Selected Answer: D
https://learn.microsoft.com/en-us/azure/search/search-how-to-large-index#run-indexers-in-parallel
"If your data source is an Azure Blob Storage container or Azure Data Lake Storage Gen 2, enumerating a large number of blobs can take a long time (even hours) until this operation is completed. As a result, your indexer's documents succeeded count doesn't appear to increase during that time and it might seem it's not making any progress, when it is. If you would like document processing to go faster for a large number of blobs, consider partitioning your data into multiple containers and create parallel indexers pointing to a single index."
D for sure.
upvoted 1 times
...
krzkrzkra
7 months, 1 week ago
Selected Answer: D
Selected Answer: D
upvoted 2 times
...
reigenchimpo
8 months, 2 weeks ago
Selected Answer: D
In my opinion, D is correct on this question.
upvoted 2 times
...
anto69
8 months, 2 weeks ago
Selected Answer: D
D makes sense. "virtual folders".
upvoted 2 times
...
reiwanotora
9 months ago
Selected Answer: D
Focus on the words "virtual folders".
upvoted 3 times
...
Murtuza
10 months, 4 weeks ago
Tricky question. Think of virtual folders as blob containers and the answer will be obvious.
upvoted 2 times
...
evangelist
1 year ago
Selected Answer: D
Option D is the best choice because it leverages the scalability and parallel processing capabilities of Azure Cognitive Search to efficiently index a large volume of documents. By organizing documents into virtual folders and creating an indexer for each folder, you can maximize the throughput of the indexing process. Increasing search units further supports this by allocating more resources to the task, thereby minimizing the time required to make the scanned documents searchable.
upvoted 2 times
famco
5 months, 1 week ago
How is that different from separate containers?
upvoted 2 times
...
...
rdemontis
1 year, 3 months ago
Selected Answer: D
I think the correct answer is D: https://learn.microsoft.com/en-us/azure/search/search-howto-large-index#run-indexers-in-parallel
upvoted 3 times
...
sl_mslconsulting
1 year, 3 months ago
Selected Answer: D
"One search unit in your service can run one indexer at any given time. Creating multiple indexers is only useful if they can run in parallel", so A and C are out. B is out because you are not running the indexers in parallel. Besides, it's hard to imagine that with millions of scanned documents you don't already have virtual folders in place to split the data.
upvoted 1 times
...
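The rule quoted above (one search unit runs one indexer at a time) can be turned into a rough capacity check. The following is a minimal sketch of that arithmetic only, not a model of the real service: the function names are hypothetical, and it assumes every indexer takes about the same time and that nothing else limits concurrency.

```python
import math


def total_search_units(replicas: int, partitions: int) -> int:
    """Search units of a service: replicas multiplied by partitions."""
    if replicas < 1 or partitions < 1:
        raise ValueError("replicas and partitions must both be at least 1")
    return replicas * partitions


def indexing_waves(indexer_count: int, search_units: int) -> int:
    """Rounds of indexer runs needed when each search unit runs one indexer at a time.

    Simplifying assumption: all indexers take roughly the same time, so the
    work proceeds in equal-sized waves.
    """
    if indexer_count < 0:
        raise ValueError("indexer_count must not be negative")
    if search_units < 1:
        raise ValueError("search_units must be at least 1")
    return math.ceil(indexer_count / search_units)


if __name__ == "__main__":
    folders = 12  # hypothetical number of virtual folders, one indexer each
    for units in (1, 4, 12):
        waves = indexing_waves(folders, units)
        print(f"{folders} indexers on {units} search unit(s): {waves} wave(s)")
```

Under these assumptions, adding indexers without adding search units does not reduce the number of waves, which is why option D pairs "create an indexer for each folder" with "increase the search units".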
zellck
1 year, 7 months ago
Selected Answer: D
D is the answer.
https://learn.microsoft.com/en-us/azure/search/search-howto-large-index#run-indexers-in-parallel
"If you partition your data, you can create multiple indexer-data-source combinations that pull from each data source and write to the same search index. Because each indexer is distinct, you can run them at the same time, populating a search index more quickly than if you ran them sequentially. Make sure you have sufficient capacity. One search unit in your service can run one indexer at any given time. Creating multiple indexers is only useful if they can run in parallel."
upvoted 2 times
zellck
1 year, 7 months ago
If your data source is an Azure Blob Storage container or Azure Data Lake Storage Gen 2, enumerating a large number of blobs can take a long time (even hours) until this operation is completed. This will cause that your indexer's documents succeeded count isn't increased during that time and it may seem it's not making any progress, when it is. If you would like document processing to go faster for a large number of blobs, consider partitioning your data into multiple containers and create parallel indexers pointing to a single index.
upvoted 4 times
...
...
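To make "multiple indexers writing to the same index at the same time" concrete, here is a minimal sketch that builds indexer definitions as plain dictionaries in the shape the Azure AI Search REST API uses for indexers (`name`, `dataSourceName`, `targetIndexName`, `schedule`). The folder names, the index name, and the `-indexer` / `-ds` naming scheme are assumptions made up for illustration, and nothing is sent to a service.

```python
import json


def build_indexer(folder: str, index_name: str, start_time: str,
                  interval: str = "PT2H") -> dict:
    """Indexer definition for one partition of the data.

    The "<folder>-indexer" and "<folder>-ds" names are a hypothetical
    convention; any unique names would do.
    """
    return {
        "name": f"{folder}-indexer",
        "dataSourceName": f"{folder}-ds",
        "targetIndexName": index_name,  # every indexer writes to the same index
        "schedule": {"interval": interval, "startTime": start_time},
    }


def build_parallel_indexers(folders: list[str], index_name: str,
                            start_time: str) -> list[dict]:
    """One indexer per folder, all with an identical schedule so they run together."""
    return [build_indexer(folder, index_name, start_time) for folder in folders]


if __name__ == "__main__":
    # Hypothetical folder and index names.
    indexers = build_parallel_indexers(
        ["scans-2020", "scans-2021", "scans-2022"],
        "scanned-docs-index",
        "2024-01-01T00:00:00Z",
    )
    print(json.dumps(indexers, indent=2))
```

Giving every indexer the same `startTime` and `interval` is one reading of the "same runtime execution pattern" wording in option D; staggered schedules would correspond to the sequential pattern in option B.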
Eltooth
2 years, 7 months ago
Selected Answer: D
D is the correct answer. It was also marked correct on a Udemy course practice test.
upvoted 4 times
...
PHD_CHENG
2 years, 8 months ago
Was on exam 7 Jun 2022
upvoted 2 times
...
prabhjot
3 years ago
correct ans
upvoted 1 times
...
azurelearner666
3 years, 7 months ago
How to do this is described here: https://docs.microsoft.com/en-us/azure/search/search-howto-indexing-azure-blob-storage#index-large-datasets
The answer omits the creation of a data source for each virtual folder or blob container. D is not fully correct, but it is the least wrong of the options, so I give it a "pass". Nowadays it is misleading and not fully correct.
upvoted 3 times
...
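The missing step mentioned above, one data source per virtual folder, can be sketched as follows. The dictionaries follow the blob data source shape of the Azure AI Search REST API, where `container.query` selects a virtual folder inside the container. The container name, folder names, `-ds` naming scheme, and placeholder connection string are all hypothetical, and no request is made.

```python
import json


def build_data_source(container: str, folder: str, connection_string: str) -> dict:
    """Blob data source limited to one virtual folder of a container."""
    return {
        "name": f"{folder}-ds",  # hypothetical naming convention
        "type": "azureblob",
        "credentials": {"connectionString": connection_string},
        # Same container for every data source; only the folder differs.
        "container": {"name": container, "query": folder},
    }


def build_data_sources(container: str, folders: list[str],
                       connection_string: str) -> list[dict]:
    """One data source per virtual folder, all pointing at the same container."""
    return [build_data_source(container, f, connection_string) for f in folders]


if __name__ == "__main__":
    sources = build_data_sources(
        "scanned-documents",                       # hypothetical container
        ["scans-2020", "scans-2021", "scans-2022"],  # hypothetical folders
        "<storage-connection-string>",             # placeholder, not a real value
    )
    print(json.dumps(sources, indent=2))
```

This also shows how the virtual-folder approach differs from splitting into separate containers: the data stays in one container and only the `query` value changes per data source.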
azurelearner666
3 years, 7 months ago
seems to be correct
upvoted 2 times
...