Exam AI-102 All Questions

View all questions & answers for the AI-102 exam

Exam AI-102 topic 4 question 3 discussion

Actual exam question from Microsoft's AI-102

Question #: 3
Topic #: 4

You have an existing Azure Cognitive Search service.
You have an Azure Blob storage account that contains millions of scanned documents stored as images and PDFs.
You need to make the scanned documents available to search as quickly as possible.
What should you do?

A. Split the data into multiple blob containers. Create a Cognitive Search service for each container. Within each indexer definition, schedule the same runtime execution pattern.
B. Split the data into multiple blob containers. Create an indexer for each container. Increase the search units. Within each indexer definition, schedule a sequential execution pattern.
C. Create a Cognitive Search service for each type of document.
D. Split the data into multiple virtual folders. Create an indexer for each folder. Increase the search units. Within each indexer definition, schedule the same runtime execution pattern.

Show Suggested Answer

Suggested Answer: D 🗳️

by azurelearner666 at June 30, 2021, 8 p.m.

Comments

Submit Cancel

syupwsh

6 days, 12 hours ago

Selected Answer: D

https://learn.microsoft.com/en-us/azure/search/search-how-to-large-index#run-indexers-in-parallel If your data source is an Azure Blob Storage container or Azure Data Lake Storage Gen 2, enumerating a large number of blobs can take a long time (even hours) until this operation is completed. As a result, your indexer's documents succeeded count doesn't appear to increase during that time and it might seem it's not making any progress, when it is. If you would like document processing to go faster for a large number of blobs, consider partitioning your data into multiple containers and create parallel indexers pointing to a single index. D for sure

upvoted 1 times

...

krzkrzkra

7 months, 1 week ago

Selected Answer: D

upvoted 2 times

...

reigenchimpo

8 months, 2 weeks ago

Selected Answer: D

In my opinion, D is correct on this question.

upvoted 2 times

...

anto69

8 months, 2 weeks ago

Selected Answer: D

D makes sense. "virtual folders".

upvoted 2 times

...

reiwanotora

9 months ago

Selected Answer: D

FOCUS "virtual folders" word.

upvoted 3 times

...

Murtuza

10 months, 4 weeks ago

Tricky question think of virtual folder AS blob containers and the answer will be obvious

upvoted 2 times

...

evangelist

1 year ago

Selected Answer: D

e, option D is the best choice because it leverages the scalability and parallel processing capabilities of Azure Cognitive Search to efficiently index a large volume of documents. By organizing documents into virtual folders and creating an indexer for each folder, you can maximize the throughput of the indexing process. Increasing search units further supports this by allocating more resources to the task, thereby minimizing the time required to make the scanned documents searchable.

upvoted 2 times

famco

5 months, 1 week ago

how is that different from separate containers?

upvoted 2 times

...

rdemontis

1 year, 3 months ago

Selected Answer: D

I think correct answer is D https://learn.microsoft.com/en-us/azure/search/search-howto-large-index#run-indexers-in-parallel

upvoted 3 times

...

sl_mslconsulting

1 year, 3 months ago

Selected Answer: D

"One search unit in your service can run one indexer at any given time. Creating multiple indexers is only useful if they can run in parallel" so A and C are out. B is out as you are not running the indexers in parallel. Besides it's hard to image that with millions of scanned you don't have virtual folders in place to split the data already.

upvoted 1 times

...

zellck

1 year, 7 months ago

Selected Answer: D

D is the answer. https://learn.microsoft.com/en-us/azure/search/search-howto-large-index#run-indexers-in-parallel If you partition your data, you can create multiple indexer-data-source combinations that pull from each data source and write to the same search index. Because each indexer is distinct, you can run them at the same time, populating a search index more quickly than if you ran them sequentially. Make sure you have sufficient capacity. One search unit in your service can run one indexer at any given time. Creating multiple indexers is only useful if they can run in parallel.

upvoted 2 times

zellck

1 year, 7 months ago

If your data source is an Azure Blob Storage container or Azure Data Lake Storage Gen 2, enumerating a large number of blobs can take a long time (even hours) until this operation is completed. This will cause that your indexer's documents succeeded count isn't increased during that time and it may seem it's not making any progress, when it is. If you would like document processing to go faster for a large number of blobs, consider partitioning your data into multiple containers and create parallel indexers pointing to a single index.

upvoted 4 times

...

Eltooth

2 years, 7 months ago

Selected Answer: D

D is correct answer. Also marked correct on Udemy course practice test.

upvoted 4 times

...

PHD_CHENG

2 years, 8 months ago

Was on exam 7 Jun 2022

upvoted 2 times

...

prabhjot

3 years ago

correct ans

upvoted 1 times

...

azurelearner666

3 years, 7 months ago

how to do this is defined here: https://docs.microsoft.com/en-us/azure/search/search-howto-indexing-azure-blob-storage#index-large-datasets The response is missing the data source creation for each virtual folder or blob container. D is not correct, but the less wrong of a response… So I give it a "pass", nowadays it is misleading and not fully correct...

upvoted 3 times

...

azurelearner666

3 years, 7 months ago

seems to be correct

upvoted 2 times

...

Exam AI-102 All Questions

View all questions & answers for the AI-102 exam

Exam AI-102 topic 4 question 3 discussion

Comments

syupwsh

krzkrzkra

reigenchimpo

anto69

reiwanotora

Murtuza

evangelist

famco

rdemontis

sl_mslconsulting

zellck

zellck

Eltooth

PHD_CHENG

prabhjot

azurelearner666

azurelearner666

SY0-701