Welcome to ExamTopics
ExamTopics Logo
- Expert Verified, Online, Free.
exam questions

Exam Certified Generative AI Engineer Associate All Questions

View all questions & answers for the Certified Generative AI Engineer Associate exam

Exam Certified Generative AI Engineer Associate topic 1 question 34 discussion

Actual exam question from Databricks's Certified Generative AI Engineer Associate
Question #: 34
Topic #: 1
[All Certified Generative AI Engineer Associate Questions]

A Generative Al Engineer has successfully ingested unstructured documents and chunked them by document sections. They would like to store the chunks in a Vector Search index. The current format of the dataframe has two columns: (i) original document file name (ii) an array of text chunks for each document.
What is the most performant way to store this dataframe?

  • A. Split the data into train and test set, create a unique identifier for each document, then save to a Delta table
  • B. Flatten the dataframe to one chunk per row, create a unique identifier for each row, and save to a Delta table
  • C. First create a unique identifier for each document, then save to a Delta table
  • D. Store each chunk as an independent JSON file in Unity Catalog Volume. For each JSON file, the key is the document section name and the value is the array of text chunks for that section
Show Suggested Answer Hide Answer
Suggested Answer: B 🗳️

Comments

Chosen Answer:
This is a voting comment (?) , you can switch to a simple comment.
Switch to a voting comment New
trendy01
4 weeks, 1 day ago
Selected Answer: B
B appears to be the most appropriate choice. By flattening your dataframe into one row per chunk and creating a unique identifier for each row, you can maximize search performance and maintain a manageable structure for your data.
upvoted 1 times
...
Community vote distribution
A (35%)
C (25%)
B (20%)
Other
Most Voted
A voting comment increases the vote count for the chosen answer by one.

Upvoting a comment with a selected answer will also increase the vote count towards that answer by one. So if you see a comment that you already agree with, you can upvote it instead of posting a new comment.

SaveCancel
Loading ...