Exam Certified Generative AI Engineer Associate topic 1 question 31 discussion

Actual exam question from Databricks's Certified Generative AI Engineer Associate

Question #: 31
Topic #: 1

[All Certified Generative AI Engineer Associate Questions]

After changing the response generating LLM in a RAG pipeline from GPT-4 to a model with a shorter context length that the company self-hosts, the Generative AI Engineer is getting the following error:

What TWO solutions should the Generative AI Engineer implement without changing the response generating model? (Choose two.)

A. Use a smaller embedding model to generate embeddings
B. Reduce the maximum output tokens of the new model
C. Decrease the chunk size of embedded documents
D. Reduce the number of records retrieved from the vector database
E. Retrain the response generating model using ALiBi

Show Suggested Answer

Suggested Answer: CD 🗳️

by trendy01 at Oct. 26, 2024, 5:58 a.m.

Comments

Submit Cancel

Lg22

1 month, 2 weeks ago

Selected Answer: CD

C,D should be

upvoted 1 times

...

trendy01

8 months, 1 week ago

Selected Answer: CD

C: You can reduce the number of tokens included in the prompt by reducing the chunk size of the embedded document. D: Reducing the number of records retrieved from the vector database can solve the token overflow issue by reducing the size of the final prompt.

upvoted 2 times

...