Exam Certified Generative AI Engineer Associate topic 1 question 8 discussion

Actual exam question from Databricks's Certified Generative AI Engineer Associate

Question #: 8
Topic #: 1

[All Certified Generative AI Engineer Associate Questions]

A Generative Al Engineer is creating an LLM-based application. The documents for its retriever have been chunked to a maximum of 512 tokens each. The Generative Al Engineer knows that cost and latency are more important than quality for this application. They have several context length levels to choose from.
Which will fulfill their need?

A. context length 514; smallest model is 0.44GB and embedding dimension 768
B. context length 2048: smallest model is 11GB and embedding dimension 2560
C. context length 32768: smallest model is 14GB and embedding dimension 4096
D. context length 512: smallest model is 0.13GB and embedding dimension 384

Show Suggested Answer

Suggested Answer: D 🗳️

by srihdar at Oct. 19, 2024, 4:39 p.m.

Comments

Submit Cancel

Qix

2 months, 3 weeks ago

Selected Answer: D

D is the correct solution, as the example present in the official Databricks exam guide

upvoted 1 times

...

Arifai900

4 months, 3 weeks ago

Selected Answer: D

D is correct.

upvoted 1 times

...

awron_durat

5 months, 3 weeks ago

Selected Answer: D

D is correct.

upvoted 1 times

...

srihdar

6 months ago

D is the answer Since cost and latency are more important than quality for this application, the smallest model with the shortest context length (512 tokens) will be the most efficient in terms of resource usage and speed. This option provides lower latency and cost compared to the larger models, making it a suitable choice for the engineer’s requirements.

upvoted 2 times

...

Exam Certified Generative AI Engineer Associate All Questions

View all questions & answers for the Certified Generative AI Engineer Associate exam

Exam Certified Generative AI Engineer Associate topic 1 question 8 discussion

Comments

Qix

Arifai900

awron_durat

srihdar

SY0-701