Exam Certified Generative AI Engineer Associate topic 1 question 37 discussion

Actual exam question from Databricks's Certified Generative AI Engineer Associate

Question #: 37
Topic #: 1

[All Certified Generative AI Engineer Associate Questions]

A Generative AI Engineer developed an LLM application using the provisioned throughput Foundation Model API. Now that the application is ready to be deployed, they realize their volume of requests are not sufficiently high enough to create their own provisioned throughput endpoint. They want to choose a strategy that ensures the best cost-effectiveness for their application.
What strategy should the Generative AI Engineer use?

A. Switch to using External Models instead
B. Deploy the model using pay-per-token throughput as it comes with cost guarantees
C. Change to a model with a fewer number of parameters in order to reduce hardware constraint issues
D. Throttle the incoming batch of requests manually to avoid rate limiting issues

Show Suggested Answer

Suggested Answer: B 🗳️

by trendy01 at Oct. 26, 2024, 6:21 a.m.

Comments

Submit Cancel

trendy01

5 months, 3 weeks ago

Selected Answer: B

B appears to be the most appropriate choice. A strategy that can maximize cost effectiveness in low request volume situations is to deploy in a usage-based token throughput manner.

upvoted 1 times

...

Exam Certified Generative AI Engineer Associate All Questions

View all questions & answers for the Certified Generative AI Engineer Associate exam

Exam Certified Generative AI Engineer Associate topic 1 question 37 discussion

Comments

trendy01

SY0-701