You have built a custom model that performs several memory-intensive preprocessing tasks before it makes a prediction. You deployed the model to a Vertex AI endpoint, and validated that results were received in a reasonable amount of time. After routing user traffic to the endpoint, you discover that the endpoint does not autoscale as expected when receiving multiple requests. What should you do?
b1a8fae
Highly Voted 10 months, 2 weeks agob1a8fae
10 months, 2 weeks agopikachu007
Highly Voted 10 months, 2 weeks agoVinaoSilva
Most Recent 5 months agofitri001
7 months, 1 week agofitri001
7 months, 1 week agoguilhermebutzke
9 months, 2 weeks ago