You have deployed a scikit-team model to a Vertex AI endpoint using a custom model server. You enabled autoscaling: however, the deployed model fails to scale beyond one replica, which led to dropped requests. You notice that CPU utilization remains low even during periods of high load. What should you do?
sonicclasps
Highly Voted 9 months, 3 weeks agosonicclasps
9 months, 3 weeks agof084277
Most Recent 6 days, 21 hours agofitri001
7 months, 1 week agopinimichele01
7 months, 1 week agopinimichele01
7 months agoCarlose2108
8 months, 4 weeks agoguilhermebutzke
9 months, 1 week agopikachu007
10 months, 1 week agoBlehMaks
10 months, 1 week agoguilhermebutzke
9 months, 1 week agoasmgi
4 months, 1 week ago