You want to schedule a number of sequential load and transformation jobs. Data files will be added to a Cloud Storage bucket by an upstream process. There is no fixed schedule for when the new data arrives. Next, a Dataproc job is triggered to perform some transformations and write the data to BigQuery. You then need to run additional transformation jobs in BigQuery. The transformation jobs are different for every table. These jobs might take hours to complete. You need to determine the most efficient and maintainable workflow to process hundreds of tables and provide the freshest data to your end users. What should you do?
cuadradobertolinisebastiancami
Highly Voted 9 months ago8ad5266
Most Recent 5 months agoJyoGCP
9 months, 1 week agoMatt_108
10 months, 2 weeks agoJordan18
10 months, 3 weeks agocuadradobertolinisebastiancami
9 months agoAllenChen123
10 months, 2 weeks agoraaad
10 months, 3 weeks agoscaenruy
10 months, 4 weeks ago