You need to modernize your existing on-premises data strategy. Your organization currently uses:
• Apache Hadoop clusters for processing multiple large data sets, including on-premises Hadoop Distributed File System (HDFS) for data replication.
• Apache Airflow to orchestrate hundreds of ETL pipelines with thousands of job steps.
You need to set up a new architecture in Google Cloud that can handle your Hadoop workloads and requires minimal changes to your existing orchestration processes. What should you do?