You created an analytics environment on Google Cloud so that your data scientist team can explore data without impacting the on-premises Apache Hadoop solution. The data in the on-premises Hadoop Distributed File System (HDFS) cluster is in Optimized Row Columnar (ORC) formatted files with multiple columns of Hive partitioning. The data scientist team needs to be able to explore the data in a similar way as they used the on-premises HDFS cluster with SQL on the Hive query engine. You need to choose the most cost-effective storage and processing solution. What should you do?
raaad
Highly Voted 10 months, 2 weeks agonadavw
2 months, 3 weeks agoSamuelTsch
Most Recent 3 weeks, 2 days agokaisarfarel
8 months, 1 week agohanoverquay
8 months, 1 week ago0725f1f
8 months, 2 weeks agoJyoGCP
9 months agoMatt_108
10 months, 2 weeks agoSmakyel79
10 months, 2 weeks ago