You are designing a cloud-native historical data processing system to meet the following conditions:
✑ The data being analyzed is in CSV, Avro, and PDF formats and will be accessed by multiple analysis tools including Dataproc, BigQuery, and Compute
Engine.
✑ A batch pipeline moves daily data.
✑ Performance is not a factor in the solution.
✑ The solution design should maximize availability.
How should you design data storage for this solution?
jkhong
Highly Voted 2 years, 6 months agoserch_engine
Most Recent 7 months, 1 week agoAzureDP900
2 years, 3 months agodconesoko
2 years, 4 months agozellck
2 years, 4 months agodevaid
2 years, 6 months agokenanars
2 years, 7 months ago