You are designing a cloud-native historical data processing system to meet the following conditions:
✑ The data being analyzed is in CSV, Avro, and PDF formats and will be accessed by multiple analysis tools including Dataproc, BigQuery, and Compute
Engine.
✑ A batch pipeline moves daily data.
✑ Performance is not a factor in the solution.
✑ The solution design should maximize availability.
How should you design data storage for this solution?
jkhong
Highly Voted 2 years, 1 month agoserch_engine
Most Recent 2 months agoAzureDP900
1 year, 11 months agodconesoko
1 year, 11 months agozellck
1 year, 11 months agodevaid
2 years, 1 month agokenanars
2 years, 2 months ago