You are building a new application that you need to collect data from in a scalable way. Data arrives continuously from the application throughout the day, and you expect to generate approximately 150 GB of JSON data per day by the end of the year. Your requirements are:
✑ Decoupling producer from consumer
✑ Space and cost-efficient storage of the raw ingested data, which is to be stored indefinitely
✑ Near real-time SQL query
✑ Maintain at least 2 years of historical data, which will be queried with SQL
Which pipeline should you use to meet these requirements?
[Removed]
Highly Voted 4 years, 8 months ago[Removed]
Highly Voted 4 years, 8 months agoedre
Most Recent 4 months, 1 week agojuliorevk
1 year, 2 months agobarnac1es
1 year, 2 months agoFP77
1 year, 3 months agovaga1
1 year, 5 months agoforepick
1 year, 5 months agoOberstK
1 year, 9 months agodesertlotus1211
1 year, 10 months agoAzureDP900
1 year, 11 months agozellck
1 year, 12 months agombacelar
2 years agoclouditis
2 years, 2 months agoPrasanna_kumar
2 years, 9 months agoMaxNRG
2 years, 10 months agomedeis_jar
2 years, 10 months ago