A junior data engineer has been asked to develop a streaming data pipeline with a grouped aggregation using DataFrame df. The pipeline needs to calculate the average humidity and average temperature for each non-overlapping five-minute interval. Events are recorded once per minute per device.
Streaming DataFrame df has the following schema:
"device_id INT, event_time TIMESTAMP, temp FLOAT, humidity FLOAT"
Code block:
Choose the response that correctly fills in the blank within the code block to complete this task.
imatheushenrique
5 months, 4 weeks agoJay_98_11
10 months, 2 weeks agokz_data
10 months, 2 weeks agoBIKRAM063
1 year agosturcu
1 year, 1 month agoEertyy
1 year, 2 months agothxsgod
1 year, 2 months ago