A junior data engineer has been asked to develop a streaming data pipeline with a grouped aggregation using DataFrame df. The pipeline needs to calculate the average humidity and average temperature for each non-overlapping five-minute interval. Events are recorded once per minute per device.
Streaming DataFrame df has the following schema:
"device_id INT, event_time TIMESTAMP, temp FLOAT, humidity FLOAT"
Code block:
Choose the response that correctly fills in the blank within the code block to complete this task.
imatheushenrique
2 months, 1 week agoJay_98_11
7 months agokz_data
7 months agoBIKRAM063
9 months, 2 weeks agosturcu
10 months agoEertyy
10 months, 3 weeks agothxsgod
11 months, 1 week ago