Exam Professional Data Engineer topic 1 question 145 discussion

Actual exam question from Google's Professional Data Engineer
Question #: 145
Topic #: 1

You receive data files in CSV format monthly from a third party. You need to cleanse this data, but every third month the schema of the files changes. Your requirements for implementing these transformations include:
  • Executing the transformations on a schedule
  • Enabling non-developer analysts to modify transformations
  • Providing a graphical tool for designing transformations
What should you do?

  • A. Use Dataprep by Trifacta to build and maintain the transformation recipes, and execute them on a scheduled basis
  • B. Load each month's CSV data into BigQuery, and write a SQL query to transform the data to a standard schema. Merge the transformed tables together with a SQL query
  • C. Help the analysts write a Dataflow pipeline in Python to perform the transformation. The Python code should be stored in a revision control system and modified as the incoming data's schema changes
  • D. Use Apache Spark on Dataproc to infer the schema of the CSV file before creating a Dataframe. Then implement the transformations in Spark SQL before writing the data out to Cloud Storage and loading into BigQuery
Suggested Answer: A
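
For contrast, here is a minimal sketch of what a code-based approach (in the spirit of options C and D) would involve each time the file layout changes. It is only an illustration, not part of the exam material: the column names, the `SCHEMA_MAPPINGS` table, and the `standardize` helper are all hypothetical, and a real Dataflow or Spark job would look different. The point is that every schema change means someone has to edit code.

```python
import csv
import io

# Hypothetical mapping from each known header layout to a standard schema.
# Each time the third party changes the CSV layout, a new entry must be
# added here by someone comfortable editing code.
SCHEMA_MAPPINGS = {
    frozenset({"cust_id", "amt"}): {"cust_id": "customer_id", "amt": "amount"},
    frozenset({"CustomerID", "Amount"}): {"CustomerID": "customer_id", "Amount": "amount"},
}


def standardize(csv_text):
    """Rename columns to the standard schema and trim whitespace from values."""
    reader = csv.DictReader(io.StringIO(csv_text))
    header = frozenset(reader.fieldnames or [])
    mapping = SCHEMA_MAPPINGS.get(header)
    if mapping is None:
        # An unseen layout cannot be processed until the mapping is updated.
        raise ValueError(f"Unknown schema: {sorted(header)}")
    return [
        {mapping[column]: (value or "").strip() for column, value in row.items()}
        for row in reader
    ]
```

With Dataprep, the equivalent of updating `SCHEMA_MAPPINGS` is done by an analyst editing a recipe in the graphical interface, which is what the second and third requirements ask for.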

Comments

madhu1171
Highly Voted 4 years, 7 months ago
A should be the answer
upvoted 35 times
...
[Removed]
Highly Voted 4 years, 7 months ago
Answer: A. Description: Dataprep is designed for non-developers.
upvoted 18 times
...
CGS22
Most Recent 6 months, 3 weeks ago
Selected Answer: A
A. Use Dataprep by Trifacta to build and maintain the transformation recipes, and execute them on a scheduled basis.
Addresses requirements:
  • Scheduled Execution: Dataprep supports running transformations on a schedule.
  • Analyst-Friendly: Dataprep's visual interface is designed for non-developer analysts to build and modify transformations easily.
  • Graphical Tool: It provides a drag-and-drop environment for designing data transformations.
  • Schema Flexibility: Dataprep can handle schema changes. Analysts can adapt recipes using the visual interface.
upvoted 2 times
...
barnac1es
1 year, 1 month ago
Selected Answer: A
  • Scheduled Transformations: Dataprep by Trifacta allows you to design and schedule transformation recipes to process data on a regular basis. You can automate the data cleansing process by scheduling it to run monthly.
  • User-Friendly Interface: Dataprep provides a graphical interface that enables non-developer analysts to design, modify, and maintain transformation recipes without writing code.
  • Transformation Flexibility: Dataprep supports flexible data transformations, making it suitable for scenarios where the schema of the incoming data changes. Analysts can adapt the transformations to new schemas using the visual tools provided by Dataprep.
upvoted 1 times
...
vaga1
1 year, 5 months ago
Selected Answer: A
The requirement to provide a graphical tool for designing transformations is enough to choose A.
upvoted 4 times
...
Dhruv28
1 year, 8 months ago
A similar question: Your company receives a lot of financial data in CSV files. The files need to be processed, cleaned, and transformed before they are made available for analytics. The schema of the data also changes every third month. The data analysts should be able to perform these tasks under two conditions: 1. They need no prior knowledge of any programming language and write no code. 2. They are provided with a GUI tool to build and modify the schema. What solution best fits the need?
upvoted 1 times
...
zellck
1 year, 11 months ago
Selected Answer: A
A is the answer.
https://cloud.google.com/dataprep
Dataprep by Trifacta is an intelligent data service for visually exploring, cleaning, and preparing structured and unstructured data for analysis, reporting, and machine learning. Because Dataprep is serverless and works at any scale, there is no infrastructure to deploy or manage. Your next ideal data transformation is suggested and predicted with each UI input, so you don't have to write code.
upvoted 3 times
...
arpitagrawal
2 years, 1 month ago
Selected Answer: A
non-developer analysts
upvoted 2 times
...
devdimidved
2 years, 5 months ago
Selected Answer: A
Dataprep is for non-developers.
upvoted 1 times
...
amitsingla012
2 years, 5 months ago
Selected Answer: A
Option A -- Dataprep is the right answer
upvoted 1 times
...
Prasanna_kumar
2 years, 8 months ago
Answer is A
upvoted 1 times
...
MaxNRG
2 years, 9 months ago
Selected Answer: A
A: https://cloud.google.com/dataprep/
upvoted 2 times
...
medeis_jar
2 years, 9 months ago
Selected Answer: A
Cloud Dataprep is a tool to do the job.
upvoted 1 times
...
JG123
2 years, 11 months ago
Why are there so many wrong answers? Examtopics.com, are you enjoying paid subscriptions by giving random answers from people? Ans: A
upvoted 7 times
duytran_d
2 years, 3 months ago
This comment is being repeated, and I really appreciate this feeling :D
upvoted 1 times
...
...
sandipk91
3 years, 2 months ago
vote for option A
upvoted 4 times
...
sumanshu
3 years, 3 months ago
Vote for 'A' because of the requirement: enabling non-developer analysts to modify transformations.
upvoted 5 times
...
haroldbenites
4 years, 2 months ago
A is correct
upvoted 3 times
...
Community vote distribution
A (35%)
C (25%)
B (20%)
Other