Exam Professional Data Engineer topic 1 question 47 discussion

Actual exam question from Google's Professional Data Engineer
Question #: 47
Topic #: 1

You are designing the database schema for a machine learning-based food ordering service that will predict what users want to eat. Here is some of the information you need to store:
✑ The user profile: What the user likes and doesn't like to eat
✑ The user account information: Name, address, preferred meal times
✑ The order information: When orders are made, from where, to whom
The database will be used to store all the transactional data of the product. You want to optimize the data schema. Which Google Cloud Platform product should you use?

  • A. BigQuery
  • B. Cloud SQL
  • C. Cloud Bigtable
  • D. Cloud Datastore
Suggested Answer: A

Comments

jvg637
Highly Voted 4 years, 10 months ago
You want to optimize the data schema + machine learning --> BigQuery. So A.
upvoted 61 times
yoshik
3 years, 4 months ago
BigQuery is a data warehouse, not a transactional DB. Storing transactional data is a requirement here.
upvoted 28 times
alecuba16
2 years, 6 months ago
BigQuery supports transactions: https://cloud.google.com/bigquery/docs/reference/standard-sql/transactions , but indeed it is not a good DB for OLTP. I would say either Cloud SQL or BigQuery.
upvoted 4 times
...
alexmirmao
3 years, 4 months ago
In my opinion, transactional data doesn't mean transactions. The records could be grouped, so there is no need to write them record by record.
upvoted 8 times
yoshik
3 years, 3 months ago
In other questions they talk about 'transactional log data' when referring to past transactions, but you could be right, agree. In that case ok A BigQuery. Nevertheless, the question is formulated ambiguously.
upvoted 5 times
...
...
...
...
[Removed]
Highly Voted 4 years, 10 months ago
Answer: Should be D - Datastore
upvoted 26 times
GeeBeeEl
4 years, 2 months ago
There is ML in SQL with BigQuery (BigQuery ML), you know that? You cannot optimize a schema in Datastore; it is a NoSQL document database built for automatic scaling, high performance, and ease of application development. It does not work based on schemas!
upvoted 20 times
BigQuery
3 years, 2 months ago
BQML is there. But does the question say they want to do ML on BQ? It just says an ML-based company.
upvoted 7 times
...
...
cetanx
1 year, 8 months ago
It was also a difficult one for ChatGPT; it gave different answers each time I asked more about the question. After a few iterations, we also agreed on "D" :) because, in the context of a food ordering service, storing data about what a user likes and doesn't like to eat can potentially involve a varied and dynamic set of data. Some users might have a long list of food preferences, while others might have only a few. Some users might update their likes and dislikes frequently, while others rarely or never. This kind of data is a good match for a NoSQL database like Datastore, which can easily accommodate such variations.
upvoted 5 times
...
...
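To make the "varied and dynamic" argument in the comment above concrete, here is a minimal sketch of what schemaless profile records look like. Plain Python dicts stand in for Datastore entities; the user names, property names, and the `users_who_like` helper are all hypothetical and are not taken from the question or from the Datastore API.

```python
# Plain dicts stand in for document-style entities (an assumption for
# illustration only; this does not call the Datastore client library).
profiles = [
    {"user": "ana", "likes": ["sushi", "ramen", "tacos"], "dislikes": ["olives"]},
    {"user": "ben", "likes": ["pizza"]},  # no "dislikes" property at all
    {"user": "cy", "dislikes": ["cilantro"], "allergies": ["peanuts"]},  # extra property
]


def users_who_like(profiles, food):
    """Return the users whose profile lists `food` under "likes".

    Missing properties are treated as empty, so every entity can carry
    a different set of properties without breaking the lookup.
    """
    return [p["user"] for p in profiles if food in p.get("likes", [])]


# Each entity has its own set of property names.
property_sets = {p["user"]: sorted(p) for p in profiles}
```

Each record carries only the properties that user actually has, which is the flexibility the comment is pointing at. The trade-off raised elsewhere in the thread still applies: nothing here enforces a fixed structure across records.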
cqrm3n
Most Recent 4 days, 11 hours ago
Selected Answer: B
The answer should be Cloud SQL because it is a relational database suitable for transactional data. BigQuery is for analytics and querying, and is not suitable for transactional workloads. Bigtable is for unstructured and time-series data. Datastore is a NoSQL document database for semi-structured data.
upvoted 1 times
...
Yad_datatonic
2 weeks ago
Selected Answer: A
For a machine learning-based food ordering service that requires optimised storage of transactional data, Google Cloud BigQuery is a suitable choice
upvoted 1 times
...
grshankar9
2 weeks, 4 days ago
Selected Answer: A
Within Google Cloud, the database that most readily allows for data schema optimization is BigQuery; it provides features like schema auto-detection, columnar storage, and the ability to manually define your schema to tailor it for efficient querying and analysis of large datasets.
upvoted 1 times
...
manikolbe
4 weeks, 1 day ago
Selected Answer: B
Cloud SQL is the best choice for your application as it provides relational database management and is optimized for storing transactional data with SQL querying capabilities. It is well-suited for managing user profiles, account information, and orders, ensuring data integrity, and supporting complex queries necessary for the food ordering service.
upvoted 3 times
...
Ronn27
1 month ago
Selected Answer: B
Use BigQuery for analyzing aggregated data (e.g., predicting food trends or training ML models). Use Cloud Bigtable for large-scale real-time recommendation engines if needed in the future. Use Firestore for dynamic, semi-structured data with real-time updates if you need flexibility over transactional consistency. Cloud SQL strikes the right balance for this use case due to its support for structured data, transactions, and easy integration with other GCP services. So B, Cloud SQL, is the right answer.
upvoted 1 times
...
sravi1200
1 month, 2 weeks ago
Selected Answer: B
Cloud SQL can store transactional data; BigQuery cannot. BigQuery is an analytical service.
upvoted 1 times
...
DGames
1 month, 2 weeks ago
Selected Answer: A
Easy to implement the data schema + machine learning model in BigQuery.
upvoted 1 times
...
julydev82
1 month, 4 weeks ago
Selected Answer: B
The database will be used to store all transactional data... I think you need a relational database for that, then federated tables in BigQuery for analysis.
upvoted 1 times
...
decipher9
2 months, 3 weeks ago
For a machine learning-based food ordering service that needs to store transactional data, Cloud SQL is the most suitable option. Here's why: Cloud SQL is a fully-managed relational database service that supports transactional workloads, making it ideal for storing user profiles, account information, and order details. It provides strong consistency and supports complex queries, which are essential for managing and retrieving transactional data efficiently. While BigQuery is excellent for large-scale data analysis, it is not optimized for transactional data storage. Cloud Bigtable is designed for high-throughput and low-latency workloads but lacks the transactional capabilities needed for this use case. Cloud Datastore is a NoSQL database that supports transactions but is generally less powerful than a relational database for complex transactional schemas. So, the best choice for your needs is B. Cloud SQL.
upvoted 1 times
...
SamuelTsch
3 months, 2 weeks ago
Selected Answer: B
No idea why so many people go for A. But since this is transactional data, I think B is correct.
upvoted 1 times
...
baimus
4 months, 2 weeks ago
Selected Answer: D
The details of the information definitely look suited to NoSQL to me, so that means C or D. Datastore is designed for this sort of thing: transactional NoSQL for an app. I took the question to mean "the machine learning app already exists", so the fact that BigQuery allows ML isn't relevant. It would be a leap to assume that the ML is done in BigQuery (I have a current Google ML pro cert, and this wouldn't say BigQuery to me from that cert).
upvoted 1 times
...
Nittin
5 months, 1 week ago
Selected Answer: B
Cloud SQL is a fully-managed relational database service that supports MySQL, PostgreSQL, and SQL Server. It is well-suited for transactional workloads, allowing you to store structured data with relationships between different entities, such as users, orders, and profiles.
upvoted 1 times
...
39405bb
8 months, 3 weeks ago
The best answer for this scenario is B. Cloud SQL. Here's why: Relational Data: The information you need to store (user profile, account information, order information) is highly structured and relational. Cloud SQL, being a relational database service, is designed to handle this type of data efficiently. Transactional Workloads: Food ordering involves transactional operations (placing orders, updating user preferences, etc.). Cloud SQL is optimized for transactional workloads, ensuring data consistency and integrity. Ease of Use: Cloud SQL is a managed service, meaning Google handles maintenance, updates, and backups, making it easier to manage than some other options. Integration with Machine Learning: Cloud SQL can easily integrate with other Google Cloud Platform products like BigQuery and Vertex AI, which are crucial for machine learning tasks.
upvoted 5 times
...
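For readers weighing the relational argument in the comment above, here is a minimal sketch of how the three kinds of information from the question could map to related tables with transactional writes. SQLite (from the Python standard library) stands in for Cloud SQL only so the sketch runs locally; every table name, column name, sample value, and the `place_order` helper are hypothetical and not part of the question.

```python
import sqlite3

# SQLite is a local stand-in for a relational service such as Cloud SQL
# (an assumption for illustration). All names below are hypothetical.
conn = sqlite3.connect(":memory:")
conn.execute("PRAGMA foreign_keys = ON")  # enforce REFERENCES constraints
conn.executescript("""
CREATE TABLE users (              -- account information
    user_id INTEGER PRIMARY KEY,
    name TEXT NOT NULL,
    address TEXT NOT NULL,
    preferred_meal_time TEXT
);
CREATE TABLE food_preferences (   -- user profile: likes and dislikes
    user_id INTEGER NOT NULL REFERENCES users(user_id),
    food TEXT NOT NULL,
    likes INTEGER NOT NULL CHECK (likes IN (0, 1)),
    PRIMARY KEY (user_id, food)
);
CREATE TABLE orders (             -- order information: when, from where, to whom
    order_id INTEGER PRIMARY KEY,
    user_id INTEGER NOT NULL REFERENCES users(user_id),
    restaurant TEXT NOT NULL,
    recipient TEXT NOT NULL,
    ordered_at TEXT NOT NULL
);
""")


def place_order(conn, user_id, restaurant, recipient, ordered_at):
    """Insert one order inside a transaction and return its id."""
    with conn:  # commits on success, rolls back if an exception is raised
        cur = conn.execute(
            "INSERT INTO orders (user_id, restaurant, recipient, ordered_at) "
            "VALUES (?, ?, ?, ?)",
            (user_id, restaurant, recipient, ordered_at),
        )
        return cur.lastrowid


with conn:
    conn.execute("INSERT INTO users VALUES (1, 'Ada', '1 Main St', '19:00')")
    conn.execute("INSERT INTO food_preferences VALUES (1, 'sushi', 1)")

order_id = place_order(conn, 1, "Sushi Bar", "Ada", "2024-01-01T19:05:00")

# An order for a user that does not exist violates the foreign key,
# so the insert is rejected and rolled back.
rejected = False
try:
    place_order(conn, 999, "Sushi Bar", "Nobody", "2024-01-01T19:06:00")
except sqlite3.IntegrityError:
    rejected = True

order_count = conn.execute("SELECT COUNT(*) FROM orders").fetchone()[0]
```

The foreign keys and the per-order transaction are what commenters favouring B mean by data integrity for transactional writes. Analytics or model training over this data would still happen elsewhere, as several other comments suggest.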
I__SHA1234567
10 months, 4 weeks ago
Selected Answer: A
BigQuery is a fully managed, serverless data warehouse that enables scalable analysis of large datasets. It is designed to handle large volumes of data and support complex queries, making it suitable for storing transactional data and performing analytics. With BigQuery, you can optimize your data schema and easily scale as your data grows. Additionally, BigQuery integrates well with other Google Cloud Platform services, including machine learning services, enabling you to build advanced analytics and predictive models on your transactional data.
upvoted 1 times
...
philli1011
1 year ago
C. It says that the database will be used to store the transactional data. BigQuery is not usually characterized as a data storage system. Also, a database is used for storing transactional data, not a data warehouse.
upvoted 1 times
...
Community vote distribution: A (35%, most voted), C (25%), B (20%), Other