exam questions

Exam Professional Data Engineer All Questions

View all questions & answers for the Professional Data Engineer exam

Exam Professional Data Engineer topic 1 question 47 discussion

Actual exam question from Google's Professional Data Engineer
Question #: 47
Topic #: 1
[All Professional Data Engineer Questions]

You are designing the database schema for a machine learning-based food ordering service that will predict what users want to eat. Here is some of the information you need to store:
✑ The user profile: What the user likes and doesn't like to eat
✑ The user account information: Name, address, preferred meal times
✑ The order information: When orders are made, from where, to whom
The database will be used to store all the transactional data of the product. You want to optimize the data schema. Which Google Cloud Platform product should you use?

  • A. BigQuery
  • B. Cloud SQL
  • C. Cloud Bigtable
  • D. Cloud Datastore
Show Suggested Answer Hide Answer
Suggested Answer: A 🗳️

Comments

Chosen Answer:
This is a voting comment (?). It is better to Upvote an existing comment if you don't have anything to add.
Switch to a voting comment New
jvg637
Highly Voted 4 years, 9 months ago
You want to optimize the data schema + Machine Learning --> Bigquery. So A
upvoted 60 times
yoshik
3 years, 3 months ago
BigQuery is a datawarehouse, not a transactional db. You need to store transactional data as a requirement.
upvoted 27 times
alecuba16
2 years, 5 months ago
Biquery Supports transactions: https://cloud.google.com/bigquery/docs/reference/standard-sql/transactions , but indeed is not a good DB for OLTP. But I would said or CloudSQL or BigQuery
upvoted 4 times
...
alexmirmao
3 years, 2 months ago
In my opinion transactional data doesnt mean transactions they could be grouped so there is no need to write register by register.
upvoted 8 times
yoshik
3 years, 2 months ago
In other questions they talk about 'transactional log data' when referring to past transactions, but you could be right, agree. In that case ok A BigQuery. Nevertheless, the question is formulated ambiguously.
upvoted 5 times
...
...
...
...
[Removed]
Highly Voted 4 years, 9 months ago
Answer: Should be D - Datastore
upvoted 26 times
GeeBeeEl
4 years, 1 month ago
There is SQLML with BigQuery, you know that? You cannot optimize a schema in datastore, it is a NoSQL document database built for automatic scaling, high performance, and ease of application development. It does not work based on schemas!
upvoted 20 times
BigQuery
3 years ago
BQML is there. But, In question do they want to do ML on BQ?? Its saying just ML Based Company.
upvoted 7 times
...
...
cetanx
1 year, 6 months ago
It was also a difficult one for Chat GPT, it did give different answers each time I inquiry more about the question. After a few iterations, we also agreed on "D" :) - because; In the context of a food ordering service, storing data about what a user likes and doesn't like to eat can potentially involve a varied and dynamic set of data. Some users might have a long list of food preferences, while others might have only a few. Some users might update their likes and dislikes frequently, while others rarely or never. This kind of data is a good match for a NoSQL database like Datastore, which can easily accommodate such variations.
upvoted 5 times
...
...
sravi1200
Most Recent 2 days ago
Selected Answer: B
Cloud SQL can store transactional data not Big Query. Big Query is an analytical service.
upvoted 1 times
...
DGames
3 days, 10 hours ago
Selected Answer: A
Easy implement data schema + Machine Learning model in Big Query
upvoted 1 times
...
julydev82
1 week, 6 days ago
Selected Answer: B
database will be used to storage all transactional data.... I think that you need a relational database for that, then federated tables to bigquery to analysis.
upvoted 1 times
...
decipher9
1 month ago
For a machine learning-based food ordering service that needs to store transactional data, Cloud SQL is the most suitable option. Here's why: Cloud SQL is a fully-managed relational database service that supports transactional workloads, making it ideal for storing user profiles, account information, and order details. It provides strong consistency and supports complex queries, which are essential for managing and retrieving transactional data efficiently. While BigQuery is excellent for large-scale data analysis, it is not optimized for transactional data storage. Cloud Bigtable is designed for high-throughput and low-latency workloads but lacks the transactional capabilities needed for this use case. Cloud Datastore is a NoSQL database that supports transactions but is generally less powerful than a relational database for complex transactional schemas12. So, the best choice for your needs is B. Cloud SQL.
upvoted 1 times
...
SamuelTsch
2 months ago
Selected Answer: B
No idea why so many people go to A. But as transactional data, I think B is correct.
upvoted 1 times
...
baimus
3 months ago
Selected Answer: D
The details of the information definitely look suited to noSql to me, so that means C or D. Datastore is designed for this sort of thing - transactional nosql for an App. I took the question to mean "the machine learning app already exists" so the fact bigquery allows ML isn't relevant. It would be a leap to assume that the ML is done in Bigquery (I have a current Google ML pro cert, and this wouldn't say bigquery to me from that cert)
upvoted 1 times
...
Nittin
3 months, 3 weeks ago
Selected Answer: B
Cloud SQL is a fully-managed relational database service that supports MySQL, PostgreSQL, and SQL Server. It is well-suited for transactional workloads, allowing you to store structured data with relationships between different entities, such as users, orders, and profiles.
upvoted 1 times
...
39405bb
7 months ago
he best answer for this scenario is B. Cloud SQL. Here's why: Relational Data: The information you need to store (user profile, account information, order information) is highly structured and relational. Cloud SQL, being a relational database service, is designed to handle this type of data efficiently. Transactional Workloads: Food ordering involves transactional operations (placing orders, updating user preferences, etc.). Cloud SQL is optimized for transactional workloads, ensuring data consistency and integrity. Ease of Use: Cloud SQL is a managed service, meaning Google handles maintenance, updates, and backups, making it easier to manage than some other options. Integration with Machine Learning: Cloud SQL can easily integrate with other Google Cloud Platform products like BigQuery and Vertex AI, which are crucial for machine learning tasks.
upvoted 5 times
...
I__SHA1234567
9 months, 1 week ago
Selected Answer: A
BigQuery is a fully managed, serverless data warehouse that enables scalable analysis of large datasets. It is designed to handle large volumes of data and support complex queries, making it suitable for storing transactional data and performing analytics. With BigQuery, you can optimize your data schema and easily scale as your data grows. Additionally, BigQuery integrates well with other Google Cloud Platform services, including machine learning services, enabling you to build advanced analytics and predictive models on your transactional data.
upvoted 1 times
...
philli1011
10 months, 4 weeks ago
C It says that the database will be used to store the transactions data. BigQuery is not usually characterized as a data storage system. Also a databased is used for storing transactional Data not a Data Wharehouse.
upvoted 1 times
...
philli1011
10 months, 4 weeks ago
My choice is C It says "DataBase Schema" not DataWharehouse Schema. It didn't mention if the ML is to be done in the DataBase or not, it just states that a database is to be created.
upvoted 1 times
...
Camaro
1 year ago
I asked ChatGPT this question. It first answered Datastore. I said the question asks us to optimize the data schema and Datastore has no schema. Then it answered CloudSQL. I said the question asks about Machine learning aspect. Then it answered Bigquery. I said its a food ordering service and must need low latency Then it answered Bigtable. GPT is clearly not a good tool to use for the prep please avoid it. Its flawed currently that is DEC2023!
upvoted 3 times
LaxmanTiwari
1 year ago
Hello Camaro, when u going to take the exam ? am appearing on 23rd Dec and keen to know if this set is still valid ?
upvoted 1 times
...
...
rocky48
1 year, 1 month ago
Selected Answer: A
A. BigQuery -> Most Probably the answer because for analytic it should automatically be big query. But my question is why not others. I've used BigQuery and I know that it allows schema optimization as well as one of it's feature is built in machine learning. More here:https://cloud.google.com/bigquery/docs/introduction B. Cloud SQL -> I am not being about to find the doc for cloud sql where it talks about being able to optimize the schema and if it can be used for machine learning applications. C. Cloud Bigtable -> Possible answer because there's schema and can be used for machine learning application. Doc: https://cloud.google.com/bigtable/docs/overview D. Cloud Datastore -> Wrong because Datastore doesn’t have a schema. Doc: https://cloud.google.com/datastore/docs/concepts/overview
upvoted 1 times
...
A_Nasser
1 year, 2 months ago
Selected Answer: D
The right answer is D because Datastore is transactional, scalable, and can deliver an output to ML.
upvoted 1 times
...
Mark_86
1 year, 4 months ago
Selected Answer: B
Although you could argue with the formulation of the question, I did also read that this is about a transactional database which BigQuery is not. Thus I would go with Cloud SQL.
upvoted 1 times
...
Community vote distribution
A (35%)
C (25%)
B (20%)
Other
Most Voted
A voting comment increases the vote count for the chosen answer by one.

Upvoting a comment with a selected answer will also increase the vote count towards that answer by one. So if you see a comment that you already agree with, you can upvote it instead of posting a new comment.

SaveCancel
Loading ...
exam
Someone Bought Contributor Access for:
SY0-701
London, 1 minute ago