Welcome to ExamTopics
ExamTopics Logo
- Expert Verified, Online, Free.
exam questions

Exam Professional Cloud Architect All Questions

View all questions & answers for the Professional Cloud Architect exam

Exam Professional Cloud Architect topic 1 question 48 discussion

Actual exam question from Google's Professional Cloud Architect
Question #: 48
Topic #: 1
[All Professional Cloud Architect Questions]

Your company has multiple on-premises systems that serve as sources for reporting. The data has not been maintained well and has become degraded over time.
You want to use Google-recommended practices to detect anomalies in your company data. What should you do?

  • A. Upload your files into Cloud Storage. Use Cloud Datalab to explore and clean your data.
  • B. Upload your files into Cloud Storage. Use Cloud Dataprep to explore and clean your data.
  • C. Connect Cloud Datalab to your on-premises systems. Use Cloud Datalab to explore and clean your data.
  • D. Connect Cloud Dataprep to your on-premises systems. Use Cloud Dataprep to explore and clean your data.
Show Suggested Answer Hide Answer
Suggested Answer: B 🗳️

Comments

Chosen Answer:
This is a voting comment (?) , you can switch to a simple comment.
Switch to a voting comment New
JohnWick2020
Highly Voted 3 years, 7 months ago
Answer is B: Keynotes from question: 1- On-premise data sources 2- Unfit data; not well maintained and degraded 3- Google-recommended best practice to "detect anomalies" <<-Very important. Explanation: A & C - incorrect; Datalab does not provide anomaly detection OOTB. It is used more for data science scenarios like interactive data analysis and build ML models. B - CORRECT; DataPrep OOTB provides for fast exploration and anomaly detection and lists cloud storage as an ingestion medium. Refer to ELT pipeline architecture here = https://cloud.google.com/dataprep D - incorrect; At this time DataPrep cannot connect to SaaS or on-premise source. Not to be confused for DataFlow which can!
upvoted 58 times
...
Eroc
Highly Voted 5 years ago
Both B and D work, because the question says "Google's Best Practices" uploading the files first would keep the original copies Google encrypted and stored.
upvoted 12 times
skywalker
4 years, 6 months ago
Both of them works....
upvoted 1 times
Musk
4 years, 3 months ago
You can't connect DataPrep to your on-prem systems. You simply upload a file, but that is not connecting it to your systems. Because of that, I'd discard D and stay with B.
upvoted 9 times
...
...
tartar
4 years, 3 months ago
B is ok
upvoted 9 times
...
nitinz
3 years, 8 months ago
B, dataprep = visually explore, clean, and prepare data for analysis
upvoted 7 times
...
AzureDP900
2 years, 1 month ago
B is better choice
upvoted 1 times
...
...
Ekramy_Elnaggar
Most Recent 2 days, 15 hours ago
Selected Answer: B
Why Cloud Storage is important ? 1. Centralized repository: Cloud Storage provides a secure and scalable place to store your data. This makes it accessible to various GCP services. 2. Data lake concept: This aligns with the idea of a data lake, where you bring raw data into a central location before processing and refining it. Why Cloud Dataprep is a good fit ? 1. Visual data exploration: Dataprep excels at helping you quickly understand your data through visualizations and profiling. This is crucial for identifying anomalies. 2. Data cleaning and transformation: Dataprep makes it easy to clean and standardize your data, which is essential before anomaly detection. Inconsistent formats, missing values, and errors can skew your analysis. 3. Built-in anomaly detection: Dataprep has features specifically designed to help you find anomalies. It can highlight unusual values, outliers, and patterns.
upvoted 1 times
...
snehaso
3 months, 1 week ago
Datalab was shutdown. Its replacement is vertex AI. Read question accordingly
upvoted 1 times
...
thewalker
1 year ago
Cloud Datalab is a powerful interactive tool created to explore, analyze, transform, and visualize data and build machine learning models on Google Cloud Platform. Dataprep by Trifacta is an intelligent data service for visually exploring, cleaning, and preparing structured and unstructured data for analysis, reporting, and machine learning. Dataprep do not have an integration for on-prem: https://console.cloud.google.com/marketplace/product/endpoints/cloud-dataprep-editions-v2?project=fast-art-401415 So, clearly, the only option left is B.
upvoted 3 times
...
heretolearnazure
1 year, 3 months ago
B is correct.
upvoted 1 times
...
n_nana
1 year, 8 months ago
Today, data ingestion to DataPrep can be Application, file upload, database. so B is also now valid
upvoted 1 times
...
omermahgoub
1 year, 11 months ago
The recommended approach for detecting anomalies in your company data using Google-recommended practices is option B: Upload your files into Cloud Storage. Use Cloud Dataprep to explore and clean your data. Cloud Storage is a highly scalable, durable, and secure object storage service that can be used to store and retrieve data from anywhere on the web. You can use Cloud Storage to store your company data files and make them available for analysis. Cloud Dataprep is a fully managed data preparation service that allows you to quickly and easily explore, clean, and transform your data for analysis. It can help you detect anomalies in your data by providing features such as data profiling, data cleansing, and data transformation.
upvoted 2 times
omermahgoub
1 year, 11 months ago
Option A: Using Cloud Datalab to explore and clean your data is not a recommended approach, as Cloud Datalab is a collaborative data exploration and visualization platform that is not specifically designed for data preparation tasks such as cleansing and transformation. Option C: Connecting Cloud Datalab to your on-premises systems is not a recommended approach, as Cloud Datalab is a collaborative data exploration and visualization platform and is not designed for data preparation tasks such as cleansing and transformation. Option D: Connecting Cloud Dataprep to your on-premises systems is not necessary, as you can use Cloud Dataprep to explore and clean data stored in Cloud Storage.
upvoted 1 times
...
...
allen_y_q_huang
1 year, 11 months ago
ok for B & D, but B is suitable to gcp
upvoted 1 times
...
Smaks
1 year, 11 months ago
Selected Answer: B
Datalab is deprecated : https://cloud.google.com/datalab/docs New Cloud Dataprep options will give connectivity to relational databases, business applications and extend our integrations across Google Cloud with Google Sheets: https://www.trifacta.com/blog/cloud-dataprep-trifacta/
upvoted 1 times
...
megumin
2 years ago
Selected Answer: B
ok for B
upvoted 1 times
...
Cloudexplorer
2 years, 4 months ago
Could anyone provide a link where it explicitly says that Datprep does not connect to on-premises data sources. In the ingestion layer on the diagram at https://cloud.google.com/dataprep it shows databases as a source. I can't see anywhere that there is a limitation connecting to on-premises. Would be great if someone could share that.
upvoted 3 times
...
BigSteveO
2 years, 4 months ago
Selected Answer: B
It's gotta be B.
upvoted 1 times
...
Dhiraj03
2 years, 5 months ago
Keyword : Anamolies Data prep is the only product ... So options A and C is eliminated ... Cost effective is storing the data in GCS Cloud storage ... So option is B
upvoted 1 times
...
nkit
2 years, 7 months ago
Selected Answer: B
Dataprep to detect anomalies in Data is the right choice.
upvoted 1 times
...
GMats
2 years, 10 months ago
B...It supports only CloudStorage and Bigquery..."So you can start transforming datasets, you hereby instruct Google to allow Trifacta, who provides the service Dataprep in collaboration with Google, to view and modify project data in Cloud Storage and BigQuery, run Dataflow jobs, and use all project service accounts."
upvoted 1 times
...
haroldbenites
2 years, 11 months ago
Go for B.
upvoted 1 times
haroldbenites
2 years, 11 months ago
The question says “best practice”. In GCP , a best practice for many use cases is load to cloud storage and then processing data.
upvoted 1 times
...
...
Community vote distribution
A (35%)
C (25%)
B (20%)
Other
Most Voted
A voting comment increases the vote count for the chosen answer by one.

Upvoting a comment with a selected answer will also increase the vote count towards that answer by one. So if you see a comment that you already agree with, you can upvote it instead of posting a new comment.

SaveCancel
Loading ...