Welcome to ExamTopics
ExamTopics Logo
- Expert Verified, Online, Free.
exam questions

Exam Professional Cloud Architect All Questions

View all questions & answers for the Professional Cloud Architect exam

Exam Professional Cloud Architect topic 1 question 18 discussion

Actual exam question from Google's Professional Cloud Architect
Question #: 18
Topic #: 1
[All Professional Cloud Architect Questions]

Your company is forecasting a sharp increase in the number and size of Apache Spark and Hadoop jobs being run on your local datacenter. You want to utilize the cloud to help you scale this upcoming demand with the least amount of operations work and code change.
Which product should you use?

  • A. Google Cloud Dataflow
  • B. Google Cloud Dataproc
  • C. Google Compute Engine
  • D. Google Kubernetes Engine
Show Suggested Answer Hide Answer
Suggested Answer: B 🗳️

Comments

Chosen Answer:
This is a voting comment (?) , you can switch to a simple comment.
Switch to a voting comment New
AWS56
Highly Voted 4 years, 10 months ago
"B. Google Cloud Dataproc" is the answer
upvoted 19 times
...
VinayakBudapanahalli
Highly Voted 3 years, 10 months ago
Dataproc is a managed Spark and Hadoop service that lets you take advantage of open source data tools for batch processing, querying, streaming, and machine learning. Dataproc automation helps you create clusters quickly, manage them easily, and save money by turning clusters off when you don't need them. With less time and money spent on administration, you can focus on your jobs and your data. https://cloud.google.com/dataproc/docs/concepts/overview#:~:text=Dataproc%20is%20a%20managed%20Spark,%2C%20streaming%2C%20and%20machine%20learning.&text=With%20less%20time%20and%20money,your%20jobs%20and%20your%20data.
upvoted 12 times
AzureDP900
2 years, 1 month ago
Agreed
upvoted 1 times
...
...
Ekramy_Elnaggar
Most Recent 1 week, 5 days ago
Selected Answer: B
1. Managed Hadoop and Spark: Dataproc is specifically designed for running and managing Apache Spark and Hadoop clusters, which directly addresses your company's needs. 2. Scalability: Dataproc allows you to easily scale your clusters to handle the increasing number and size of jobs. You can add or remove nodes as needed to accommodate the workload. 3. Minimal Operations Work: Dataproc automates cluster creation, configuration, and management, minimizing the operational overhead. This is crucial since you want to reduce operations work. 4. Code Compatibility: Dataproc is compatible with existing Spark and Hadoop code, so you can migrate your jobs with minimal or no code changes.
upvoted 1 times
...
JohnJamesB1212
2 months, 2 weeks ago
Selected Answer: B
B is correct because Dataproc is uded for Apache Hadoop and Spark
upvoted 1 times
...
eka_nostra
1 year, 4 months ago
Selected Answer: B
Dataflow for data stream and batch. Dataproc for data process with Apache Spark and Hadoop. Compute Engine for VM. Kubernetes Engine for Kubernetes Cluster with Compute Engine under the hood.
upvoted 3 times
...
alekonko
1 year, 8 months ago
Selected Answer: B
B, Dataproc is Hadoop/Spark managed service in GCP
upvoted 2 times
...
examch
1 year, 11 months ago
Selected Answer: B
Dataproc is a fully managed and highly scalable service for running Apache Hadoop, Apache Spark, Apache Flink, Presto, and 30+ open source tools and frameworks. Use Dataproc for data lake modernization, ETL, and secure data science, at scale, integrated with Google Cloud, at a fraction of the cost. https://cloud.google.com/dataproc
upvoted 1 times
...
omermahgoub
1 year, 11 months ago
To scale the number and size of Apache Spark and Hadoop jobs being run on a local datacenter with the least amount of operations work and code change, you should consider using Google Cloud Dataproc, option B. Google Cloud Dataproc is a fully-managed service that makes it easy to run Apache Spark and Hadoop workloads in the cloud. It is designed to simplify the process of setting up and managing clusters for data processing, and allows you to scale quickly and easily as demand increases. With Cloud Dataproc, you can create and delete clusters in just a few minutes, and you can use the familiar Apache Spark and Hadoop APIs and tools to process data. This means that you can utilize the cloud to scale your workloads with minimal changes to your code and operations work. Option A: Google Cloud Dataflow, option C: Google Compute Engine, and option D: Google Kubernetes Engine, would not be suitable for this use case, as they do not provide the same level of support for running Apache Spark and Hadoop workloads as Cloud Dataproc.
upvoted 2 times
...
AniketD
2 years ago
Selected Answer: B
B. Dataproc is managed Apache Spark and Hadoop in GCP
upvoted 1 times
...
zr79
2 years, 1 month ago
Dataproc for Hadoop and spark ecosystem
upvoted 1 times
...
minmin2020
2 years, 1 month ago
Selected Answer: B
B. Google Cloud Dataproc
upvoted 1 times
...
holerina
2 years, 2 months ago
B data proc for hadoop and spark
upvoted 1 times
...
Dhiraj03
2 years, 5 months ago
Keyword - Apache Spark and Hadoop jobs - Go with Dataproc
upvoted 1 times
...
Superr
2 years, 5 months ago
Selected Answer: B
dataproc
upvoted 1 times
...
Nirca
2 years, 7 months ago
Selected Answer: B
Google Cloud Dataproc == managed Spark and Hadoop service
upvoted 2 times
...
pakochiu
2 years, 7 months ago
Selected Answer: B
B - Dataproc Lift&Shift of Apache Spark and Hadoop jobs
upvoted 1 times
...
llanerox
2 years, 10 months ago
B is ok.
upvoted 1 times
...
Community vote distribution
A (35%)
C (25%)
B (20%)
Other
Most Voted
A voting comment increases the vote count for the chosen answer by one.

Upvoting a comment with a selected answer will also increase the vote count towards that answer by one. So if you see a comment that you already agree with, you can upvote it instead of posting a new comment.

SaveCancel
Loading ...