An administrator is deploying Spark on Amazon EMR for two distinct use cases: machine learning algorithms and ad-hoc querying. All data will be stored in Amazon S3. Two separate clusters for each use case will be deployed. The data volumes on Amazon S3 are less than 10 GB.
How should the administrator align instance types with the clusters purpose?
exams
Highly Voted 3 years, 7 months agomatthew95
Most Recent 3 years, 6 months agoMichRox
3 years, 6 months agosan2020
3 years, 6 months ago