A company is building an ML model. The company collected new data and analyzed the data by creating a correlation matrix, calculating statistics, and visualizing the data. Which stage of the ML pipeline is the company currently in?
C: Exploratory data analysis
Explanation:
Exploratory Data Analysis (EDA) involves examining and summarizing data to understand its underlying structure, detect patterns, identify relationships (e.g., via a correlation matrix), and highlight any anomalies. The company's activities, such as creating a correlation matrix, calculating statistics, and visualizing the data, are typical tasks performed during EDA.
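To make those EDA tasks concrete, here is a minimal sketch that computes summary statistics and a correlation matrix using only the Python standard library. The dataset, the column names (`sqft`, `bedrooms`, `price`), and the helper functions `summarize`, `pearson`, and `correlation_matrix` are all hypothetical and made up for illustration; nothing here comes from the question itself. In practice most people would likely reach for pandas (`DataFrame.describe()` and `DataFrame.corr()`) and a plotting library instead.

```python
import math
import statistics

# Hypothetical toy dataset (column name -> values); not part of the question.
data = {
    "sqft": [850.0, 900.0, 1200.0, 1500.0, 2000.0],
    "bedrooms": [2.0, 2.0, 3.0, 3.0, 4.0],
    "price": [100.0, 110.0, 150.0, 180.0, 240.0],
}


def summarize(values):
    """Return basic descriptive statistics for one numeric column."""
    return {
        "mean": statistics.mean(values),
        "median": statistics.median(values),
        "stdev": statistics.stdev(values),  # sample standard deviation
        "min": min(values),
        "max": max(values),
    }


def pearson(xs, ys):
    """Pearson correlation between two equal-length, non-constant columns."""
    mx, my = statistics.mean(xs), statistics.mean(ys)
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sx = math.sqrt(sum((x - mx) ** 2 for x in xs))
    sy = math.sqrt(sum((y - my) ** 2 for y in ys))
    return cov / (sx * sy)  # no guard for constant columns in this sketch


def correlation_matrix(columns):
    """Build a nested dict holding the pairwise Pearson correlations."""
    names = list(columns)
    return {a: {b: pearson(columns[a], columns[b]) for b in names} for a in names}


summary = {name: summarize(vals) for name, vals in data.items()}
corr = correlation_matrix(data)

for name, stats in summary.items():
    print(name, stats)
for a, row in corr.items():
    print(a, {b: round(r, 3) for b, r in row.items()})
```

Nothing in this sketch modifies the data. It only describes it, which is what separates EDA from the preparation steps that come later.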
Why not the other options?
A: Data pre-processing:
Data pre-processing involves cleaning and preparing data for modeling, such as handling missing values, scaling features, or encoding categorical data. Pre-processing usually follows EDA, and the tasks described in the question analyze the data rather than prepare it.
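For contrast, the three pre-processing steps named above could be sketched as follows, again with only the standard library. The sample values and the helpers `impute_mean`, `min_max_scale`, and `one_hot` are hypothetical. Mean imputation, min-max scaling, and one-hot encoding are just one common choice for each step, not the only one.

```python
import statistics

# Hypothetical raw columns; None marks a missing value.
ages = [25.0, None, 35.0, 45.0]
colors = ["red", "blue", "red", "green"]


def impute_mean(values):
    """Replace None with the mean of the observed values."""
    observed = [v for v in values if v is not None]
    fill = statistics.mean(observed)
    return [fill if v is None else v for v in values]


def min_max_scale(values):
    """Rescale values linearly to [0, 1]; assumes the column is not constant."""
    lo, hi = min(values), max(values)
    return [(v - lo) / (hi - lo) for v in values]


def one_hot(values):
    """Encode categories as 0/1 indicator rows, with columns in sorted order."""
    categories = sorted(set(values))
    rows = [[1 if v == c else 0 for c in categories] for v in values]
    return categories, rows


ages_filled = impute_mean(ages)
ages_scaled = min_max_scale(ages_filled)
categories, colors_encoded = one_hot(colors)

print(ages_filled)
print(ages_scaled)
print(categories, colors_encoded)
```

Unlike the analysis in the question, every function here returns a transformed copy of the data, which is why these activities belong to pre-processing rather than EDA.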
C. Exploratory data analysis
Exploratory Data Analysis (EDA) involves examining and visualizing data to understand its structure, patterns, and relationships. Creating a correlation matrix, calculating statistics, and visualizing data are all typical tasks during the EDA phase, which helps inform later stages such as data preprocessing and feature engineering.