Your team is working on an image recognition system to help identify plants. They have collected a large amount of data but need to get this data labeled.
Which phase of CPMAI is this done?
Correct Answer:C
Phase III: Data Preparation includes the Data Labeling generic task group. Specifically, the Label data task covers "identifying methods for data labeling and engaging in data labeling efforts," which is essential for supervised learning workflows like image recognition.
You have been tasked with creating a model that will recommend products based on what other customers have similarly purchased. Which algorithm is the best choice given this situation?
Correct Answer:A
CPMAI's Generic Task Group: Select Modeling Technique in Phase IV: Model Development outlines common cognitive algorithms. For recommendation systems—which rely on finding similar user or item profiles—the K-Nearest Neighbor algorithm is the canonical choice, using customer purchase vectors to locate "nearest neighbors." In contrast, K-means is purely unsupervised clustering, Neural Networks are more complex and not necessary for basic collaborative filtering, and Hyperpersonalization is an AI pattern, not an algorithm.
The team is working to build a data preparation pipeline for the conversational chatbot project. Which phase of CPMAI is this done?
Correct Answer:C
Phase III: Data Preparation focuses on constructing and documenting reusable data pipelines - including training and inference pipelines - alongside cleansing, augmentation, and labeling tasks to prepare data for modeling . This is where teams build the end-to-end data preparation workflows for AI solutions such as chatbots.
- [Data for AI]
You have been brought on to manage a recognition project, specifically an image recognition project, for an Autonomous Retail application. You know that you need to make sure you have sufficient data for this project. What??s the best way to approach this?
Correct Answer:B
In Phase II: Data Understanding, CPMAI??s Data Selection tasks require teams to ??Decide on the data to be used for analysis?? by first listing all available sources and then selecting only those records and attributes that meet quality and relevance criteria . Taking a company-wide data inventory ensures you don??t overlook relevant datasets before narrowing down to what truly applies.
You're working with an inexperienced team and this is all their first AI project. You're trying to work on a supervised learning binary classification problem to determine if emails are spam or not.
What is the best approach for this project?
Correct Answer:A
Naive Bayes classifiers are a family of "simple probabilistic classifiers based on Bayes' theorem with the 'naive' assumption of feature independence," making them fast to train and easy to interpret - ideal for teams new to AI tackling binary tasks like spam detection .
- [Machine Learning]
Recently your company has been getting a large number of spam emails and some employees have been clicking on these suspicious emails causing a headache for IT. The head of IT wants to create a more robust spam filter and your team has been tasked with this project.
What type of algorithm would you select for this problem?
Correct Answer:C
A spam filter must decide between exactly two categories—spam or not spam—making it a binary (or binomial) classification task. The CPMAI Glossary defines binary classification as ??a classification task where data is categorized into one of two classes (e.g., spam vs. not spam).??