Understanding Components
Which of the following is typically responsible for transforming raw data into a usable format in a machine learning system?
- A. Data preprocessing module
- B. Result visualization block
- C. Model inference controller
- D. Feedback loop engine
- E. Error logging thread
Selecting Storage Types
When storing large volumes of unstructured text data for an ML project, which storage type is most suitable?
- A. Relational database
- B. Image cache
- C. Object storage
- D. Memory buffer
- E. Configuration file
Model Training Basics
In a basic ML pipeline, what is the primary function of the training phase?
- A. Predicting outputs for new inputs
- B. Transforming visualizations
- C. Creating a feedback report
- D. Learning patterns from labeled data
- E. Deleting outdated files
Concept of Model Deployment
If you want users to submit data to your model using an application interface, which component should handle this task?
- A. Data cleaning pipeline
- B. User endpoint API
- C. Randomization module
- D. Feature engineering block
- E. Plotting library
Monitoring Models
Why is monitoring important after deploying a machine learning model to production?
- A. It helps track model accuracy and detect drift over time.
- B. It increases training dataset size.
- C. It reduces hardware requirements.
- D. It eliminates need for versions.
- E. It speeds up user registration.
Role of Feature Engineering
In an ML system analyzing customer reviews, creating a 'review_length' feature from text is an example of what?
- A. Model validation
- B. Feature engineering
- C. Label encoding
- D. Output transformation
- E. Hyperperimeter tuning
Batch vs Real-Time Processing
Which type of processing is ideal for making immediate predictions, such as classifying incoming SMS messages as spam or not?
- A. Batch processing
- B. Deferred analysis
- C. Real-time (online) processing
- D. Manual review
- E. Round-robbin method
Modular System Design
What is a primary benefit of designing an ML system using a modular architecture?
- A. Easier to reuse and update parts independently
- B. Forces single programming language usage
- C. Reduces need for monitoring
- D. Speeds up internet connections
- E. Increases data duplication
Input Data Quality
In a scenario where user-submitted images are sometimes blurry or poorly lit, which approach can improve the ML model's robustness?
- A. Automatically accept all images
- B. Use data augmentation during training
- C. Only use the largest images
- D. Disregard image quality
- E. Limit training to one image
Iterative Improvement
After deploying an ML system, why is it important to collect feedback on prediction errors?
- A. To allow ongoing model improvements with new data
- B. To identify spelling mistakes in code
- C. To track server energy usage
- D. To increase product advertisements
- E. To reduce RAM capacity