Question 1
Given a dataset with an imbalanced binary target variable, which approach best addresses the tendency of logistic regression to favor the majority class during model training?
- A. Oversampling the minority class or using class weighting
- B. Increasing the learning rate hyperparameter
- C. Applying principal component analysis to input features
- D. Reducing the decision boundary threshold below 0.5
- E. Performing k-means clustering before fitting the model
Question 2
In multinomial logistic regression applied to a classification task with five classes (A, B, C, D, E), how many sets of coefficients (including intercepts) does the final model estimate, assuming one-vs-rest (OvR) strategy?
- A. Five sets, one for each class
- B. Four sets, as one class is the reference
- C. Ten sets, two per class
- D. One single set shared by all classes
- E. Six sets corresponding to all pairwise class combinations
Question 3
Which one of the following statements correctly describes the difference in the mathematical formulation between binary logistic regression and multinomial logistic regression?
- A. Binary logistic regression uses a single sigmoid function, while multinomial uses multiple softmax functions
- B. Binary logistic regression models log-odds for two classes; multinomial models log-odds for each class relative to a baseline
- C. Both models use exactly the same loss function and probability output
- D. Multinomial logistic regression relies on least squares estimation instead of maximum likelihood
- E. There are no differences; both are mathematically identical
Question 4
Consider a logistic regression model predicting whether a transaction is fraudulent (1) or legitimate (0); if the model’s predicted probability for fraud is 0.95 but the actual event is legitimate, which metric would reflect the impact of this confident misclassification most strongly?
- A. Log loss (cross-entropy loss)
- B. Precision
- C. Recall
- D. F1 score
- E. Euclidian loss
Question 5
When interpreting the coefficient of a feature in binary logistic regression, what does exponentiating the coefficient (e.g., exp(β₁)) represent?
- A. The change in probability for a one-unit increase in the predictor
- B. The odds ratio representing the multiplicative change in odds for a one-unit increase in the predictor
- C. The slope of the linear regression line for that feature
- D. The regularized inverse-odds of the outcome
- E. The p-value of the hypothesis test for that feature