Practice Overfitting (Intuition) in Math
Use these practice problems to test your method after reviewing the concept explanation and worked examples.
Quick Recap
Overfitting occurs when a model learns the noise in training data instead of just the underlying pattern, performing well on training data but poorly on new data.
The model memorized the training data instead of learning the underlying pattern.
Showing a random 20 of 50 problems.
Example 1
challengeYou build a stock-trading model that achieves accuracy on historical data from 2010–2020. In live trading during 2021 it loses money consistently. Explain what likely happened and what you would change.
Example 2
hardFor ridge regression , what happens to overfitting risk as ?
Example 3
hardExplain the bias-variance tradeoff: how does increasing model complexity affect bias and variance, and where is the optimal model?
Example 4
mediumMore training data tends to reduce overfitting. Why?
Example 5
mediumIn the bias-variance decomposition, overfitting corresponds primarily to high ___.
Example 6
mediumA decision tree is grown until every leaf contains exactly one training example. Training error is . Most likely diagnosis?
Example 7
mediumA model memorizes the training set, scoring perfectly there but failing on a held-out set. What single word names this?
Example 8
hardWhy does dropping out random neurons during training help prevent overfitting in neural networks?
Example 9
hardA study reports an for a model with 50 predictors fit on 60 observations. The author claims high explanatory power. Critique this claim.
Example 10
mediumA spam filter is tuned until it flags every training email perfectly, but it misses new spam. What happened?
Example 11
hardTrue or false: increasing the training-set size NEVER reduces overfitting, no matter how large.
Example 12
easyOverfitting means the model has learned the ___ instead of the underlying pattern.
Example 13
challengeA polynomial of degree is fit to 8 points. At what degree can it pass through all points exactly (interpolate), risking maximal overfit?
Example 14
easyTrue or false: a perfect training error always indicates the model will generalize well.
Example 15
mediumTrue or false: adding a regularization term like typically increases training error but reduces test error when the model is overfit.
Example 16
easyWhich model is more likely overfit: a straight line, or a degree-15 polynomial through 16 points?
Example 17
challengeModel train errors by complexity: ; test errors: . Which complexity should you pick and why?
Example 18
hardA random forest trained on 1000 examples has training error and test error . Is this concerning overfitting?
Example 19
mediumCross-validation reports a mean of across folds, but a single train/test split shows on training and on test. Which estimate of generalization is more trustworthy?
Example 20
mediumA model fits 10 data points with a degree-9 polynomial (perfect fit, ). A simpler linear model has . Explain which model is better for prediction and why.