How do AIC and BIC help with model selection, and why can't I just use R-squared?
I'm studying CFA Level II and confused about why we need information criteria like AIC and BIC when R-squared already tells us how well a model fits. What do these criteria add, and when would they give different recommendations?
R-squared measures goodness of fit — how much variance your model explains. The problem: R-squared ALWAYS increases when you add variables, even useless ones. This makes it a terrible tool for model selection because it rewards complexity without penalizing overfitting.
The Overfitting Problem:
Adding a random noise variable to your model will increase R-squared slightly (in-sample) but degrade out-of-sample prediction. You need a metric that balances fit against complexity.
AIC (Akaike Information Criterion):
AIC = 2k - 2ln(L)
Where k = number of parameters, L = maximum likelihood. Lower AIC is better. The 2k term penalizes model complexity.
BIC (Bayesian Information Criterion):
BIC = k*ln(n) - 2ln(L)
Where n = number of observations. Lower BIC is better. BIC penalizes complexity MORE heavily than AIC because ln(n) > 2 for n > 8 (which is always true in practice).
Key Differences:
| Criterion | Penalty for Complexity | Tends to Select | Best For |
|---|---|---|---|
| R-squared | None | Most complex model | Never use alone for selection |
| Adjusted R-squared | Mild penalty | Slightly simpler models | Quick comparison |
| AIC | Moderate (2k) | Best predictive model | Forecasting accuracy |
| BIC | Strong (k*ln(n)) | Most parsimonious model | Finding the 'true' model |
When They Disagree:
AIC and BIC often recommend different models. AIC prefers slightly more complex models (better predictions), while BIC prefers simpler ones (more interpretable).
Practical Example:
You're modeling quarterly corporate earnings growth using potential predictors: GDP growth, interest rates, credit spreads, oil prices, and consumer confidence.
| Model | Variables | R-squared | Adj R-sq | AIC | BIC |
|---|---|---|---|---|---|
| A | GDP only | 0.35 | 0.34 | -120 | -116 |
| B | GDP + Rates | 0.42 | 0.40 | -128 | -122 |
| C | GDP + Rates + Spreads | 0.44 | 0.41 | -130 | -121 |
| D | All 5 variables | 0.46 | 0.40 | -126 | -112 |
- R-squared picks Model D (always picks the fullest model)
- AIC picks Model C (best balance of fit and parsimony)
- BIC picks Model B (most parsimonious adequate model)
For forecasting, you'd likely go with Model C (AIC). For understanding the key drivers, Model B (BIC) provides a cleaner story.
Exam Tip: If a CFA exam question asks which model to select using a given criterion, simply pick the model with the LOWEST AIC or BIC value. No calculation needed — just comparison.
Practice model selection problems in our CFA Level II question bank.
Master Level II with our CFA Course
107 lessons · 200+ hours· Expert instruction
Related Questions
What exactly is the Capital Market Expectations (CME) framework and why does it matter for asset allocation?
How do business cycle phases affect asset class return expectations?
Can someone explain the Grinold–Kroner model step by step with numbers?
How do you forecast fixed-income returns using the building-blocks approach?
PPP vs Interest Rate Parity for forecasting exchange rates — when do I use which?
Join the Discussion
Ask questions and get expert answers.