A
AcadiFi
CL
CFA_L2_Grinder2026-04-02
cfaLevel IIQuantitative Methods

How do AIC and BIC help with model selection, and why can't I just use R-squared?

I'm studying CFA Level II and confused about why we need information criteria like AIC and BIC when R-squared already tells us how well a model fits. What do these criteria add, and when would they give different recommendations?

147 upvotes
AcadiFi TeamVerified Expert
AcadiFi Certified Professional

R-squared measures goodness of fit — how much variance your model explains. The problem: R-squared ALWAYS increases when you add variables, even useless ones. This makes it a terrible tool for model selection because it rewards complexity without penalizing overfitting.

The Overfitting Problem:

Adding a random noise variable to your model will increase R-squared slightly (in-sample) but degrade out-of-sample prediction. You need a metric that balances fit against complexity.

AIC (Akaike Information Criterion):

AIC = 2k - 2ln(L)

Where k = number of parameters, L = maximum likelihood. Lower AIC is better. The 2k term penalizes model complexity.

BIC (Bayesian Information Criterion):

BIC = k*ln(n) - 2ln(L)

Where n = number of observations. Lower BIC is better. BIC penalizes complexity MORE heavily than AIC because ln(n) > 2 for n > 8 (which is always true in practice).

Key Differences:

CriterionPenalty for ComplexityTends to SelectBest For
R-squaredNoneMost complex modelNever use alone for selection
Adjusted R-squaredMild penaltySlightly simpler modelsQuick comparison
AICModerate (2k)Best predictive modelForecasting accuracy
BICStrong (k*ln(n))Most parsimonious modelFinding the 'true' model

When They Disagree:

AIC and BIC often recommend different models. AIC prefers slightly more complex models (better predictions), while BIC prefers simpler ones (more interpretable).

Practical Example:

You're modeling quarterly corporate earnings growth using potential predictors: GDP growth, interest rates, credit spreads, oil prices, and consumer confidence.

ModelVariablesR-squaredAdj R-sqAICBIC
AGDP only0.350.34-120-116
BGDP + Rates0.420.40-128-122
CGDP + Rates + Spreads0.440.41-130-121
DAll 5 variables0.460.40-126-112
  • R-squared picks Model D (always picks the fullest model)
  • AIC picks Model C (best balance of fit and parsimony)
  • BIC picks Model B (most parsimonious adequate model)

For forecasting, you'd likely go with Model C (AIC). For understanding the key drivers, Model B (BIC) provides a cleaner story.

Exam Tip: If a CFA exam question asks which model to select using a given criterion, simply pick the model with the LOWEST AIC or BIC value. No calculation needed — just comparison.

Practice model selection problems in our CFA Level II question bank.

📊

Master Level II with our CFA Course

107 lessons · 200+ hours· Expert instruction

#aic#bic#model-selection#information-criteria#overfitting#r-squared