What are the key assumptions of multiple regression and how do I detect violations?
I'm working through the regression section of CFA Level I Quant. The curriculum lists assumptions like linearity, homoscedasticity, and no multicollinearity — but I find it overwhelming. What are the most testable assumptions, and how would I spot violations in a vignette?
Multiple regression is a CFA Level I favorite, and the exam almost always tests whether you can identify assumption violations from given data. Here are the key assumptions and how to spot trouble:
The Core Assumptions
- Linearity — The relationship between each independent variable and the dependent variable is linear.
- Homoscedasticity — Error terms have constant variance across all levels of the independent variables.
- No Serial Correlation — Error terms are independent of each other (especially relevant for time-series data).
- No Multicollinearity — Independent variables are not highly correlated with each other.
- Normality of Errors — Error terms are normally distributed (needed for valid hypothesis tests).
How to Detect Violations in a CFA Vignette
| Violation | Detection Method | Red Flag in Data |
|---|---|---|
| Heteroscedasticity | Breusch-Pagan test | Residuals fan out as fitted values increase |
| Serial Correlation | Durbin-Watson statistic | DW far from 2.0 (close to 0 or 4) |
| Multicollinearity | Variance Inflation Factor (VIF) | VIF > 5 or 10; high R² but insignificant coefficients |
Worked Example:
An analyst at Cornerstone Research regresses quarterly fund returns (Y) on interest rate changes (X₁) and credit spread changes (X₂). Results:
- R² = 0.81
- t-stat for X₁ = 1.12 (insignificant)
- t-stat for X₂ = 0.97 (insignificant)
- Correlation between X₁ and X₂ = 0.93
The red flag: High R² with insignificant individual coefficients and high correlation between regressors — classic multicollinearity. The model explains returns well overall but can't disentangle the individual effects.
Fix: Drop one variable, combine them into a composite, or use principal component analysis.
Exam Strategy: When you see a regression output in a vignette, immediately check:
- Is R² high but individual t-stats low? → Multicollinearity
- Is the Durbin-Watson far from 2? → Serial correlation
- Do residuals show a pattern? → Heteroscedasticity or non-linearity
Practice identifying these patterns with our CFA Level I question bank.
Master Level I with our CFA Course
107 lessons · 200+ hours· Expert instruction
Related Questions
What exactly is the Capital Market Expectations (CME) framework and why does it matter for asset allocation?
How do business cycle phases affect asset class return expectations?
Can someone explain the Grinold–Kroner model step by step with numbers?
How do you forecast fixed-income returns using the building-blocks approach?
PPP vs Interest Rate Parity for forecasting exchange rates — when do I use which?
Join the Discussion
Ask questions and get expert answers.