When two variables are correlated, how do I determine which one is actually predictive? The curriculum says there are four possible explanations.

AcadiFi · Accepted Answer

This is one of the most important conceptual traps in developing capital market expectations (CME). When you observe a significant correlation between variable A and variable B, the correlation coefficient alone cannot distinguish among four very different realities:

```mermaid
flowchart TD
    A["Observed: A and B are correlated"] --> B["Explanation 1:<br/>A predicts B"]
    A --> C["Explanation 2:<br/>B predicts A"]
    A --> D["Explanation 3:<br/>C drives both A and B"]
    A --> E["Explanation 4:<br/>Spurious (pure coincidence)"]
    B --> F[Use A to forecast B ✓]
    C --> G[Reverse causation — using A to forecast B is wrong]
    D --> H[Omitted variable — find C for better model]
    E --> I[No predictive value at all]
```

**Practical Example — Harborview Research:**

Harborview discovers a strong correlation (r = 0.72) between US construction spending and Australian equity returns over the past 15 years. Four possible interpretations:

**1. US construction → Australian equities (A predicts B):**
Perhaps US construction booms signal global economic expansion, which benefits Australia's export-driven economy. Plausible but indirect.

**2. Australian equities → US construction (B predicts A):**
Perhaps rising Australian equities signal a strong commodities cycle (iron ore, copper), and commodity wealth flows into US real estate. Possible but unlikely as the primary channel.

**3. Third variable C drives both:**
This is most likely. Chinese economic growth simultaneously drives Australian commodity exports (lifting Australian equities) AND stimulates global construction demand (including US). The true predictor is Chinese demand, not either observed variable directly.

**4. Spurious correlation:**
The relationship may reflect coincidental trends over 15 years (both happened to grow during the same global expansion). Out-of-sample testing would reveal whether the relationship has any durability.
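To see how shared trends alone can produce a high correlation, here is a minimal simulation sketch. Everything in it is hypothetical: the two series are generated independently, the drift and volatility values are arbitrary, and comparing levels against period-to-period changes is just one common diagnostic, not something the curriculum prescribes.

```python
import random
from math import sqrt

def pearson(x, y):
    """Sample Pearson correlation of two equal-length sequences."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / sqrt(sxx * syy)

def trending_series(rng, n, drift, vol):
    """Cumulative sum of independent shocks around a constant drift."""
    level, out = 0.0, []
    for _ in range(n):
        level += drift + rng.gauss(0.0, vol)
        out.append(level)
    return out

def diffs(x):
    """Period-to-period changes of a series."""
    return [b - a for a, b in zip(x, x[1:])]

rng = random.Random(42)
n = 180  # hypothetical: 15 years of monthly observations

# Two series built from independent shocks; they share nothing but an upward drift.
a = trending_series(rng, n, drift=0.5, vol=1.0)
b = trending_series(rng, n, drift=0.5, vol=1.0)

corr_levels = pearson(a, b)                  # dominated by the common trend
corr_changes = pearson(diffs(a), diffs(b))   # the trend is removed by differencing

print(f"correlation of levels:  {corr_levels:.2f}")
print(f"correlation of changes: {corr_changes:.2f}")
```

The levels should look strongly correlated even though the shocks are unrelated, while the changes should show little correlation. A relationship that only appears in trending levels deserves extra suspicion.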

**How to Investigate:**

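1. **Temporal ordering (Granger causality tests):** Does A lead B in time, or does B lead A? A Granger test asks whether lagged values of one variable improve forecasts of the other; it establishes predictive precedence, not true causation. If neither leads, a third variable or spurious relationship is more likely.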
2. **Control for candidate third variables:** If adding Chinese GDP growth to the regression eliminates the correlation between A and B, then C (China) was driving both.
3. **Economic mechanism:** Map out the causal chain. Is there a plausible direct mechanism from A to B? How many intermediate steps are required?
4. **Out-of-sample testing:** Spurious correlations tend to collapse out of sample, while genuine relationships (whether direct or through C) are more likely to persist.
5. **Natural experiments:** Look for periods where A changed for reasons unrelated to B. Did B still respond?
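Steps 1 and 2 above can be sketched on simulated data. This is an illustration under stated assumptions, not a formal procedure: the series `a`, `b`, and `c` are hypothetical, the coefficients are arbitrary, the lead-lag correlation is only a rough stand-in for a proper Granger causality test, and the "control" step is a partial correlation computed from one-regressor OLS residuals.

```python
import random
from math import sqrt

def pearson(x, y):
    """Sample Pearson correlation of two equal-length sequences."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((p - mx) * (q - my) for p, q in zip(x, y))
    sxx = sum((p - mx) ** 2 for p in x)
    syy = sum((q - my) ** 2 for q in y)
    return sxy / sqrt(sxx * syy)

def residuals(y, x):
    """Residuals from an OLS fit of y on x with an intercept."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    beta = (sum((xi - mx) * (yi - my) for xi, yi in zip(x, y))
            / sum((xi - mx) ** 2 for xi in x))
    alpha = my - beta * mx
    return [yi - alpha - beta * xi for xi, yi in zip(x, y)]

def lead_lag_corr(x, y, k):
    """Correlation of x at time t with y at time t + k (k >= 1)."""
    return pearson(x[:-k], y[k:])

rng = random.Random(7)
n = 2000

# Hypothetical world: C drives both A and B in the same period; A and B never interact.
c = [rng.gauss(0.0, 1.0) for _ in range(n)]
a = [0.9 * ci + rng.gauss(0.0, 0.5) for ci in c]
b = [0.8 * ci + rng.gauss(0.0, 0.5) for ci in c]

raw_corr = pearson(a, b)                                  # looks like a strong A-B link
partial_corr = pearson(residuals(a, c), residuals(b, c))  # A-B link after removing C
a_leads_b = lead_lag_corr(a, b, 1)                        # does A lead B by one period?
b_leads_a = lead_lag_corr(b, a, 1)                        # does B lead A by one period?

print(f"raw corr(A, B):         {raw_corr:.2f}")
print(f"partial corr given C:   {partial_corr:.2f}")
print(f"A leads B / B leads A:  {a_leads_b:.2f} / {b_leads_a:.2f}")
```

In this setup the raw correlation is strong, neither variable leads the other, and the partial correlation drops to roughly zero once C is controlled for. That is the signature of the omitted-variable case in the Harborview example.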

**The Nonlinear Trap — Don't Dismiss Low Correlations Too Quickly:**

The curriculum also warns about the opposite mistake: concluding that no relationship exists because the linear correlation is low. Consider the VIX index and S&P 500 returns. The Pearson correlation might be modest (around -0.3 to -0.4), but the relationship is strongly nonlinear: the VIX barely moves when markets rise slowly but spikes dramatically during selloffs. A modest, or even negligible, linear correlation can mask a powerful nonlinear relationship.

```mermaid
flowchart LR
    A[Low Linear Correlation] --> B{Check for Nonlinearity}
    B -->|Plot the scatterplot| C[U-shaped or threshold pattern?]
    C -->|Yes| D[Strong nonlinear relationship exists]
    C -->|No| E[Genuinely weak relationship]
    D --> F[Use nonlinear model or regime-dependent specification]
```
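The check in the flowchart can be illustrated with a small sketch. The data here are simulated and purely hypothetical (a symmetric U-shaped relationship, which is the cleanest case; it is not VIX data), and the bin boundaries are arbitrary choices for illustration.

```python
import random
from math import sqrt

def pearson(x, y):
    """Sample Pearson correlation of two equal-length sequences."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((p - mx) * (q - my) for p, q in zip(x, y))
    sxx = sum((p - mx) ** 2 for p in x)
    syy = sum((q - my) ** 2 for q in y)
    return sxy / sqrt(sxx * syy)

def mean(values):
    return sum(values) / len(values)

rng = random.Random(0)
n = 5000

# Hypothetical U-shaped relationship: y depends strongly on x, but not linearly.
x = [rng.uniform(-1.0, 1.0) for _ in range(n)]
y = [xi ** 2 + rng.gauss(0.0, 0.1) for xi in x]

linear_corr = pearson(x, y)                         # near zero despite strong dependence
squared_corr = pearson([xi ** 2 for xi in x], y)    # high once the shape is modeled

# A crude numeric stand-in for "plot the scatterplot": average y by region of x.
low_bin = mean([yi for xi, yi in zip(x, y) if xi < -0.5])
mid_bin = mean([yi for xi, yi in zip(x, y) if -0.5 <= xi <= 0.5])
high_bin = mean([yi for xi, yi in zip(x, y) if xi > 0.5])

print(f"corr(x, y):   {linear_corr:.2f}")
print(f"corr(x^2, y): {squared_corr:.2f}")
print(f"mean y by bin (low, mid, high): {low_bin:.2f}, {mid_bin:.2f}, {high_bin:.2f}")
```

The linear correlation comes out close to zero, yet the binned averages are clearly higher at both extremes than in the middle, and the correlation with the squared term is high. A low Pearson coefficient by itself is not evidence of "no relationship".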

**Key Exam Takeaway:** Never use a correlation in a predictive model without investigating the underlying causal structure. The observed statistic is the beginning of the analysis, not the end.

