What makes a benchmark valid for performance evaluation and what are the key properties a good benchmark must have?
CFA Level III lists properties of a valid benchmark but I'm finding it hard to remember them all and apply them in vignette questions. Can someone explain the required properties of a good benchmark and give examples of benchmarks that fail each criterion?
Benchmark selection is foundational to performance evaluation because a poorly chosen benchmark makes all subsequent analysis (attribution, alpha measurement, risk budgeting) meaningless. The CFA Level III curriculum identifies seven key properties of a valid benchmark.
The Seven Properties:
1. Specified in Advance
- The benchmark must be defined before the evaluation period begins
- Failure: A manager who claims 'I was targeting the Russell 2000 Value' after the fact because it was the index she happened to beat
2. Appropriate
- Must match the manager's investment style, asset class, and opportunity set
- Failure: Benchmarking a small-cap value manager against the S&P 500 (large-cap blend)
3. Measurable
- Returns must be calculable on a timely basis
- Failure: Using a custom universe of 'comparable private equity funds' whose returns are only available annually with a 6-month lag
4. Unambiguous
- The benchmark's composition and weights must be clearly defined
- Failure: 'A diversified portfolio of blue-chip stocks' — which stocks? what weights?
5. Reflective of Current Investment Opinions
- The manager should have knowledge and views on the securities in the benchmark
- Failure: A US equity manager benchmarked against the MSCI EAFE (international developed)
6. Accountable
- The manager must accept the benchmark as appropriate and be willing to be measured against it
- Failure: A benchmark imposed by the client that the manager argues is unfair for their strategy
7. Investable
- It should be possible to replicate the benchmark by holding its components
- Failure: Using the Russell 2000 as a benchmark for a concentrated 20-stock portfolio where many Russell constituents are too illiquid to actually trade
| Property | Key Question | Common Violation |
|---|---|---|
| Specified in advance | Was it set before the period? | After-the-fact selection |
| Appropriate | Does it match the strategy? | Style mismatch |
| Measurable | Can returns be calculated? | Illiquid/opaque benchmarks |
| Unambiguous | Is composition clear? | Vague definitions |
| Reflective | Does the manager know these securities? | Wrong market/geography |
| Accountable | Does the manager accept it? | Imposed, disputed benchmark |
| Investable | Can you hold it? | Uninvestable index |
Types of Benchmarks:
- Broad market indexes (S&P 500, MSCI World) — simple but may not match manager style
- Style indexes (Russell 1000 Growth, MSCI Value) — better style match
- Custom benchmarks — tailored to the manager's universe but expensive to maintain
- Manager universes — median of peer group; criticized for survivorship bias and being uninvestable
Exam Tip: Vignettes often describe a benchmark and ask which property it violates. Focus on appropriateness (style mismatch) and investability as the most commonly tested failures.
Test your benchmark analysis skills in our CFA Level III question bank.
Master Level III with our CFA Course
107 lessons · 200+ hours· Expert instruction
Related Questions
What exactly is the Capital Market Expectations (CME) framework and why does it matter for asset allocation?
How do business cycle phases affect asset class return expectations?
Can someone explain the Grinold–Kroner model step by step with numbers?
How do you forecast fixed-income returns using the building-blocks approach?
PPP vs Interest Rate Parity for forecasting exchange rates — when do I use which?
Join the Discussion
Ask questions and get expert answers.