# How do I adjust R-squared for the number of predictors in a model?

The \$\$R^text{2}\$\$ statistic which summarises a multiple regression has a drawback that it never decreases with the addition of subsequent predictors. An adjusted form of R-squared can decrease with the addition of a subsequently uninformative predictor. This aids model selection by penalising overly parametrised models.

For a sample size of n, with p predictors,

Adjusted \$\$R2 \$\$ = 1 - [ (1 - \$\$R2 \$\$) (n-1)/(n-p-1) ]

The adjusted \$\$R2 \$\$ may take negative values. In such a case the adjusted \$\$R2 \$\$ is given the value of zero.

Example

A sample of size 10 has a \$\$R2 \$\$ of 0.623 . A predictor, probability, is added which increases the \$\$R2 \$\$ to 0.635.

The adjusted \$\$R2 \$\$ are 0.576 and 0.531 for the one and two predictor models which, by convention, we equate to zero. Taken together the two predictors do not explain much variance in the outcome. The addition of the second predictor, probability, actually reduces the value of the adjusted R-squared.

Reference

Cohen, J. & Cohen P. (1983) Applied multiple regression/correlation analysis for the behavioral sciences. lawrence Erlbaum, Hillsdale, NJ.

None: FAQ/rsqadj (last edited 2014-09-29 13:27:04 by PeterWatson)