FAQ/roc - CBU statistics Wiki

Revision 20 as of 2011-07-26 11:12:18

Clear message
location: FAQ / roc

Plotting ROC curves

The Receiver Operating Characteristic (ROC) curve is a graph which illustrates just how well a set of predictor variables, measured on various cases, predict the group to which that case belongs.

In the example data below is collected to assess how well a cases's test score and sex can predict if the case is a control or a patient (the group).

Group

Score

Sex

1

12

1

1

15

2

1

23

1

1

16

2

1

10

2

0

24

1

0

34

1

0

21

1

0

25

2

0

9

2

Binary logistic regression can be used to produce estimates of group membership based on test score and sex and compared to the observed "true" group using a classification table. (Correct and incorrect classification probabilities may be obtained from [http://imaging.mrc-cbu.cam.ac.uk/statswiki/FAQ/criteria: this table.] Two of these diagnostics may then be plotted by a ROC curve (available in the graph menu) using the predicted group membership probabilities using score and sex as predictors which can be outputted from the logistic regression procedure. The syntax in the box below does the ROC analysis. The area under the ROC curve is also used as a discrimination diagnostic. The area under the curve ranges from 0.50 to 1.00. The nearer to 1 the better the discrimination. [http://gim.unmc.edu/dxtests/ROC3.htm There are rules of thumb based on deciles]. These are reproduced in the table.

Area

Point system

0.50-0.60

Fail

0.60-0.70

Poor

0.70-0.80

Fair

0.80-0.90

Good

0.90-1.00

Excellent

There is no ROC analysis for more than two groups but an assessment of fit could be carried out by obtaining a classification table or predicted versus observed groups from a multinomial or ordinal logistic regression procedure.

LOGISTIC REGRESSION VAR=group
  /METHOD=ENTER score sex
  /SAVE PRED (pred)
  /CRITERIA PIN(.05) POUT(.10) ITERATE(20) CUT(.5) .

ROC pred by group(1)
/MISSING = EXCLUDE
/PLOT = CURVE
/PRINT = SE COORDINATES.

The bigger the area under the ROC curve the better the prediction. This may also be done by inputting specificities and sensitivites into a [:FAQ/rocplot: SPSS macro.]

  • [:FAQ/criteria: See here for examples of working out sensitivities and specificities used to plot the ROC curve.]