FAQ/NDcasestat - CBU statistics Wiki

Upload page content

You can upload content for the page named below. If you change the page name, you can also upload content for another page. If the page name is empty, we derive the page name from the file name.

File to load page content from
Page name
Comment
Type the odd letters out: ONlY twO thinGs aRE infiNite

Revision 5 as of 2009-01-30 10:35:24

location: FAQ / NDcasestat

Which output criteria should I use when using the casewise statistics option with the Normal discriminant method in SPSS?

The discriminant procedure, which is optimal when the groups follow a Normal distribution, uses the Mahalanobis distance (d) as a summary measure of the difference both between the groups and within each group. The Mahalanobis distance can also be [:FAQ/mahal: used] as a means of identifying multivariate outliers.

There are two conditional probabilities outputted in tables using the casewise statistics option in SPSS. These are P(G=g conditional on D=d) and P(D>d conditional on G=g) for the predicted group with the former also outputted for the other groups. Here g represents the group of interest and d represents the outputted Mahalanobis distance from a group centre for a particular case.

In particular P(G=g conditional on D=d) is the posterior probability of a case falling in the predicted group (for which this probability is a maximum) based on that case's Mahalanobis distance. The Mahalanobis distance for a particular case represents how typical that case is with respect to other cases in the group, g. In particular it measures the standardised distance of a case from the centre of the group.

For a particular group, g, the P(G=g conditional on D=d) equals in SPSS

$$ \frac{\mbox{exp}(\mbox{-0.5 }d(g)text{2})}{\sum_text{groups} \mbox{exp}(\mbox{-0.5 }d(group)text{2})} $$

where d(g) is the Mahalanobis distance for the g-th group.

Further, since the Mahalanobis distance based on p predictors follows a chi-square distribution on p degrees of freedom if it is equal to zero, it follows the group typicality probability of observing a case from a particular group having a greater mahalanobis distance than the observed case is

P (D>d conditional on G=g) = 1 - $$\chi(p)$$.

The interpretation of both of the above probabilities stems from the intuitive result that the further away a case is from a particular group centre (ie the more atypical it is with respect to a group) the less likely it is to be predicted to be in that group.