FAQ/points - CBU statistics Wiki

Revision 14 as of 2011-09-15 10:14:33

Clear message
location: FAQ / points

How do I scatterplot observations which have the same set of co-ordinates?

Sometimes the same co-ordinates are shared by more than one observation. A scatterplot will only, however, show one point for each unique x,y combination regardless of the number of observations that share this combination.

One way of disentangling this is to add a proportionately small amount to the observed values of one of the variables, say, y. The below syntax uses the rv.uniform function in SPSS to add a small random amount to the y values when an x,y combination has previously occurred. The new values which are now all unique (ynew) can then be plotted against x.

sort cases by x.
sort cases by y.
COMPUTE copy=0. 
IF (x EQ LAG(x) AND y EQ LAG(y)) copy = 1. 

compute ynew = y.
if (copy eq 1) YNEW = Y + RV.UNIFORM(Y*0.01,Y*0.015). 

The above process, whereby concurrent points are plotted so they are visible individually on the scatterplot, is called jittering. In versions 13 to 16 of SPSS there is an option for 'jittering' points in the graph menu bar. This is only available using the Interactive mode: Select Graphs > Interactive > Scatterplot.... Double click on the plot and then on any data point. You should then see a 'jitter' tab. You can use this tab to jitter both x and y co-ordinates in the scatterplot so all overlapping points are then visible.

In later versions of SPSS (18 and 19) the 'jitter' and other graphical options are no longer available in the menu bar and can only be done using the SPSS syntax given below. This syntax assumes there are two variables, min (y-axis) and max (x-axis), which we wish to scatterplot. You will, therefore, need to substitute your own variable names into this syntax. You can then edit the graph (e.g. to change the scale of axes) in the usual way by double-clicking on it.

        /Y=VAR(min) TYPE=SCALE
        /X1=VAR(max) TYPE=SCALE