FAQ/lag - CBU statistics Wiki

How do I detect duplicate cases without having an ID number?

This example syntax detects cases that have identical values jointly on three variables, a, b and c. If a case has the same three values as the case immediately before it in the file, the variable copy will equal 3 (or, more generally, the number of variables being compared). Alternatively, SPSS version 19 and later has an 'Identify Duplicate Cases' option in the Data menu which may be used instead.

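* Sort on all compared variables so that identical cases are adjacent.
sort cases by a b c.
exe.
* Start the match counter at zero for every case.
compute copy=0.
exe.
* Add 1 for each variable that equals its value in the preceding case.
do repeat r=a b c.
if (r eq lag(r)) copy=copy+1.
end repeat.
exe.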
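
The approach can be tried on a small made-up data set. The sketch below is only an illustration: the data values and the flag variable dup are invented here and are not part of the original example. It sorts on all three variables so that identical cases sit next to each other, because lag only looks at the case immediately before the current one.

* Hypothetical data for illustration only.
data list free / a b c.
begin data
1 2 3
1 5 3
1 2 3
4 5 6
end data.
* Sort on every compared variable so duplicates become adjacent.
sort cases by a b c.
compute copy=0.
do repeat r=a b c.
if (r eq lag(r)) copy=copy+1.
end repeat.
* dup is 1 when all three variables match the preceding case, else 0.
compute dup=(copy eq 3).
exe.
list.

After sorting, the two cases with values 1 2 3 are adjacent, so the second of them should have copy equal to 3 and dup equal to 1. The case 1 5 3 matches its predecessor on a and c only, so its copy is 2 and it is not flagged. The first case in the file always keeps copy equal to 0, because lag returns a missing value when there is no preceding case.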

FAQ/lag (last edited 2013-03-26 16:14:31 by PeterWatson)