On Tue, 28 Nov 2006 09:43:54 -0500, Ran S <raan67@YAHOO.COM> wrote:
>I have a dataset where I am looking for duplicates. In my dataset I would
>like to keep and look at the IDS if it has same ID but slightly different
>data values for atleast one or more variables.What's the best way of
>comparing data values for all these variables for 2 or more same IDs?
>I would appreciate your help in this regard.
how big is your data ( #rows and #columns) ?
Is it ordered by ID ?
What will you do with the information?
For example : create a table of just duplicates or
just reduce the data to unique values or
distribute some value according to other distribution share
the interested await.