Date:         Mon, 12 May 2003 09:01:52 -0400
Reply-To:     Michael Kruger <m_kruger@WAYNE.EDU>
Sender:       "SPSSX(r) Discussion" <SPSSX-L@LISTSERV.UGA.EDU>
From:         Michael Kruger <m_kruger@WAYNE.EDU>
Subject:      Help with selecting controls for cases
Content-Type: text/plain; charset=ISO-8859-1; format=flowed

I am trying to select 2 controls for each case in a dataset of about 16,000 subjects. The data is structured as follows:

ID  BIRTHYR  CASE  SURVIVAL
 1     1915     1        87
 2     1915     0        80
 3     1915     0        63
 4     1915     0        87
 6     1915     0        71
 7     1925     1        77
 8     1925     0        87
 9     1925     0        67
10     1925     0        52

I need to pick 2 'controls' (CASE=0) matched on BIRTHYR for each of my cases. For some cases there may be over 100 controls in the file, while for other cases (years) there may be only 3 or 4 controls in the file. Many of the cases have the same birth year, but I don't think that should affect the matching process. I want to randomly select 2 controls for each case, matched on BIRTHYR. There are 147 cases in the file and approximately 5400 controls that match the birth years of the cases. I want to end up with a selected data set of 146 cases and 292 randomly selected but birth-year-matched controls.
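To make the selection described above concrete, here is a minimal sketch of the logic in Python rather than SPSS syntax. It is only an illustration under stated assumptions: the function name `match_controls`, the `per_case` and `seed` parameters, and the rule that a control is used for at most one case are all my own choices, not an established procedure. Cases whose birth year has fewer than 2 unused controls left are reported as unmatched instead of being given a partial set.

```python
import random
from collections import defaultdict

# Sample rows copied from the listing above (ID, BIRTHYR, CASE, SURVIVAL).
ROWS = [
    {"ID": 1, "BIRTHYR": 1915, "CASE": 1, "SURVIVAL": 87},
    {"ID": 2, "BIRTHYR": 1915, "CASE": 0, "SURVIVAL": 80},
    {"ID": 3, "BIRTHYR": 1915, "CASE": 0, "SURVIVAL": 63},
    {"ID": 4, "BIRTHYR": 1915, "CASE": 0, "SURVIVAL": 87},
    {"ID": 6, "BIRTHYR": 1915, "CASE": 0, "SURVIVAL": 71},
    {"ID": 7, "BIRTHYR": 1925, "CASE": 1, "SURVIVAL": 77},
    {"ID": 8, "BIRTHYR": 1925, "CASE": 0, "SURVIVAL": 87},
    {"ID": 9, "BIRTHYR": 1925, "CASE": 0, "SURVIVAL": 67},
    {"ID": 10, "BIRTHYR": 1925, "CASE": 0, "SURVIVAL": 52},
]


def match_controls(rows, per_case=2, seed=2003):
    """Hypothetical helper: randomly assign `per_case` controls to each case,
    matched on BIRTHYR, without reusing a control (an assumption)."""
    rng = random.Random(seed)  # fixed seed so the draw is reproducible

    # Pool the control IDs by birth year and shuffle each pool once.
    pool = defaultdict(list)
    for r in rows:
        if r["CASE"] == 0:
            pool[r["BIRTHYR"]].append(r["ID"])
    for ids in pool.values():
        rng.shuffle(ids)

    matched = {}    # case ID -> list of selected control IDs
    unmatched = []  # case IDs with too few controls left in their year
    for r in rows:
        if r["CASE"] != 1:
            continue
        available = pool[r["BIRTHYR"]]
        if len(available) < per_case:
            unmatched.append(r["ID"])
            continue
        # Popping from a shuffled pool is a random draw without replacement.
        matched[r["ID"]] = [available.pop() for _ in range(per_case)]
    return matched, unmatched


if __name__ == "__main__":
    matched, unmatched = match_controls(ROWS)
    for case_id, control_ids in matched.items():
        print(case_id, control_ids)
    print("unmatched cases:", unmatched)
```

Because several cases can share a birth year, drawing without replacement within each year keeps any control from being matched to two cases. If reuse of controls across cases is acceptable, the pool could be sampled independently for each case instead.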

Any help in writing syntax for this would be appreciated. Thanks.

--
Michael Kruger
Statistical Analyst
C.S. Mott Center for Human Growth & Development
Dept. of OB/GYN
Wayne State Univ. School of Medicine
(313)-577-1794

