Date: Mon, 23 Jan 2006 09:49:26 -0500
Reply-To: Peter Flom <flom@NDRI.ORG>
Sender: "SAS(r) Discussion" <SAS-L@LISTSERV.UGA.EDU>
From: Peter Flom <flom@NDRI.ORG>
Subject: Re: QR:Logistic regression (1 values are much more less than the
Content-Type: text/plain; charset=US-ASCII
>>> "adel F." <firstname.lastname@example.org> 1/23/2006 9:40 am >>> wrote
My total sample is 7350, and I have got 16 categorical variables,
most of them are binary (2 values) others with 3 to 4 categories, I am
interested for the moment to the main effect.
I will be interested to some interactions
What will be the problems if I consider the logistic regression in
the situation that I have described?
Any comments are very welcome
I have a simple question, I want to model a binary variable using
proc logistic, my depedent variable has 5% values of 1 and 95% values
So 5% of 7350 = about 350, and you have 16 IVs, so, at least without
you should be OK (there's a rule of thumb that you want 10 cases in
your smaller outcome
for each IV).
I will let people with more knowledge comment on whether it's good
rule of thumb.
Of course, if some of the categories on your IVs are very small, that
might cause problems.
Peter L. Flom, PhD
Assistant Director, Statistics and Data Analysis Core
Center for Drug Use and HIV Research
National Development and Research Institutes
71 W. 23rd St
New York, NY 10010
(212) 845-4485 (voice)
(917) 438-0894 (fax)