LISTSERV at the University of Georgia
Menubar Imagemap
Home Browse Manage Request Manuals Register
Previous messageNext messagePrevious in topicNext in topicPrevious by same authorNext by same authorPrevious page (February 1998, week 4)Back to main SAS-L pageJoin or leave SAS-L (or change settings)ReplyPost a new messageSearchProportional fontNon-proportional font
Date:         Wed, 25 Feb 1998 10:19:44 -600
Reply-To:     Louise Lawson <llawson@MRG.SOPH.UAB.EDU>
Sender:       "SAS(r) Discussion" <SAS-L@UGA.CC.UGA.EDU>
From:         Louise Lawson <llawson@MRG.SOPH.UAB.EDU>
Organization: University of Alabama at Birmingham
Subject:      GENMOD v GLM, redux
Comments: To: "Zuckier, Gerald" <Zuckier@CHIME.ORG>
Comments: cc: austin@fox.sph.emory.edu, mmacaluso@mrg.soph.uab.edu

Thanks to everyone who set me straight on GLM. I have to admit that I think in terms of repeated measures analysis, and thus evaluated GLM from that perspective. I apologize if I confused any of my fellow students.

Here's a second try, using all the helpful information y'all sent. Again, I would appreciate your corrections if I still have it wrong.

GLM uses least squares with an identity link to fit a generalized linear model. The assumptions are that the dependent variable is normally distibuted (the jury is divided on how and when this assumption can be violated) and that residuals are i.i.d. random variables and sum to zero. GENMOD is also a generalized linear model, which uses maximum likelihood estimation and allows you to specify the distribution of your dv and the link function you want to use. Jon uses it for any type of count data, and I tend to use it if I need to actually calculate a prevalence rate ratio (as opposed to estimating it with proc LOGISTIC).

Repeated measures analysis with GLM is an ANOVA model that expects the dv to be continuous and normally distributed, the predictors to be categorical and requires that the cluster sizes be equal. It is computationally the least expensive, and what you want to use when the above requirements are met (which has never happened for me, but I can dream). GLM has a random statement, but you really shouldn't use it if you have important random effects (see below).

In GENMOD, the repeated statement invokes a marginal generalized linear model (aka GEE, generalized estimating equations) that treats within subject effects as nuisance parameters. You are not required to have a balanced design, and this is theoretically the best model to use if you have a few repeated measures with a large number of subjects . GENMOD cannot model random effects.

If you have random effects, you need to use proc MIXED, which also requires a continuous, normal dv. If this isn't true in your case, but you do have random effects, you can model them with the GLIMMIX macro, which invokes proc MIXED. You also want to use proc MIXED or GLIMMIX if you have repeated measures with a small sample size or a large number of observations relative to the number of experimental units (subjects), or if you're specifically interested in testing within subject (repeated statment) or between subject (random statement) effects.

M. Louise Lawson, MPH Progammer/Analyst II University of Alabama at Birmingham Department of Epidemiology Mortimer Jordan Hall, Room 108 Birmingham, AL 35294-2010 (205)975-5716


Back to: Top of message | Previous page | Main SAS-L page