LISTSERV at the University of Georgia
Menubar Imagemap
Home Browse Manage Request Manuals Register
Previous messageNext messagePrevious in topicNext in topicPrevious by same authorNext by same authorPrevious page (May 2008, week 1)Back to main SAS-L pageJoin or leave SAS-L (or change settings)ReplyPost a new messageSearchProportional fontNon-proportional font
Date:         Fri, 2 May 2008 14:58:59 -0400
Reply-To:     Susan Durham <sdurham@BIOLOGY.USU.EDU>
Sender:       "SAS(r) Discussion" <SAS-L@LISTSERV.UGA.EDU>
From:         Susan Durham <sdurham@BIOLOGY.USU.EDU>
Subject:      Re: Hierarchical Models--Centering predictors--WHY
Comments: To: Steve Denham <stevedrd@YAHOO.COM>

Another advantage of centering when the covariate is involved in interactions (whether with itself, as a quadratic term, or with another explanatory variable) lies in the interpretation of the overall tests of significance.

For example, in Steve's model

MODEL daily_yield=treatment|group covariate covariate*treatment;

the overall test of TREATMENT (from the MIXED procedure I know, and I think for many other SAS procedures) compares treatment means estimated at covariate=0. If a zero value for the covariate is far from the mean of the covariate and if COVARIATE*TREATMENT is not significant, this test can be very misleading because the comparison is extrapolated beyond the observed range of the covariate and the regression lines for each level of treatment are not exactly parallel. The test isn't wrong, it's just testing a hypothesis that might be different than what many users (like *my* biologists) expect.

(If COVARIATE*TREATMENT is significant, then we wouldn't be much interested in the overall test of TREATMENT because treatment differences would vary depending on the value of COVARIATE.)

This lack of scale invariance is clearly discussed in

LS Aiken & SG West, 1991, Multiple Regression: Testing and Interpreting Interactions, Sage Publications.

I work with biologists, too, and although they must "suffer" through the social science framework of this book, they still manage to glean the information--plus we draw lots of pictures to illustrate the problem.

Cheers, Susan

On Fri, 2 May 2008 11:33:34 -0700, Steve Denham <stevedrd@YAHOO.COM> wrote:

>Warren, > >Give it a try with a dataset. For group 1, let the covariate range from 55 to 85, and for group 2 from 80 to 120 (those are kind of typical pounds of milk per day numbers in a pretreatment period). Now fit a model that looks like (all random stuff left out): > >MODEL daily_yield=treatment|group covariate; > >and look at the lsmeans. All of the group 1 are adjusted upward, all of group 2 downward, so that it would seem that adjusted means for heifers are greater than those for mature cows. > >Now change the covariate to the deviation from the group mean (centering on the group mean) and fit > >MODEL daily_yield=treatment|group deviate_covariate; > >and see what the results look like. Treatment differences remain constant, but the adjusted means now do not seem paradoxical (mature cows>heifers). > >On to the last case, where the heterogeneous slopes model is fit-- > >MODEL daily_yield=treatment|group covariate covariate*treatment; > >When you write the LSMEANS @<value> to explain heterogeneity, it's quickly apparent that for a fixed pretreatment yield (say 82), groups will be adjusted in opposite directions. By centering, we can easily look at values that are 15 pounds below the contemporary group or 15 pounds above the contemporary group. > >I never learned this stuff until I started working with animal geneticists, who in the animal science field, seem to express everything as a deviation from the contemporary group mean. > >There is a carry over advantage to expressing covariates as a deviation in matrix operations as well. Think about inverting a big old sparse matrix with ones and zeros everywhere except one column with values two or three orders of magnitude larger. Compare the results to the matrix where that column sums to zero (at least across factors of interest). > >Steve Denham >Associate Director, Biostatistics >MPI Research, Inc. > >----- Original Message ---- >From: Warren Schlechte <Warren.Schlechte@TPWD.STATE.TX.US> >To: SAS-L@LISTSERV.UGA.EDU >Sent: Friday, May 2, 2008 11:26:13 AM >Subject: Re: Hierarchical Models--Centering predictors--WHY > >Steve, > >I don't follow, why would the LSMeans be higher without centering? > >Are you saying the treatment effect looks like it is doing more in first >lactation cows because the cows are growing, and naturally producing >more milk? If so, I don't see how centering helps. > >I am curious. I too work with biologists, and prefer not to be laughed >out of the room. > >Warren Schlechte > > > -----Original Message----- > From: Steve Denham [mailto:stevedrd@YAHOO.COM] > Sent: Thursday, May 01, 2008 12:36 PM > Subject: Re: Hierarchical Models--Centering predictors--WHY > > Centering is good, standardizing not necessarily--as it changes >the interpretation of any parameters. Peter hit the nail on the head >regarding centering with the comment about quadratics. Here is another >example: > > The best example I know of with centering is with regard to the >analysis of milk production by dairy cows. > > First lactation animals are still growing and have a lower >production than older cows. > > Now suppose you are running a trial and you have a pretreatment >milk yield as a covariate. If you have lactation number (=1 for first >lactation, =2 for all subsequent) in the model, and do not center the >pretreatment yield on the mean for that lactation, you rapidly will find >paradoxical (in the biological sense) effects: > > LSmeans for the first lactation animals will be greater than >LSmeans for subsequent lactation animals. > > Treatment differences remain constant, provided you do not have >treatment by covariate interaction included. If that particular >interaction is included (heterogeneous slopes model) it is vitally >important that you center the covariate. Failure to do so will almost >certainly result in something that will cause the biologist you are >consulting for to laugh you out of the room (does this sound like >personal experience? Could be...) > > Steve Denham > Associate Director, Biostatistics > MPI Research, Inc. > > > ----- Original Message ---- > From: Citam <citam.sasl@GMAIL.COM> > To: SAS-L@LISTSERV.UGA.EDU > Sent: Thursday, May 1, 2008 12:43:30 PM > Subject: Re: Hierarchical Models--Centering predictors--WHY > > On 5/1/08, Tom White <tw2@mail.com> wrote: > > Hi Peter and thanks for this response. > > > > (1) > > So, then: If I don't care about easier interpretation or any >interpretation at all, > > I only care about how well the model fits (and >predicts) my data, > > then, I'll just don't mess with centering. > > Am I right about this? > > Because I really won't bother about centering my >vars. > > Correct, adding a constant will not affect the variance. >However, you may later want to interpret the results. As precedence, >you may wish to make the extra effort, if it suits your purpose. > > > > (2) > > Do you know how centering is done? > > Consider the STANDARD procedure, available in BASE for one. You >could also use the MEANS procedure and a datastep. > > "The STANDARD procedure standardizes variables in a SAS data set >to a given mean and standard deviation, and it creates a new SAS data >set containing the standardized values." > > > >________________________________________________________________________ >____________ > Be a better friend, newshound, and > know-it-all with Yahoo! Mobile. Try it now. >http://mobile.yahoo.com/;_ylt=Ahu06i62sR8HDtDypao8Wcj9tAcJ > > > ____________________________________________________________________________________ >Be a better friend, newshound, and >know-it-all with Yahoo! Mobile. Try it now. http://mobile.yahoo.com/;_ylt=Ahu06i62sR8HDtDypao8Wcj9tAcJ


Back to: Top of message | Previous page | Main SAS-L page