Date: Thu, 8 Feb 2007 18:08:29 -0800
Reply-To: David L Cassell <davidlcassell@MSN.COM>
Sender: "SAS(r) Discussion" <SAS-L@LISTSERV.UGA.EDU>
From: David L Cassell <davidlcassell@MSN.COM>
Subject: Re: Proc Summary Error..
In-Reply-To: <403593359CA56C4CAE1F8F4F00DCFE7D064C6598@MAILBE2.westat.com>
Content-Type: text/plain; format=flowed
michaelraithel@WESTAT.COM replied:
>
>Dear SAS-L-ers,
>
>Replying to queries about his original post, Wensui Liu posted the
>following:
>
> > Thanks for responding
> >
> > The proc summary step is
> >
> > proc summary data = outp nway missing;
> > where '01jan2003'd <= dosdte <= '31dec2003'd;
> > class lobcode category specialty yrqrt claimcod plan prv_stat
> > prvcnty memcnty his_par acctuw acctuw2 yrmon prvcode provdx
> > cpt proccode level3 prv_type claimtyp risk_deal servflag prod
> > placecod2 groupcod; var payamt copayamt dedamt coinsamt
> > cobamt reqamt wthldamt allowamt dscntamt procqty; output out
> > = saved.trendoutp1 (drop = _type_ rename = (_freq_ = lines))
> > sum = ; run;
> >
> > the outp dataset has 25,683,373 records which gets summarized
> > down to 23,402,550 records
> >
> > the problem occurs whenever we do not use -sumsize 0,
> > regardless of using -memseize 0
> >
>
>Wensui, you have already received enough advice on this topic to fill a
>SAS conference paper. Speaking of which, I happen to have an old SAS
>conference paper (SUGI 25) on this very topic: Summarizing Impossibly
>Large SAS Data Sets for the Warehouse Server Using Horizontal
>Summarization.
>
>You can find that paper on the SAS web site at:
>
> http://www2.sas.com/proceedings/sugi25/25/sy/25p290.pdf
>
>The paper requires some brute-force pre-programming, and predates SAS
>V7+ advances on the SUMMARY procedure (eg. the TYPES statement), but it
>can still be very useful in the type of situation that you
>described--IFF for some bizarre reason the other suggestions do not work
>out quiet right for you.
>
>Wensui, best of luck in getting all of your data summarized--BY ANY
>MEANS NECESSARY!
>
I think this paper is worth reading, whether the Original Poster solves his
problem via TYPES or not. It's still my favorite paper of SUGI 25.
David
--
David L. Cassell
mathematical statistician
Design Pathways
3115 NW Norwood Pl.
Corvallis OR 97330
_________________________________________________________________
Get in the mood for Valentine's Day. View photos, recipes and more on your
Live.com page.
http://www.live.com/?addTemplate=ValentinesDay&ocid=T001MSN30A0701
|