Date: Mon, 28 Jan 2008 13:46:03 -0500
Reply-To: Wensui Liu <liuwensui@GMAIL.COM>
Sender: "SAS(r) Discussion" <SAS-L@LISTSERV.UGA.EDU>
From: Wensui Liu <liuwensui@GMAIL.COM>
Subject: Re: Efficiency with large datasets
In-Reply-To: <479e2059$1_2@mk-nntp-2.news.uk.tiscali.com>
Content-Type: text/plain; charset=ISO-8859-1
Hi, JB,
I have a macro to do the same thing you want to do but can't show you here.
however, the idea is very simple. i am using a macro to loop through
each ind variable and using data step instead of proc freq to
calculate the percent and count.
for thousands of variables, it still takes at least a day to run on unix.
On Jan 28, 2008 1:35 PM, JB <john_banana@yahoo.com> wrote:
> I have a data set with a target variable and 45000 other variables.
> I wish to produce an Information Value for each variable.
>
> In order to cut the processing time down I have cut the number of records
> down to 2000 but each run through the data to produce a crosstab still takes
> ages.
>
>
> proc freq data = tran.var_deriv_stg3 noprint;
> tables &tvar1 * target
> /missing norow nocol nopercent nocum
> out = temp1(keep = &tvar1 target count);
> run;
>
>
> Is there any way I can produce these xtabs without the whole file being
> processed each time?
>
> Thanks
>
--
===============================
WenSui Liu
Statistical Project Manager
ChoicePoint Precision Marketing
(http://spaces.msn.com/statcompute/blog)
===============================
|