Date: Fri, 18 Apr 2003 10:46:47 -0600
Reply-To: Jack Hamilton <JackHamilton@FIRSTHEALTH.COM>
Sender: "SAS(r) Discussion" <SAS-L@LISTSERV.UGA.EDU>
From: Jack Hamilton <JackHamilton@FIRSTHEALTH.COM>
Subject: Re: suggest metadata N of by-vars
Content-Type: text/plain; charset=us-ascii
Yes, having an "is unique on" report would be useful when exploring
data.
I think that "validated" means "SAS has sorted it"; if you set the sort
order with the SORTEDBY= data set option, validated will be NO.
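A minimal sketch of how one might check this, assuming a toy data set
WORK.A with one numeric variable id (the names mirror the CONTENTS output
quoted below; WORK.B is made up for the comparison):

```sas
/* Toy data, assumed for illustration only */
data work.a;
  input id;
  datalines;
3
1
2
;
run;

/* SAS performs the sort itself */
proc sort data=work.a;
  by id;
run;

/* Here we only assert the order with the SORTEDBY= data set option */
data work.b (sortedby=id);
  set work.a;
run;

/* Compare the Sort Information section of the two listings */
proc contents data=work.a; run;
proc contents data=work.b; run;
```

If the guess above is right, Sort Information for WORK.A should show
Validated YES and for WORK.B Validated NO.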
--
JackHamilton@FirstHealth.com
Manager, Technical Development
Metrics Department, First Health
West Sacramento, California USA
>>> "Fehd, Ronald J. (PHPPO)" <rjf2@CDC.GOV> 04/18/2003 7:17 AM >>>
> From: diskin.dennis@kendle.com [mailto:diskin.dennis@kendle.com]
> Are you suggesting a ballot item?
yes
> It sounds doable: a new column in the dictionary plus a mod to
> the sort (would require corresponding mods to any optional
> sorts, e.g. syncsort, tagsort). The value would have to be missing
> when a data set was not produced by sorting.
I agree.
> I'm not sure how
> much interest you could generate since I do not think most
> sorts are on UNIQUE keys and it's simple enough to do a
> unique count via data step or SQL.
YeahBut
since there is some data coming out of SORT usage
I figure this would be 'easy' to add a counter
It may be simple enough
but once we get past several zeros worth of rows
it is expensive
having this var
would promote proactive (macro) programming.
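For reference, the SQL count mentioned above might look like the sketch
below. It assumes the data set WORK.A and the single by-variable id from
the CONTENTS listing further down; the macro variable names n_rows and
n_keys are made up for illustration. It reads every row, which is the
cost a sort-time counter would avoid.

```sas
proc sql noprint;
  /* One full pass over the data: total rows vs. distinct key values. */
  /* COUNT(DISTINCT id) does not count missing values of id.          */
  select count(*), count(distinct id)
    into :n_rows, :n_keys
    from work.a;
quit;

/* Re-assigning with %let strips the leading blanks that INTO leaves */
%let n_rows = &n_rows;
%let n_keys = &n_keys;

%put NOTE: WORK.A rows=&n_rows distinct id=&n_keys;
```

With no missing keys, the data set is unique on id when the two counts
are equal.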
while we're at it
how about adding a message from sort:
NOTE: There were 3 observations read from the data set WORK.A.
NOTE: The data set WORK.A has 3 observations and 1 variables.
newNOTE: The data set WORK.A is unique on by_Var(s) <VarName(s)>;
NOTE: PROCEDURE SORT used (Total process time):
Here is the relevant CONTENTS info
The CONTENTS Procedure

Data Set Name    WORK.A     Observations    3
...
Data Set Type               Sorted          YES   <---<<<

Alphabetic List of Variables and Attributes

#    Variable    Type    Len
1    id          Num       8

Sort Information

Sortedby     id    <---<<<
Validated    YES
hmmm, what does 'Validated' mean?
newSort Information
N_ByVars 3
Ron Fehd the macro maven CDC Atlanta GA USA RJF2@cdc.gov