Hi...a number of other people have offered excellent suggestions to
solve Kathleen's problems.
PROC MEANS is an extremely powerful and versitile tool that can
calcuate a total of 31 different srptatisitcs, present the results in
your output window and/or create data sets. I think that the more a
SAS user knows about PROC MEANS the more reasons they will find to use
in in their projects and programs.
I have two user group papers on PROC MEANS (among the most popular I
present) available as PDFs for free download from my company website
at www.sierrainformation.com You can access them from the "User Group
Presentations" link on the site home page.
Sierra Information Services
On May 15, 11:06�am, k.askl...@GMAIL.COM (Kathleen Askland) wrote:
> I am hoping to get some advice about SAS code to solve the following problem.
> I have a dataset for which there are multiple values for each of the primary
> identifiers (which are genes). �For example, my dataset contains the following:
> gene � �score
> gene1 � 3.5
> gene1 � 4.2
> gene1 � 7.1
> gene2 � 3.7
> gene2 � 5.6
> gene3 � 7.2
> gene4 � 8.7
> gene4 � 1.2
> There are approximately 30,000 genes represented this way in the dataset. �I
> would like to have SAS provide me with an average of the scores for each
> gene (and preferably create a new dataset that contains only one row of data
> for each gene: the gene and its average score). �Also, I have other columns
> of data that are repeated by gene (e.g., gene1 always associated with a
> particular molecular function term, but different from that for gene2.)
> Any suggestions?