| Date: | Thu, 17 Nov 2011 09:09:14 -0600 |
| Reply-To: | "Swank, Paul R" <Paul.R.Swank@uth.tmc.edu> |
| Sender: | "SPSSX(r) Discussion" <SPSSX-L@LISTSERV.UGA.EDU> |
| From: | "Swank, Paul R" <Paul.R.Swank@uth.tmc.edu> |
| Subject: | Re: Cluster analysis |
|
| In-Reply-To: | <C4D92F04EFE1174DA9640FEEB18C06BC0E9EBF96@HAEMAIL01.melbourne.vic.gov.au> |
| Content-Type: | multipart/alternative;
|
|---|
Squared Euclidean distance doesn't make much sense for categorical variables. It seems to me that your variables are at least ordered but that not sure should be between yes and no.
Dr. Paul R. Swank,
Children's Learning Institute
Professor, Department of Pediatrics, Medical School
Adjunct Professor, School of Public Health
University of Texas Health Science Center-Houston
From: SPSSX(r) Discussion [mailto:SPSSX-L@LISTSERV.UGA.EDU] On Behalf Of Boreak Silk
Sent: Thursday, November 17, 2011 12:04 AM
To: SPSSX-L@LISTSERV.UGA.EDU
Subject: Cluster analysis
Hi Listers,
I am trying to run a cluster analysis of 90+ records data set with 10 variables. All variables are
categorical:
1- Yes
2- No
88 - Not sure
I've used Hierarchical Cluster Analysis with Between-Groups Linkage method and Squared Euclidean Distance.
I got only two clusters, see below frequency results.
CLU9_1 Average Linkage (Between Groups)
$B!! (B
Frequency
Percent
Valid Percent
Cumulative Percent
Valid
1
5
5.4
5.8
5.8
2
48
52.2
55.8
61.6
3
1
1.1
1.2
62.8
4
5
5.4
5.8
68.6
5
2
2.2
2.3
70.9
6
1
1.1
1.2
72.1
7
1
1.1
1.2
73.3
8
22
23.9
25.6
98.8
9
1
1.1
1.2
100.0
Total
86
93.5
100.0
$B!! (B
Missing
System
6
6.5
$B!! (B
$B!! (B
Total
92
100.0
$B!! (B
$B!! (B
Is there any way to increase the number of clusters?
Thank in advance.
Boreak
This email is intended solely for the named addressee.
If you are not the addressee indicated please delete it immediately.
[text/html]
|