Chapter 18

File Name = psscls1.sps

* Open the data file (one row per person).
get file="E:\rdda\pssstf16.sav".

* Cluster the people (cases) on the 27 items, using between-groups
* average linkage on squared Euclidean distances.
CLUSTER tense satisfie easygoin caring good friendly confiden suspicio lazy
    forced busy ordered whattodo share goingon think worktoge selfeste learn
    joke comeandg personal goodtime thingsge easyfiti conflict peopleli
  /METHOD BAVERAGE
  /MEASURE=SEUCLID
  /ID=personra
  /PRINT SCHEDULE CLUSTER(2)
  /PRINT DISTANCE
  /PLOT DENDROGRAM HICICLE.

The above file was generated with the following clicks:

Click Analyze

Click Classify

Click Hierarchical Cluster

Select ID

Click right delta for Label Cases By:

Select Variables to use for clustering

Click right delta for Variables

Click Statistics

Select Agglomeration

Select Proximity Matrix

Select Single Solution and 2 clusters

Click Continue

Click Plots

Select Dendrogram

Select Horizontal

Click Continue

Click Method

Click OK

Variables (columns of the data file):

PERSONRA, TENSE, SATISFIED, EASYGOING, CARING, GOOD, FRIENDLY, CONFIDENT, SUSPICIOUS, LAZY, FORCED, BUSY, ORDERED, WHATTODO, GOINGON, THINK, WORKTOGETH, SELFESTEEM, LEARN, JOKE, COMEANDGO, PERSONAL, GOODTIME, THINGSGET, EASYFITIN, CONFLICT, PEOPLELIKE

Cases (rows, identified by PERSONRA):

Barb, John, Leona, Leslie, Nolita, Reece, Ruth, Sue, Couns

Cluster

Average Linkage (Between Groups)

Dendrogram

* * * * * * H I E R A R C H I C A L  C L U S T E R  A N A L Y S I S * * * * * *

Dendrogram using Average Linkage (Between Groups)

Rescaled Distance Cluster Combine

  C A S E      0         5        10        15        20        25
Label     Num  +---------+---------+---------+---------+---------+
Nolita      5  -+-----------------------+
Couns       9  -+                       +-----+
Ruth        7  -----------+-------------+     +-----------------+
Sue         8  -----------+                   |                 |
John        2  -+-----------------------------+                 |
Reece       6  -+                                               |
Barb        1  ---------+---------+                             |
Leslie      4  ---------+         +-----------------------------+
Leona       3  -------------------+

The next analysis requests four clusters.

Transposing a File

Click on Data; Click on Transpose; Click on PERSONRA; Click on delta to Name Variable; Select the remaining variables; Click on delta to Variables; Click OK. Save the transposed file as pssstf18.sav.
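What the Transpose step does to the file can be sketched outside SPSS. The following is a minimal Python illustration, not the SPSS implementation: the three people come from the file above, but the two items shown and all of the scores are invented for the example, and `transpose` is a hypothetical helper name. It mirrors the dialog choices: the values of the name variable become the new variable names, and the old variable names are kept in a CASE_LBL column.

```python
# Hypothetical three-person, two-item extract; the scores are invented.
rows = [
    {"personra": "Barb", "tense": 2, "caring": 5},
    {"personra": "John", "tense": 4, "caring": 3},
    {"personra": "Leona", "tense": 1, "caring": 4},
]


def transpose(rows, name_var):
    """Swap rows and columns, naming the new columns from `name_var`.

    The old variable names are stored in a CASE_LBL column, which is the
    variable a later CLUSTER command can use on its /ID subcommand.
    """
    new_names = [row[name_var] for row in rows]
    old_vars = [key for key in rows[0] if key != name_var]
    transposed = []
    for var in old_vars:
        new_row = {"CASE_LBL": var}
        for name, row in zip(new_names, rows):
            new_row[name] = row[var]
        transposed.append(new_row)
    return transposed


items = transpose(rows, "personra")
for item in items:
    print(item)
```

After the transpose each item is a case and each person is a variable, so a cluster analysis of the new file groups items rather than people.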

Variables (columns of the transposed file):

ITEM, BARBE, JOHN, LEONA, LESLIE, NOLITA, REECE, RUTH, SUE, COUNS

Cases (rows, one per item):

TENSE, SATIS, EASY, CARE, GOOD, FRIEND, CONFI, SUSP, LAZY, FORCED, BUSY, ORDER, WHATDO, GOON, THINK, WORKT, SELFE, LEARN, JOKE, COMEGO, PERSONAL, TOODT, THINGSD, EASYF, CONFLCT, PEOPLL

File Name = psscls3.sps

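* Open the transposed file (one row per item, one column per person).
get file = '\rdda\pssstf18.sav'.

* Cluster the items and print the three-cluster membership.
cluster barb to couns
  /id=case_lbl
  /print=distance
  /print=schedule cluster(3)
  /plot=dendrogram hicicle.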

Cluster

>Warning # 708 in column 18. Text: PEOPLELIA

>A variable name is more than 8 characters long. Only the first 8

>characters will be used.

Average Linkage (Between Groups)

Dendrogram

Dendrogram using Average Linkage (Between Groups)

Rescaled Distance Cluster Combine

  C A S E      0         5        10        15        20        25
Label     Num  +---------+---------+---------+---------+---------+
SHARE      14  -+
WORKTOGE   17  -+
SELFESTE   18  -+
GOOD        5  -+-+
PEOPLELI   27  -+ |
FRIENDLY    6  -+ |
GOINGON    15  -+ |
CONFIDEN    7  -+-+
THINGSGE   24  -+ |
WHATTODO   13  -+-+-+
EASYFITI   25  -+ | |
THINK      16  -+-+ +-+
LEARN      19  -+   | |
SATISFIE    2  -+---+ +-----+
BUSY       11  -+     |     |
CARING      4  -------+     |
JOKE       20  -+           +-----------------------------------+
GOODTIME   23  -+-+         |                                   |
EASYGOIN    3  -+ +-+       |                                   |
COMEANDG   21  ---+ +---+   |                                   |
PERSONAL   22  -----+   +---+                                   |
CONFLICT   26  ---------+                                       |
SUSPICIO    8  -+-----+                                         |
LAZY        9  -+     +-----------------------------------------+
FORCED     10  -+-+   |
ORDERED    12  -+ +---+
TENSE       1  ---+

The purpose of this next section is twofold: (1) to demonstrate another way of using these statistics and (2) to compare the methods with one another.

This section shows the relationships between correlation (and factor analysis) and cluster analysis. In this example, 4 people have taken 4 tests (tests are like variables). The data are as follows:

CLSDAT1.TXT

"PER1",2,3,5,2

"PER2",3,2,6,3

"PER3",2,3,5,3

"PER4",3,2,6,2

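The link between correlation and clustering can be checked by hand on these four people. The sketch below is a minimal Python illustration rather than the SPSS procedure: it assumes squared Euclidean distance and between-groups average linkage (the settings used in the CLUSTER runs earlier in the chapter), and the helper names `sq_euclid`, `pearson` and `average_linkage` are introduced here only for illustration. Ties are broken by taking the first pair found, which may differ from how a statistics package resolves them.

```python
from itertools import combinations
from math import sqrt

# Data from CLSDAT1.TXT: four people, four test scores each.
people = {
    "PER1": [2, 3, 5, 2],
    "PER2": [3, 2, 6, 3],
    "PER3": [2, 3, 5, 3],
    "PER4": [3, 2, 6, 2],
}


def sq_euclid(x, y):
    """Squared Euclidean distance between two score profiles."""
    return sum((a - b) ** 2 for a, b in zip(x, y))


def pearson(x, y):
    """Pearson correlation between two score profiles."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / sqrt(sxx * syy)


def average_linkage(data):
    """Merge clusters step by step using between-groups average linkage.

    Returns a list of (members_a, members_b, distance) merges.
    """
    clusters = [[name] for name in data]
    schedule = []
    while len(clusters) > 1:
        best = None
        for a, b in combinations(clusters, 2):
            # Average of all person-to-person distances across the two groups.
            d = sum(sq_euclid(data[p], data[q]) for p in a for q in b)
            d /= len(a) * len(b)
            if best is None or d < best[2]:
                best = (a, b, d)
        a, b, d = best
        schedule.append((tuple(a), tuple(b), d))
        clusters = [c for c in clusters if c is not a and c is not b]
        clusters.append(a + b)
    return schedule


# Pairwise distance and correlation for every pair of people.
for p, q in combinations(people, 2):
    print(p, q, sq_euclid(people[p], people[q]),
          round(pearson(people[p], people[q]), 3))

schedule = average_linkage(people)
for step in schedule:
    print(step)
```

In this small data set the two pairs with the smallest distance (PER1 with PER3, and PER2 with PER4) are also the two pairs with the highest correlation, and they are the first to be joined.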
The data are presented graphically:

┌────────────────────────────────────────────────────────────────────────────┐
│                                PERFAC5.LIS                                 │
├────────────────────────────────────────────────────────────────────────────┤
│Final Statistics:                                                           │
│                                                                            │
│Variable  Communality  *  Factor  Eigenvalue  Pct of Var  Cum Pct           │
│                       *                                                    │
│DRIVE          .54238  *       1     6.98937        30.4     30.4           │
│GOAL           .50485  *       2     2.15730         9.4     39.8           │
│HEDON          .54444  *       3     1.72904         7.5     47.3           │
│COG            .56063  *       4     1.47348         6.4     53.7           │
│VALUE          .66169  *       5     1.32890         5.8     59.5           │
│ACTIVE         .70979  *                                                    │
│EARLY          .58670  *                                                    │
│IMPOSE         .64661  *                                                    │
│LEARN          .58716  *                                                    │
│GOOD           .51995  *                                                    │
│HERED          .58137  *                                                    │
│CONSCI         .64024  *                                                    │
│UNCONS         .68112  *                                                    │
│SOCIAL         .61566  *                                                    │
│PERCEP         .61891  *                                                    │
│INFLU          .59501  *                                                    │
│TIME           .58200  *                                                    │
│DATA           .56921  *                                                    │
│PARSI          .60125  *                                                    │
│FREE           .61128  *                                                    │
│THERA          .64608  *                                                    │
│PATH           .52881  *                                                    │
│AGREE          .54294  *                                                    │
│                                                                            │
│Rotated Factor Matrix:                                                      │
│                                                                            │
│         FACTOR 1   FACTOR 2   FACTOR 3   FACTOR 4   FACTOR 5               │
│                                                                            │
│DRIVE     -.67035**  -.10424    -.12588    -.21679     .13893               │
│GOAL       .44300     .44580*    .16215     .17344     .23128               │
│HEDON     -.72226**  -.01600     .14498     .01324     .03653               │
│COG        .50422*    .28914     .40228     .23887    -.06251               │
│VALUE      .15529     .79294**  -.08091    -.04701     .00768               │
│ACTIVE     .58000**   .41073     .21364     .39876    -.00630               │
│EARLY     -.69231**   .27344    -.13863     .07009     .09220               │
│IMPOSE     .22239     .23344    -.10607     .72878**   .01706               │
│LEARN      .02767     .45879     .49137*    .21355    -.29809               │
│GOOD       .57563**   .41920     .00750    -.00350     .11316               │
│HERED      .10169     .28821    -.34325    -.60077**  -.09606               │
│CONSCI     .55734**   .40202     .29750     .26467    -.09712               │
│UNCONS    -.48833*   -.19803    -.48205    -.38498     .15119               │
│SOCIAL    -.05895     .71266**   .26140     .18852     .02080               │
│PERCEP     .29944     .16227    -.10921     .69839**   .05684               │
│INFLU     -.10405     .01029     .21463    -.17453     .71242**             │
│TIME       .72841**   .04942     .11085     .18045    -.06419               │
│DATA       .29151     .04499     .63344**  -.20723     .19498               │
│PARSI      .05321     .06473     .76207**   .00803     .11581               │
│FREE       .51295*    .32588     .28510     .39853     .04315               │
│THERA     -.13541    -.11914    -.26013     .24730     .69622**             │
│PATH      -.51195*   -.08011    -.40072    -.11859     .29269               │
│AGREE     -.01068     .34696     .25436     .21198     .55930**             │
└────────────────────────────────────────────────────────────────────────────┘
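Two columns of the frame can be reproduced by hand arithmetic, which helps in reading the output. The Python sketch below is a check rather than a factor analysis. It assumes the 23 variables were standardized, so that the total variance equals the number of variables. Under that assumption, each factor's percentage of variance is its eigenvalue divided by 23, and a variable's communality is the sum of its squared loadings on the five retained factors.

```python
# Eigenvalues of the five retained factors, copied from PERFAC5.LIS.
eigenvalues = [6.98937, 2.15730, 1.72904, 1.47348, 1.32890]
n_variables = 23  # number of items listed under "Variable"

# Percentage of total variance per factor (assumes standardized variables).
pct = [100 * e / n_variables for e in eigenvalues]

# Running total gives the "Cum Pct" column.
cum = []
running = 0.0
for p in pct:
    running += p
    cum.append(running)

for i, (e, p, c) in enumerate(zip(eigenvalues, pct, cum), start=1):
    print(f"Factor {i}: eigenvalue={e:.5f}  pct={p:.1f}  cum={c:.1f}")

# Rotated loadings for DRIVE on factors 1 to 5, copied from the frame.
drive_loadings = [-0.67035, -0.10424, -0.12588, -0.21679, 0.13893]
drive_communality = sum(loading ** 2 for loading in drive_loadings)
print(f"DRIVE communality = {drive_communality:.5f}")
```

The printed percentages agree with the Pct of Var and Cum Pct columns to one decimal place, and the DRIVE value agrees with its listed communality of .54238.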

We were somewhat arbitrary in selecting 5 factors for this solution; the number was chosen to match the five-cluster solution in the cluster analysis that follows. One should not be so casual in determining the number of factors in a solution; the reader is referred to chapter __ for ways to test for the number of factors. In developing theory, the researcher may settle on the number in an armchair fashion, by reviewing the literature, or through exploratory factor analysis. The major purpose here is to compare factor analysis with cluster analysis, so the number of factors was chosen with that purpose in mind.

The next example shows how cluster analysis can be used to group the same set of data. The data need to be conditioned before the cluster analysis can be run. First, the mean of each item is computed within each theorist. For example, the scores on the first item, DRIVE, for all respondents to Freud were summed and divided by the number of respondents (the result was also rounded to the nearest integer to keep it on the same scale). The matrix was then transposed because the cluster program groups cases (rows), so the items to be clustered must be the rows of the file. These data are presented in the frame THER11.sav.
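The conditioning steps can be sketched in a few lines of Python. This is an illustration under stated assumptions, not the procedure actually used to build THER11.sav. The respondent records and scores below are invented. Ties at .5 are assumed to round upward. Every respondent is assumed to have answered every item. `item_means_by_theorist` and `round_half_up` are hypothetical helper names.

```python
from collections import defaultdict

# Hypothetical respondent records: (theorist, {item: score}).
# Only the theorist and item names come from the text; the scores are invented.
responses = [
    ("FREUD", {"DRIVE": 7, "GOAL": 2}),
    ("FREUD", {"DRIVE": 6, "GOAL": 3}),
    ("FREUD", {"DRIVE": 6, "GOAL": 2}),
    ("ADLER", {"DRIVE": 3, "GOAL": 6}),
    ("ADLER", {"DRIVE": 4, "GOAL": 7}),
]


def round_half_up(x):
    """Round a non-negative number to the nearest integer, halves upward."""
    return int(x + 0.5)


def item_means_by_theorist(responses):
    """Return {item: {theorist: rounded mean}}, i.e. one row per item.

    Building the result with items as the outer key gives the transposed
    layout directly: items are the cases, theorists are the variables.
    """
    totals = defaultdict(lambda: defaultdict(float))
    counts = defaultdict(int)
    for theorist, scores in responses:
        counts[theorist] += 1
        for item, score in scores.items():
            totals[item][theorist] += score
    return {
        item: {t: round_half_up(s / counts[t]) for t, s in by_theorist.items()}
        for item, by_theorist in totals.items()
    }


table = item_means_by_theorist(responses)
for item, row in table.items():
    print(item, row)
```

Rounding keeps the means on the original integer scale, at the cost of discarding some information; whether that trade is acceptable depends on how coarse the scale is.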

Variables (columns of THER11.sav):

ITEM, FREUD, ADLER, JUNG, ROGERS, KELLY, HORNEY, SULLIVA, BANDURA, CATTELL, MASLOW, BINSWAN, ERIKSON

Cases (rows, one per item):

DRIVE, GOAL, HEDON, COG, VALUE, ACTIVE, EARLY, IMPOSE, LEARN, GOOD, HERED, CONSCI, UNCONS, SOCIAL, PERCEP, INFLU, TIME, DATA, PARSI, FREE, THERA, PATH, AGREE

File Name = percls3.sps

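* Open the item-by-theorist file, keeping the item label and the 12 theorists.
get file = '\proeval\ther11.sav'
  /keep=ITEM FREUD ADLER JUNG ROGERS KELLY HORNEY
        SULLIVA BANDURA CATTELL MASLOW BINSWAN ERIKSON.

* Cluster the 23 items and print the five-cluster membership.
cluster freud to erikson
  /id=item
  /print=distance
  /print=schedule cluster(5)
  /plot=dendrogram hicicle.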

* * * * * * H I E R A R C H I C A L  C L U S T E R  A N A L Y S I S * * * * * *

Dendrogram using Average Linkage (Between Groups)

Rescaled Distance Cluster Combine

  C A S E      0         5        10        15        20        25
Label     Num  +---------+---------+---------+---------+---------+
IMPOSE      8  -+-+
PERCEP     15  -+ |
GOAL        2  ---+-+
COG         4  -+ | |
CONSCI     12  -+-+ +---+
ACTIVE      6  -+   |   +-+
FREE       20  -----+   | |
VALUE       5  -------+-+ +---------+
GOOD       10  -------+   |         |
LEARN       9  -+-----+   |         |
SOCIAL     14  -+     +---+         +-----------+
PARSI      19  -+-----+             |           |
AGREE      23  -+                   |           |
INFLU      16  -------------+       |           +---------------+
THERA      21  -------------+-------+           |               |
EARLY       7  -------------+                   |               |
HERED      11  ---------------+---+             |               |
TIME       17  ---------------+   +-------------+               |
DATA       18  -------------------+                             |
DRIVE       1  -+-------------+                                 |
HEDON       3  -+             +---------------------------------+
UNCONS     13  -----+---------+
PATH       22  -----+

If five clusters are chosen (to be comparable to the five-factor solution above), they are as follows: