Validation of a Procedure for Clustering Expressions of Frequency and Amount
William J. Phalen
Institute for Job & Occupational Analysis
San Antonio, TX, USA
Robert M. Yadrick
Technical Training Research Division
Armstrong Laboratory Human Resources Directorate
Walter G. Albert
Manpower and Personnel Research Division
Armstrong Laboratory Human Resources Directorate
This paper is the third in a series of reports concerned with the scaling of words and phrases expressing qualitative levels of "frequency" and "amount." The initial paper by Yadrick, Phalen, Albert, Dittmar, Weissmuller, and Hand (1993) replicated a study by Bass, Cascio, and O'Connor (1974) which used a magnitude estimation procedure to scale 39 expressions of frequency and 44 expressions of amount. This study, however, used Air Force basic trainees to provide the estimates. The resultant scaling of expressions was quite similar to that of Bass et al. (1974), even though the rater populations were quite different. This suggested that there are commonly-shared perceptions regarding the weightiness of such expressions, independent of the samples which provided the estimates. Nine expressions of amount were selected to constitute an equal interval scale, based on their magnitude-estimated weights. These were tested in 40 computer administered occupational surveys to 160 cases and were found to produce more valid and reliable results than the traditional nine-point relative time spent scale, as reported in Albert, Phalen, Selander, Dittmar, Tucker, Hand, Weissmuller, and Rouse, (1994).
The next study by Yadrick, Phalen, Albert, Dittmar, Weissmuller, and Hand (1994) described the application of a new, univariate procedure for clustering the weighted expressions into groups of equivalent (or synonymous) expressions. It appeared to the researchers that the expressions within each group were sufficiently synonymous with one another that a single expression might be picked from each group to represent an equal-interval scale with the optimal number of points. It remained to be determined whether the divisions or cut points suggested by the groups in the cluster solution had captured the "psychologically" real levels of a frequency or amount scale embedded in the ordered lists of magnitude-estimated expressions. Therefore, a validation study was undertaken to determine the degree of correspondence between mathematically defined cluster groups and perceptually defined, or psychologically real, groups of expressions. This paper will report on the development and application of the criterion measure used to validate the univariate clustering procedure.
The Clustering Procedure
A special measure of "equivalence" was designed in the previous study to describe the similarity of expressions and to cluster them into homogeneous groups along a single dimension ("frequency" or "amount"). Since the dimensional values for each expression were derived by a ratio measurement technique (magnitude estimation), equivalence was computed as a ratio-based measure with exponential magnification of ratio differences to accentuate dividing lines between nonequivalent sets of equivalent expressions. In the basic equation, the pairwise ratios are converted to logarithms and the logarithms are summed algebraically so that positive logs (ratios > 1.0) and negative logs (ratios < 1.0) representing equal ratio differences from log 1.0 = 0 cancel each other (compensatory effect). Thus, two raters who disagree as to which of two expressions is greater or less will negate each other’s estimates. This feature is contrary to standard similarity or overlap measures, which treat all differences as representing dissimilarity (noncompensatory effect). A detailed description of the "equivalence" equation with example computations can be obtained from the senior author.
In this study, the equivalence equation was applied first to determine the equivalence of the top two expressions in the list, i.e., expressions "1" and "2" on the lefthand side of Table 1 for "frequency" and likewise on the lefthand side of Table 2 for "amount". If the equivalence value exceeded 80.00%, then the equivalence of expressions "1" and "3" then "1" and "4," etc., was computed, with expression "1" repeatedly being used as the target expression, until the equivalence value fell below 80.00%. At this point, the set of expressions falling within a minimum linkage of 80.00% was selected as the first group of expressions representing the highest level of the scale. Thus, in Table 1 (lefthand side) expressions "1" through "3" formed the first group of equivalent frequency expressions with a minimum equivalence (between expressions "1" and "3") of 83.81%. The next target was expression "4". Expressions "4" and "5" were compared, "4" and "6," etc. until "4" and "10" yielded the lowest acceptable equivalence of 80.66%. The remaining groups were formed in a similar manner, resulting in 12 "frequency" groups (or scale levels) and 13 "amount" groups (or scale levels). The minimum equivalence values are reported to the right of each group.
Column 1 of Tables 1 and 2 also shows the magnitude-estimated weights for each expression, as derived by Bass et. al. (1974). Although the groups of expressions generated by this clustering procedure seemed to be very reasonable in the judgment of the authors, it could not be assumed that the groupings represented psychologically real divisions without comparing them against a criterion of psychologically-derived groups of expressions.
The Criterion
A survey was developed that contained the ordered lists of frequency and amount expressions (with their weights deleted) in four different versions that presented frequency expressions first, followed by amount expressions, or vice versa, and presented the expressions in high to low order ("Always" to "Never," and "All" to "None"), or vice versa. The instructions asked respondents to follow a procedure that was analogous to what was done in the clustering procedure, but substituting his or her psychological or perceptual estimates of equivalence in place of our mathematical calculations of equivalence. More specifically, each respondent was asked to begin with expression "1" on the list, which was already circled, as the first target expression, and to compare expression "2" with it. If expression "2" appeared to be pretty much equivalent in meaning to expression "1", the respondent would proceed to compare expression "3" to expression "1," then "4" to "1" and so forth, until reaching an expression that did not appear to be reasonably synonymous or equivalent to expression "1." The respondent would then circle this expression as the next target expression against which to compare the expressions following it. This procedure would continue until the entire list was evaluated. The result would be a set of psychologically-derived groups of expressions based on the perceptions of that respondent. In a completed survey, each group of expressions could be identified as beginning with a circled expression and ending with the expression immediately preceding the next circled expression.
A sample of 42 respondents, consisting of behavioral scientists and clerical workers at the Armstrong Laboratory and at three contractor offices provided responses. Approximately equal numbers of each version of the survey were completed to provide the desired counterbalancing. The groups identified by the 42 respondents were consolidated into a matrix whose rows and columns indicated the number of times each expression was selected as a "beginning" or "ending" expression, respectively. Evaluation of the row and column totals made it possible to select the set of groups which provided the best overall fit of the individual respondent data. The resultant sets of psychologically-defined groups based on the perceptual judgments of 42 respondents are shown in the right-hand portion of Table 1 for the "frequency" expressions and in the right-hand portion of Table 2 for the "amount" expressions. The minimum equivalence values for the psychologically defined groups are also reported as additional points of comparison with the mathematically defined groups.
Frequency Results
Table 1 clearly shows a fairly high degree of correspondence between the mathematical versus psychological clustering of expressions of frequency. Three groups are identical, and three additional groups share one boundary in common. Although the mathematical clustering defined 12 groups vs 8 groups for the psychological clustering, three of the mathematical groups are single expressions which abutted sharp changes in the magnitude-estimated weights.
It might be argued that the expressions constituting the singleton groups should be dropped, especially "seldom," which is clearly out of place. This would reduce the number of mathematically defined groups to nine, while the psychologically defined groups would remain at eight. This would also increase the correspondence between the two clusterings. A major point of difference involves expressions "28" through "35" (if "seldom" is eliminated). The mathematical clustering separated this set of expressions into three groups, while the psychological clustering considered the set to be one group. In this case, the psychological grouping appears to make more sense. However, it should be noted that the minimum equivalence for this group (expressions "28" through "35") is only 0.19, which is to say that there is an extremely large ratio difference between a weight of 4.72 for "very seldom" and a weight of .33 for "seldom." If "seldom" is dropped, the minimum equivalence for the group would jump to 48.84, which is still low. Overall, the correspondence between the mathematical and the psychological clustering is reasonably good, especially considering some of the questionable expression weightings.
Amount Results
As is evident from examining Table 2, the correspondence between the mathematical and the psychological clustering is not as clear for expressions of amount. The greatest difference is at the very top, where the first four expressions in the psychological clustering constitute four singleton groups. It appears that the respondents felt that "all" was significantly more inclusive than "an exhaustive amount." However, "almost entirely" is certainly not "exhaustive", and so had to be separated out. Then comes "completely", which seems suspiciously like "all" (if adverbs can be like adjectives), and so "completely" had to be separated from "almost entirely."
We believe that the problem encountered here is one of context. The raters who made the magnitude estimates rated each expression separately, without seeing how the expressions would ultimately be ordered when listed together, whereas the respondents in our study had no choice but to follow the order in which the expressions were listed when they defined the group boundaries for equivalent expressions. Again, the mathematical clustering defined more groups in the mid- and low-ranges than did the psychological clustering. The psychological clustering seems a bit stretched in putting "a lot" in the same group as "a moderate amount", but it is harder to quibble with the psychological group that includes expressions "34" through "43".
Table 1. Mathematical and Psychological Clustering Results for Expressions of Frequency
|
Mathematical Clustering |
Psychological Clustering |
||||||
|
Mag. Est. Weight |
# |
Frequency Expression |
Min. Equiv. |
# |
Frequency Expression |
Min. Equiv. |
|
|
58.01 |
1 |
Always |
1 |
Always |
|||
|
50.16 |
2 |
Continually |
2 |
Continually |
|||
|
49.70 |
3 |
Constantly |
83.81 |
3 |
Constantly |
83.81 |
|
|
45.24 |
4 |
Frequently if not always |
4 |
Frequently if not always |
|||
|
42.45 |
5 |
Very often |
5 |
Very often |
|||
|
41.37 |
6 |
A great deal of the time |
6 |
A great deal of the time |
|||
|
40.02 |
7 |
Very frequently |
7 |
Very frequently |
|||
|
39.28 |
8 |
A great many times |
8 |
A great many times |
85.35 |
||
|
39.18 |
9 |
Usually |
9 |
Usually |
|||
|
37.64 |
10 |
Often |
80.66 |
10 |
Often |
||
|
36.07 |
11 |
Frequently |
11 |
Frequently |
|||
|
35.39 |
12 |
Quite often |
12 |
Quite often |
|||
|
34.44 |
13 |
Rather frequently |
13 |
Rather frequently |
|||
|
32.97 |
14 |
Commonly |
14 |
Commonly |
|||
|
32.64 |
15 |
Fairly often |
15 |
Fairly often |
|||
|
30.65 |
16 |
Fairly many times |
83.10 |
16 |
Fairly many times |
73.86 |
|
|
19.42 |
17 |
Sometimes |
100.00 |
17 |
Sometimes |
||
|
18.01 |
18 |
Some of the time |
18 |
Some of the time |
|||
|
15.52 |
19 |
To some degree |
19 |
To some degree |
76.95 |
||
|
15.19 |
20 |
Now and then |
20 |
Now and then |
|||
|
14.92 |
21 |
Occasionally |
80.91 |
21 |
Occasionally |
||
|
10.22 |
22 |
Once in a while |
100.00 |
22 |
Once in a while |
58.91 |
|
|
7.78 |
23 |
Not often |
23 |
Not often |
|||
|
7.23 |
24 |
Not very often |
24 |
Not very often |
|||
|
6.99 |
25 |
Fairly infrequently |
25 |
Fairly infrequently |
|||
|
6.47 |
26 |
Infrequently |
26 |
Infrequently |
|||
|
6.42 |
27 |
Rather seldom |
81.94 |
27 |
Rather seldom |
81.94 |
|
|
4.72 |
28 |
Very Seldom |
28 |
Very Seldom |
|||
|
4.56 |
29 |
Rarely |
29 |
Rarely |
|||
|
4.54 |
30 |
Very infrequently |
96.75 |
30 |
Very infrequently |
||
|
3.69 |
31 |
Seldom if ever |
31 |
Seldom if ever |
|||
|
3.47 |
32 |
Hardly at all |
95.08 |
32 |
Hardly at all |
||
|
3.34 |
33 |
Hardly ever |
33 |
Hardly ever |
|||
|
2.99 |
34 |
Very rarely |
34 |
Very rarely |
|||
|
2.63 |
35 |
Almost never |
80.77 |
35 |
Almost never |
||
|
0.33 |
36 |
Seldom |
100.00 |
36 |
Seldom |
0.19 |
|
|
0.17 |
37 |
None of the time |
37 |
None of the time |
|||
|
0.15 |
38 |
Not at all |
38 |
Not at all |
|||
|
0.08 |
39 |
Never |
91.69 |
39 |
Never |
91.69 |
|
Table 2. Mathematical and Psychological Clustering Results for Expressions of Amount
|
Mathematical Clustering |
Psychological Clustering |
||||||
|
Mag. Est. Weight |
# |
Amount Expression |
Min. Equiv. |
# |
Amount Expression |
Min. Equiv. |
|
|
66.12 |
1 |
All |
|
1 |
All |
100.00 |
|
|
59.27 |
2 |
An exhaustive amount |
88.70 |
2 |
An exhaustive amount |
100.00 |
|
|
57.61 |
3 |
Almost entirely |
|
3 |
Almost entirely |
100.00 |
|
|
57.35 |
4 |
Completely |
|
4 |
Completely |
100.00 |
|
|
54.46 |
5 |
An extraordinary amount |
|
5 |
An extraordinary amount |
|
|
|
51.38 |
6 |
Almost completely |
|
6 |
Almost completely |
|
|
|
48.89 |
7 |
An extremely abundant amount |
|
7 |
An extremely abundant amount |
|
|
|
48.20 |
8 |
An extreme amount |
81.18 |
8 |
An extreme amount |
87.37 |
|
|
41.56 |
9 |
A great amount |
|
9 |
A great amount |
|
|
|
41.36 |
10 |
A great deal |
|
10 |
A great deal |
|
|
|
40.59 |
11 |
Very much |
|
11 |
Very much |
|
|
|
40.50 |
12 |
A full amount |
|
12 |
A full amount |
97.45 |
|
|
37.10 |
13 |
A lot |
88.37 |
13 |
A lot |
|
|
|
35.14 |
14 |
Much |
|
14 |
Much |
|
|
|
34.24 |
15 |
Quite a bit |
|
15 |
Quite a bit |
|
|
|
32.65 |
16 |
A good bit |
|
16 |
A good bit |
|
|
|
31.44 |
17 |
A considerable amount |
|
17 |
A considerable amount |
|
|
|
30.04 |
18 |
Pretty much |
83.86 |
18 |
Pretty much |
|
|
|
27.70 |
19 |
Fairly much |
|
19 |
Fairly much |
|
|
|
26.22 |
20 |
An ample amount |
|
20 |
An ample amount |
|
|
|
24.07 |
21 |
An adequate amount |
85.66 |
21 |
An adequate amount |
|
|
|
21.80 |
22 |
A moderate amount |
|
22 |
A moderate amount |
42.40 |
|
|
18.63 |
23 |
Some |
84.04 |
23 |
Some |
|
|
|
13.42 |
24 |
To some extent |
|
24 |
To some extent |
|
|
|
13.10 |
25 |
To some degree |
|
25 |
To some degree |
|
|
|
11.75 |
26 |
Somewhat |
87.00 |
26 |
Somewhat |
51.46 |
|
|
9.57 |
27 |
A limited amount |
100.00 |
27 |
A limited amount |
|
|
|
7.81 |
28 |
A little |
|
28 |
A little |
|
|
|
7.51 |
29 |
A small amount |
|
29 |
A small amount |
|
|
|
7.22 |
30 |
Comparatively little |
|
30 |
Comparatively little |
|
|
|
7.20 |
31 |
A little bit |
|
31 |
A little bit |
|
|
|
7.02 |
32 |
Not much |
90.19 |
32 |
Not much |
|
|
|
5.27 |
33 |
A small degree |
|
33 |
A small degree |
41.46 |
|
|
5.21 |
34 |
Very little |
|
34 |
Very little |
|
|
|
5.09 |
35 |
A slight amount |
|
35 |
A slight amount |
|
|
|
4.28 |
36 |
A meager amount |
81.54 |
36 |
A meager amount |
|
|
|
3.68 |
37 |
A scanty amount |
|
37 |
A scanty amount |
|
|
|
3.64 |
38 |
A minimum amount |
|
38 |
A minimum amount |
|
|
|
3.13 |
39 |
A trifling amount |
|
39 |
A trifling amount |
|
|
|
2.98 |
40 |
Scarcely any |
82.65 |
40 |
Scarcely any |
|
|
|
2.85 |
41 |
A trivial amount |
|
41 |
A trivial amount |
|
|
|
2.48 |
42 |
An insignificant amount |
|
42 |
An insignificant amount |
|
|
|
2.28 |
43 |
Hardly any |
82.86 |
43 |
Hardly any |
29.86 |
|
|
0.15 |
44 |
None |
100.00 |
44 |
None |
100.00 |
|
In the mathematical clustering, the singleton group containing "a limited amount" should perhaps be dropped. It seems to us like another one of those fuzzy expressions, since all amounts other than "all" are "limited" amounts. Where the mathematical and psychological groups do not correspond, we would argue that the mathematical grouping at the upper end of the scaled list is superior, but the psychological grouping at the lower end is superior.
Discussion
It appears to us that the mathematical clustering procedure used in this study does a reasonably good job of clustering expressions of frequency and amount into groups of equivalent expressions representing psychologically real levels of frequency and amount. We plan to conduct a more precise validation study when time permits which will allow us to perform a statistical test of the correspondence between the mathematical and the psychological clustering solutions. This test will consist of a t-test between the mean correlations (as represented by Fisher Z's) of the 42 respondents’ individual groupings with the mathematically-defined groups and the psychologically-defined groups. Our hypothesis is that the mean correlations for the mathematical groups will be lower, but not significantly lower, than the mean correlations for the psychological groups. We might also replicate the study after removing from each list those expressions that appear to us to be ambiguous or debatably-weighted, or we might find or create another set of magnitude-scaled expressions.
References
Albert, W.G., Phalen, W.J., Selander, D.M., Dittmar, M.J., Tucker, D.L., Hand, D.K., Weissmuller, J.J. & Rouse, I.F. (1994, October). Large-scale laboratory test of occupational survey software and scaling procedures. In the symposium, Bennett, W. Jr., Chair, Training needs assessment and occupational measurement: Advances from recent research. Proceedings of the 36th Annual Conference of the International Military Testing Association. Rotterdam, The Netherlands: European Members of the IMTA.
Bass, B.M., Cascio, W.F., & O'Connor, E.J. (1974). Magnitude estimations of expressions of frequency and amount. Journal of Applied Psychology, 59, 313-320.
Yadrick, WM., Phalen, W.J, Albert, W.A., Dittmar, M.J., Weissniuller, J.J., & Hand, D.K. (1994, October). Clustering of magnitude estimations of frequency and amount of time. Proceedings of the 36th Annual Conference of the International Military Testing Association. Rotterdam, The Netherlands: European Members of the IMTA.
Yadrick, R.M., Phalen, W. J., Albert, W.G., Dittmar, M.J., Weissmuller, J.J., & Hand, D.K. (1993, November). Magnitude estimations of frequency and amount of time. Paper presented at the 35th Annual Conference of the International Military Testing Association. Williamsburg, VA: U.S. Coast Guard.