ebook img

ERIC ED407418: Higher-Order Factor Analysis. PDF

22 Pages·0.25 MB·English
by  ERIC
Save to my drive
Quick download
Download
Most books are stored in the elastic cloud where traffic is expensive. For this reason, we have a limit on daily download.

Preview ERIC ED407418: Higher-Order Factor Analysis.

DOCUMENT RESUME TM 026 440 ED 407 418 Gray, B. Thomas AUTHOR Higher-Order Factor Analysis. TITLE 24 Jan 97 PUB DATE 22p.; Paper presented at the Annual Meeting of the Southwest NOTE Educational Research Association (Austin, TX, January 23-25, 1997). Speeches/Meeting Papers (150) Reports PUB TYPE Evaluative (142) EDRS PRICE MF01/PC01 Plus Postage. -z *Correlation; *Factor Analysis; *Matrices; *Orthogonal DESCRIPTORS Rotation *Second Order Effects IDENTIFIERS ABSTRACT Higher order factor analysis is an extension of factor analysis that is little used, but which offers the potential to model the hierarchical order often seen in natural (including psychological) phenomena more accurately. The process of higher order factor analysis is reviewed briefly, and various interpretive aids, including the Schmid-Leiman solution, are discussed. An example of the use of higher-order factor analysis is provided using the Alcohol Use Inventory. The basic process of factor analysis can be conceptualized in terms of a series of matrices. A matrix of data is analyzed to produce a matrix of associations. An appropriate extraction technique is used to produce the factor matrix. An interfactor matrix of associations (factors by factors) is constructed, and factors are again extracted to yield higher order factors that can be rotated. Repeating the process will yield sequentially higher-order factors until either a single factor is extracted, or the extracted factors are uncorrelated even with rotation. Interpreting the higher order factor follows. The solution proposed by J. Schmid and J. Leiman (1975)"orthogonalizes" the factors by residualizing the variance from the primary factors and attributing it to the second-order factor alone. This approach gives another look at a data set that may provide useful information. Higher-order factor analysis is not often used, but it has the potential to aid interpretation. (Contains 1 figure, 6 tables, and 29 references.) (Author/SLD) ******************************************************************************** Reproductions supplied by EDRS are the best that can be made from the original document. ******************************************************************************** -or Higher-Order Factor Analysis 1 Higher-Order Factor Analysis EDUCATION U.S. DEPARTMENT OF and Improvement PERMISSION TO REPRODUCE AND Office of Educational Research INFORMATION TIONAL RESOURCES EDU DISSEMINATE THIS MATERIAL CENTER (ERIC) reproduced as HAS BEEN GRANTED BY This document has been organization received from the person or 77/6/11)93 6,etiy originating it. to Minor changes have been made improve reproduction quality. in this Points of view or opinions stated represent TO THE EDUCATIONAL RESOURCES document do not necessarily official OERI position or policy. INFORMATION CENTER (ERIC) B. Thomas Gray Texas A&M University 77843-4225 Paper presented at the annual meeting of the Southwest Educational Research Association, Austin, TX, January 24, 1997. 2 BEST COPY AVAHA Higher-Order Factor Analysis 2 Abstract Higher-order factor analysis is an extension of factor analysis that is little used, but which offers the potential to more accurately model the hierarchical order often seen in natural (including psychological) phenomena. The process of higher order factor analysis is briefly reviewed, and various interpretive aids, including the Schmid-Leiman solution, are discussed. An example of the use of higher-order factor analysis is provided using the Alcohol Use Inventory. Higher-Order Factor Analysis 3 Higher-Order Factor Analysis Factor analysis is a technique that allows for the reduction of a data set with a large number of variables to one with a smaller, and therefore more manageable number of factors. As Gall, Borg, and Gall (1996) noted, "Factor analysis provides an empirical basis for reducing all these variables to a few factors by combining variables that are moderately or highly correlated with each other" (pp. 447-448). The researcher is thus provided with a set of information which must then be interpreted in a theoretically consistent fashion. As Gorsuch (1983, p. 2) has stated, "Usually the aim [in using factor analysis] is to summarize the interrelationships among the variables in a concise but accurate manner as an aid to conceptualization." Many phenomena, including psychological occurrences, are conceptualized as being hierarchically ordered. For example, Gorsuch (1983) discussed the way the Earth's topography is frequently categorized, being first divided according to land or water. Landforms are subdivided into continents and islands, and each can be further subdivided by location. Similarly, bodies of water are divided into oceans, lakes and streams. If we conceptualize nature as consisting of hierarchically-ordered phenomena, then it is only logical to model the phenomena in this way. This is reflected in the structure of psychological tests, which often include several levels of subtests or subscales, and are therefore implicitly hierarchical. One example is the Wechsler tests of intelligence, which subdivide g, or general intelligence, into verbal and performance domains. The examinee's score for either of these domains may be further broken down according to the subtests of which they are composed. The essential concept of higher order factor analysis follows this same line of reasoning. Factors have been conceptualized as groupings of variables that share an acceptable amount of Higher-Order Factor Analysis 4 variance, or in other words, variables that are correlated with one another. Higher order factors similarly are groupings of factors that are more closely correlated with on another than they are to other factors or factor groupings. The process involves iterations of extracting higher-order factor(s) from the relevant lower-order interfactor matrix of associations until either only a single factor is derived (for example, g) or until the lower- and higher-order factors are the same. This process is explained in greater detail below. A recent review of factor analytic studies in the field of counseling psychology (Tinsley & Tinsley, 1987) failed to mention of higher-order factor analysis, which would seem to bear out Kerlinger's (1984, p. xivv) comment that the procedure "seems not to be widely known or understood." Indeed, it is not typically mentioned in the sections of texts which include substantial introductions to factor analysis (e.g., Crocker & Algina, 1986; Stevens, 1996). This is disturbing, and given the promise that higher-order factor analysis has in terms of reflecting the hierarchical nature of many naturally occurring phenomena, it is to be hoped that future workers will become more familiar with the technique, and consider its use more frequently. Brief Review of the Factor Analytic Process The basic process of factor analysis can be readily conceptualized in terms of a series of matrices, as portrayed in Figure 1 (see Hetzel, 1995, for a very readable description of basic factor analysis; a more detailed treatment is to be found in Gorsuch, 1983). A matrix of data (X, items by variables) is analyzed to produce a matrix of associations (R, variables by variables), usually by either computing correlation coefficients between the different variables or developing the relevant variance- covariance matrix. An appropriate extraction technique, such as principal components analysis (PCA; principle factors analysis, or PFA, is an alternative technique) is then used to produce the factor 5 Higher-Order Factor Analysis 5 matrix (F, variables by factors). Any method of extraction will produce factors which are orthogonal, and the matrix of associations between the factors will therefore by definition be an "identity" matrix. Efforts to extract factors from such a matrix will not yield new factors, but will instead simply It is then possible to rotate the matrix obliquely, which will reproduce the original set of factors. redistribute the variance such that the factors are now correlated (the resultant matrix is labeled F'). The process just described produces primary factors, and the use of oblique rotation implies that they are correlated. Given the nature of the variables most often employed in behavioral science research, correlated variables are to be expected, since investigation of a given construct will almost It is therefore invariably involve measurements which tap into the same part(s) of the construct. reasonable to expect that the variables might be hierarchically related to one another, which in turn would make use of higher-order factor analysis appropriate. Continuing the process outlined above results in the extraction of second order factors (Fig.1). An interfactor matrix of associations (R, factors by factors) is constructed, and factors are then extracted from it using PCA (or PFA), or another suitable method (Gorsuch, 1983). The resultant higher-order factor matrix (H, factors by higher-order factors) can then be rotated. Repeating the process will yield sequentially higher-order factors until either only a single factor is extracted, or until the factors extracted are uncorrelated even with rotation. One notable difference from first-order factor analysis is that the statistical significance of the matrix (Bartlett, 1950) cannot be used as a test to determine the number of factors to retain in a higher-order analysis. This is because the sampling distribution of correlation coefficients will in part be a function of whatever rotation strategy is employed, and thus the distribution will vary according to rotation strategy (Gorsuch, 1983). In fact, this is not a great loss anyway, since the utility of 6 Higher-Order Factor Analysis 6 statistical significance testing is limited, at best (see Cohen, 1995; Thompson, 1989, 1994, 1996). Interpretation of Higher-order Factors The next task facing the researcher is to make sense of the higher-order factor, which is to say, to interpret its meaning. An approach is to base the interpretation of the higher-order factor on the interpretations of the lower-order factor(s) from which it is derived, as Kerlinger (1984) did in his study of social attitudes (see Thompson, 1985). This may be superficially appealing, but it must be borne in mind that any such interpretation necessarily involves elimination of some information. Certainly the purpose of analyses is to remove the information (i.e., the variance) that is not useful for explaining the phenomenon of interest. However, as Gorsuch (1983, p. 245) pointed out, this amounts to "basing interpretations based on interpretations." Information that is deleted at one step is potentially relevant at the next. Several solutions have been offered to help in resolving this problem. All provide the researcher with a variable-by-higher-order-factor matrix of factor pattern coefficients, although they are derived by different methods. Gorsuch (1983) suggested that the primary factor pattern matrix Thompson (1990) carried be postmultiplied by the higher-order factor pattern matrix (P Pm = P,h). this a step further by applying a Varimax rotation to the resultant product matrix (P,h), which seems more consistent with the procedures employed in these analyses. Schmid and Leiman (1957) offered a slightly different approach. Their procedure follows the usual process, but the final result distributes the variance somewhat differently. The Schmid-Leiman solution "orthogonalizes" the factors by residualizing the variance from the primary factors and attributing it to the second-order factor alone. Borrello and Thompson (1990) applied this method in testing the validity of Lee's (1973/1976) typology of love as formulated by Hendrick and Hendrick 7 Higher-Order Factor Analysis 7 (1986). The six basic types of love defined by Lee appeared as primary factors, but the predicted pattern of relationships among them was not supported by the second-order factors which emerged. Noteworthy was the fact that the six basic types were discernible at all steps in the analysis, even with the variance common to the first- and second-order factors removed from the former (Borrello & Thompson, 1990). Thompson (1990) has suggested that elucidating first- and second-order factors from a data set is analogous to looking at a mountain range from a close-up view and again from further away. Following the same line of thinking, it is here suggested that the initial matrix of associations derived from a large data set might be somewhat like being on the streets of New York City, while the primary factors would be like viewing it from atop the Empire State Building. A second-order factor analysis would give the perspective of an airline pilot flying over the city, and a third-order analysis would be like the view from the space shuttle. SECONDOR Thompson (1990) has developed a FORTRAN program that greatly facilitates analysis using the strategy of higher-order factors. The output generated by SECONDOR includes several sets of results. A first order PCA is provided, including an unrotated solution, as well as the Varimax and Promax rotated solutions. The program also allows the worker the option of either retaining all factors with eigenvalues greater than 1, or else manually selecting the number of factors to retain. In addition, SECONDOR provides both an unrotated and a Varimax rotated second-order solution, and a Schmid-Leiman solution. It was this program that was used to generate the output used in the analysis of the love typology as briefly discussed above (Borrello & Thompson, 1990). An Illustrative Example: The Alcohol Use Inventory (AUI) 8 Higher-Order Factor Analysis 8 The AUI is a 147 item instrument designed to evaluate an examinee's patterns of alcohol use (Horn, Wanberg, & Foster, 1974; Wanberg, Horn, & Foster, 1977). It has been thoroughly studied by Skinner (1981a, 1981b; Skinner & Allen, 1983). As reported in Table 1, sixteen scales have been defined, based on prior factor analytic studies; two separate and independent samples yielded mean internal consistency reliability indices on the order of .75 (Skinner, 1981a; Wanberg, et al., 1977). Wanberg and his colleagues (1977) conducted an analysis of the data from 2,261 administrations of the instrument over a period of about four years. The correlation matrix of the 16 scales for this sample is presented in Table 2. Unfortunately, Wanberg et al. (1977) failed to provide adequate details of the procedure they followed to be able to duplicate their analysis (see Skinner, 1983, for discussion of the general failure to adequately report factor analytic studies pertaining to alcohol misuse). The correlation matrix of Table 2 was analyzed using the SECONDOR program. Eigenvalues for the first six factors derived through the principal components analysis were 4.999, 1.518, 1.445, 1.334, 0.951, and 0.892. Given the fairly large separation between the fourth and fifth values, the Guttman rule was applied, and the The factor pattern matrix for the Varimax-rotated solution is first four factors were retained. presented in Table 3. The Promax factor pattern and structure pattern matrices yield essentially the same results. While there are some overall similarities between the analysis reported here using SECONDOR, and the previous work of Wanberg and his associates (1977), there are also some Factor I in Table 3 essentially corresponds to Factor DI (Deterioration) of notable differences. Wanberg, et al., and Factor IV to their Factors A and B together (save that the factor pattern and factor structure coefficients for Variable 3 are relatively small in Factor IV). Factor II in the present 9 Higher-Order Factor Analysis 9 analysis includes the same variables as Factor C of Wanberg, et al., and also includes Variables 15 There is no apparent correlate in the scheme of Wanberg, et al. to Factor III. It is not and 16. surprising that there is nothing in the SECONDOR analysis corresponding to Factor D2 of Wanberg, et al., since the latter is composed of some of the 25 questions not included in the 16 scales that form the variables for this study. A General Alcoholism (g) Factor was also identified by Wanberg and associates (1977), although again the basis for this, including its variable composition, were unreported. SECONDOR yielded two second-order factors in the present analysis, and the Varimax-rotated solution is presented in Table 4. This would seem to indicate that the second-order factor labeled H1 in Table 4 is made up of primary factors I, II, and IV; and that H2 is then made up only of primary factor III. However, Table 5 presents the Varimax-rotated product matrix of the primary and the second-order factor pattern matrices, and examination of this data shows clearly that the variables do not sort into higher-order factors as neatly as Table IV might imply. Indeed, these would seem to represent rather distinct constructs, whose interpretation is better left to another arena. Table 6 presents the Schmid-Leiman solution for this same data. This illustrates that the "orthogonalized" first-order factors retain the same basic composition as that seen in Table 3, and again that the higher-order factors are not composed simply of combinations of the primary factors. Summary The above discussion has presented the process of higher-order factor analysis. Various interpretation aids have been reviewed, and illustrative examples have been provided. Given this information, it is troubling that relatively few applications have been made of this analytic tool (cf. Tinsley & Tinsley, 1986). Of particular concern is the commentary of Nunnally (1978, pp. 431-432), 10

See more

The list of books you might like

Most books are stored in the elastic cloud where traffic is expensive. For this reason, we have a limit on daily download.