Book contents
- Frontmatter
- Contents
- Preface
- 1 Why do linguists need statistics?
- 2 Tables and graphs
- 3 Summary measures
- 4 Statistical inference
- 5 Probability
- 6 Modelling statistical populations
- 7 Estimating from samples
- 8 Testing hypotheses about population values
- 9 Testing the fit of models to data
- 10 Measuring the degree of interdependence between two variables
- 11 Testing for differences between two populations
- 12 Analysis of variance – ANOVA
- 13 Linear regression
- 14 Searching for groups and clusters
- 15 Principal components analysis and factor analysis
- Appendix A Statistical tables
- Appendix B Statistical computation
- Appendix C Answers to some of the exercises
- References
- Index
15 - Principal components analysis and factor analysis
Published online by Cambridge University Press: 05 June 2012
- Frontmatter
- Contents
- Preface
- 1 Why do linguists need statistics?
- 2 Tables and graphs
- 3 Summary measures
- 4 Statistical inference
- 5 Probability
- 6 Modelling statistical populations
- 7 Estimating from samples
- 8 Testing hypotheses about population values
- 9 Testing the fit of models to data
- 10 Measuring the degree of interdependence between two variables
- 11 Testing for differences between two populations
- 12 Analysis of variance – ANOVA
- 13 Linear regression
- 14 Searching for groups and clusters
- 15 Principal components analysis and factor analysis
- Appendix A Statistical tables
- Appendix B Statistical computation
- Appendix C Answers to some of the exercises
- References
- Index
Summary
In the previous chapter we defined the idea of a multivariate observation and looked at multivariate analysis techniques for discovering and confirming the presence of special groups among the observed individuals. In the present chapter we will look at methods designed specifically to reduce the dimensionality of the data.
Reducing the dimensionality of multivariate data
Suppose that their scores on p variables, X1, X2, …, Xp, have been observed for each of n subjects (see §15.3 for a language testing example). Typically, the variables will be intercorrelated, each variable having a higher correlation with some of the other variables than it does with the remainder. It is quite common that the pattern of intercorrelations is rather complex. As is often the case, the major statistical interest will lie in considering the differences between individuals. The special problem to be faced now is that the subjects can be different in a variety of ways. Two subjects may have similar scores for some of the variables and quite dissimilar scores on some of the others. If the number of variables is large, then it may be difficult to decide whether subject A differs more from subject B than from subject C since the pattern of differences may be quite dissimilar in the two cases.
- Type
- Chapter
- Information
- Statistics in Language Studies , pp. 273 - 295Publisher: Cambridge University PressPrint publication year: 1986
- 1
- Cited by