Published online by Cambridge University Press: 22 December 2008
The paper reports on the theory and methodology used in a project on Finnish syntax as investigated in (some genres of written) text. 123 texts containing 10,149 clauses (5,016 graphical sentences) were analyzed according to a syntactic coding frame containing 63 variables. The variables include sentence structure, clause structure, clause type, clause function, clause length, surface word order, constituent structure, the text-semantic and referential properties of nominal constituents, and some movement and deletion transformations. The theoretical considerations behind the selection of the variables are discussed in detail, as are the principles for syntactic sampling. The usefulness of the computer for certain low-level tasks, such as finding correlations between variables, is demonstrated. The relevance of quantitative data to ‘qualitative’ syntactic description is stressed.