1. Introduction
In this article, I present the results of a corpus study of clausal postpositioning in German regional language. This phenomenon, exemplified in (1)–(3) and defined more precisely below, involves the occurrence of a sentential constituent in the postfield of the matrix clause to which it is syntactically linked.Footnote 1
In these examples, the three clausal forms under examination in this study—relative clauses (RCs), non-relative finite clauses (NRFCs), and infinitival constructions (ICs)—occur after the right boundary of the matrix clause’s inner field, which is framed by wird ‘will’ … finden ‘find’, habe ‘have’ … gefunden ‘found’, and haben ‘have’ … vergessen gehabt ‘forgotten had’. In this placement, they display clause-final embedding.Footnote 3
The empirical focus of the present study is spoken varieties, defined as “subsystem[s] of a language characterized by internal linguistic cohesion, clear system boundaries (separating [them] from other varieties), well-defined pragmatic functions and an emic status” (Lenz Reference Lenz, Auer and Erich Schmidt2010:295–296; see also Schmidt Reference Schmidt, Lenz and Mattheier2005 and Schmidt & Herrgen Reference Schmidt and Herrgen2011:49–53). The relationships between such varieties can be modeled in terms of their relative positions in dimensions along two axes, namely a horizontal-areal (diatopic) axis representing geographic space and a vertical-social (diastratic) axis extending from the lowest pole, the dialect, to the highest pole, the standard variety (see, for example, Bellmann Reference Bellmann and Bellmann1986, Auer & Hinskens Reference Auer, Hinskens, Auer, Hinskens and Mattheier1996, and Herrgen Reference Herrgen, Auer and Erich Schmidt2010 for discussion of bidimensionality). I limit my study to regional language, which I understand as the areally bound, regionally marked aggregate of spoken forms “beneath” the codified standard language on the vertical axis ranging from dialect to the “regional accent” or “regional standard,” situated closest to the standard pole (Lenz Reference Lenz, Kallmeyer and Zifonun2007:169, Reference Lenz and Haider Munske2008:2, Schmidt & Herrgen Reference Schmidt and Herrgen2011:63–67). In particular, I examine postpositioning in base dialects, the most geographically restricted varieties with the greatest linguistic divergence from the standard language (Lenz Reference Lenz, Auer and Erich Schmidt2010:296, Schmidt Reference Schmidt, Auer and Erich Schmidt2010:217), and regiolects, “standard-divergent varieties with broader regional distribution that can be located between dialects and standard varieties” (Lenz Reference Lenz, Auer and Erich Schmidt2010:302).Footnote 4
A survey of the literature (see Vinckel-Roisin Reference Vinckel-Roisin and Vinckel-Roisin2015 for an overview) reveals that most investigations of the postfield and postpositioning in modern German deal with the spoken and written standard (for example, Lambert Reference Lambert1976, Hoberg Reference Hoberg1981, Zahn Reference Zahn1991, Filpus Reference Filpus1994, Vinckel Reference Vinckel2006, Żebrowska Reference Żebrowska2007). A few works treat postpositioning in spoken regional German, specifically dialects and regiolects, yet each offers only partial insights: Brode (Reference Brode1971) investigates the whole German linguistic area, but only regiolects; Patocka’s (Reference Patocka1997) study is limited to Bavarian dialects spoken in Austria; and Westphal Fitch (Reference Westphal Fitch2011) considers both dialect and regiolect but juxtaposes the dialect of one linguistic region with the regiolect of another. In addition, many studies (for example, Brode Reference Brode1971, Zahn Reference Zahn1991, Westphal Fitch Reference Westphal Fitch2011) examine only non-clausal postpositioning since subordinate clauses display a strong affinity for the postfield of the matrix clause (Zifonun, Hoffmann & Strecker Reference Zifonun, Hoffmann and Strecker1997:1650–1654) and a high degree of grammaticalization, particularly of complement clauses, in that position. However, there is a lack of consensus in the literature about where various subordinate clause types may or must occur, in particular which types may or may not appear in the matrix clause’s inner field, the region of the clause bounded by the left and right brackets. Eisenberg (1994:415, Reference Eisenberg1999:391), for example, contends that attributive clauses, but not complement or adverbial clauses, may be embedded in the inner field. Similarly, Dürscheid (Reference Dürscheid2012:103) claims that conjunctional clauses, including complement and adverbial clauses, as well as ICs only occur in the pre- or postfield. Helbig & Buscha (Reference Helbig and Buscha2013:477) go a step further by characterizing the occurrence of subordinate clauses in the postfield of the matrix clause, including RCs, as “grammatikalisierte Ausrahmung” ‘grammaticalized exbraciation’. By contrast, Patocka (Reference Patocka1997:129) argues against Eisenberg’s (Reference Eisenberg1994:415) assertion that only attributive clauses may occur in the inner field. Duden (2016:1067–1069) also demonstrates that RCs and adverbial clauses can readily appear not only in the postfield, but in the inner field as well (see also Zifonun, Hoffmann & Strecker Reference Zifonun, Hoffmann and Strecker1997:2348, 2352). This holds for the present dataset, as the following examples demonstrate.Footnote 5
Given the disagreement in the literature regarding subordinate clause embedding, specifically the occurrence of clausal constituents in the inner field of the matrix clause (see Sahel Reference Sahel and Vinckel-Roisin2015:169–170 for further discussion), the distribution of these constituents merits further elucidation.
To gain new insight into clausal postpositioning in regional varieties, the present investigation examines 11,027 clauses in 60 spoken regiolect and dialect texts of the Pfeffer and Zwirner Corpora (Institut für Deutsche Sprache) from the North Low German, West Central German, and East Upper German linguistic areas. The following research questions are addressed:
-
1. Which forms and functions of clausal postpositioning are attested in the dataset?
-
2. How are postpositioned clauses distributed in dialects and regiolects according to form and function?
-
3. To what extent can clausal constituents that appear in the postfield also appear in the inner field?
-
4. What is the frequency of clausal postpositioning according to variety?
Besides addressing these questions, the current study establishes a methodology not only for this work, but also for further examinations of both clausal and non-clausal postpositioning.
The present investigation is part of a larger project on both non-clausal and clausal postpositioning in German regional language that complements studies of the phenomenon in standard German (for example, Lambert Reference Lambert1976, Filpus Reference Filpus1994) and recent works that examine syntactic phenomena on the vertical spectrum (for example, Lenz Reference Lenz2013, Kallenborn Reference Kallenborn2019). As this study draws on data that were collected in the mid-twentieth century, the results can also serve as a baseline to which examinations of postpositioning in contemporary German regional language can be compared. More generally, the study fits into the larger context of research on German regional language variation (for example, REDE - Regionalsprache.de, Schmidt & Herrgen Reference Schmidt and Herrgen2011, Kehrein, Lameli & Rabanus Reference Kehrein, Lameli and Rabanus2015), syntactic variation in particular (for example, SADS – Syntaktischer Atlas der deutschen Schweiz, SynAlm – Syntax des Alemannischen, SyHD – Syntax hessischer Dialekte), as well as projects on the syntax of other Germanic languages (for example, DynaSAND – Dynamische Syntactische Atlas van de Nederlandse Dialecten, FRED – Freiburg English Dialect Corpus, ScanDiaSyn – Scandanavian Dialect Syntax, YGDP – Yale Grammatical Diversity Project).Footnote 6
The remainder of the article is outlined as follows. In section 2, I present the methodology, which lays the groundwork for the current investigation and further studies of postpositioning in German regional language. This is followed by a typology of clausal postpositionings and an analysis of variation in section 3. In this context, I discuss the results of other studies that deal with postpositioning, primarily Patocka (Reference Patocka1997), and compare them to those of the current study. Section 4 concludes the article.
2. Methodology
This section introduces the methodology employed in the present study. After describing the corpora from which the data are drawn (2.1), I discuss the linguistic areas and recordings under investigation (2.2), the criteria for analysis (2.3), the procedure for data collection (2.4), and the presentation of results in section 3 (2.5).
2.1. Corpora
The primary repository of German dialect data is the Zwirner Corpus, which was compiled between 1955 and 1972 under the direction of Eberhard Zwirner and is currently maintained online by the Institut für Deutsche Sprache (IDS) in the Datenbank für Gesprochenes Deutsch (DGD).Footnote 7 The corpus comprises 5,796 recordings of speakers from approximately 1,000 locations in Austria (Vorarlberg), France (Alsace), Liechtenstein, The Netherlands, and former West Germany. The majority of recordings (58%) are 9 to 12 minutes in duration.
Where feasible, one autochthonous speaker from each of three generations (c. 20 years old, c. 40 years old, above 60 years old), as well as three repatriates from former German settlements in the east (for example, Silesia) were selected and interviewed in each location using professional recording equipment. To date, 2,922 of the recordings in the corpus have been transcribed using standard German orthography and aligned electronically with the transcriptions. The vast majority of these primarily narrative texts, elicited through interview prompts and questions, reflect speech situated near the bottom of the vertical-social spectrum, while approximately 6% have been classified as “colloquial” and a handful as standard.
For regiolect data, I draw on material from the Pfeffer Corpus following, for example, Lenz (Reference Lenz2013). This corpus, which is also accessible through the DGD, was compiled in 1961 by Alan Pfeffer and Walter F. W. Lohnes (see Wagener & Bausch Reference Wagener and Bausch1997:120–121 for further information). It includes 398 recordings, with the majority (55%) lasting between 11 and 13 minutes, collected from 57 cities in Austria, former East and West Germany, and Switzerland. As with the Zwirner Corpus, the consultants were interviewed and professionally recorded. All of the spoken texts have been transcribed and aligned with standardized transcriptions.
Although the project leaders’ goal was to capture the urban “colloquial” speech of consultants from different age groups, levels of education, and professions, the texts in the corpus represent a wider range of varieties, from dialect to the standard (see Spiekermann Reference Spiekermann, Eichinger and Kallmeyer2005:107). Nonetheless, Lenz (Reference Lenz, Kallmeyer and Zifonun2007, Reference Lenz2013) demonstrates that the Pfeffer Corpus recordings generally represent varieties much closer to the standard language on the vertical-social spectrum compared to those from the Zwirner Corpus.
2.2. Linguistic Areas and Recordings
To provide a cross-regional perspective, I analyze data from the three non-contiguous linguistic areas shown in figure 1, namely North Low German (NLG), West Central German (WCG), and East Upper German (EUG), which were chosen as representatives of the Low German, Central German, and Upper German regions, respectively. Low German dialects distinguish themselves from Central and Upper German dialects through the absence of the High German consonant shift, while Upper and Central German dialects differ most prominently in the shift of p- and -pp- to pf- and -pf- in the former but not the latter (see Niebaum & Macha Reference Niebaum and Macha2014:250–253).
With respect to the analyses in section 3, the patterns of frequency for the individual linguistic areas mirror those for the three areas combined. Therefore, the areas are treated in aggregate in the present study.
Sixty recordings from the Zwirner and Pfeffer Corpora, 10 per linguistic area/variety combination, serve as the empirical basis for the study.Footnote 8 These were randomly selected from the pool of recordings delimited according to the criteria discussed below, namely variety, age, and gender. The recording IDs, with the corresponding information on linguistic area, speaker age, and speaker gender, are presented in table A1 of the appendix.
As noted in section 2.1, the Zwirner and Pfeffer Corpora were both compiled using professional recording equipment in interview settings. Interviewers elicited speech from each consultant through a question/prompt (or series of questions/prompts, depending on the length of the consultant’s responses) until the target recording length had been reached.Footnote 9 With one exception, the recordings selected for the current study were made in localities in former West Germany.Footnote 10 All Zwirner consultants are autochthonous, while the Pfeffer consultants, in nearly all cases, originate from the city or near the city whose “colloquial speech” the researchers intended to capture.Footnote 11
The recordings I selected to represent dialect are classified in the database catalogue (Haas & Wagener Reference Haas and Wagener1992) exclusively as Vollmundart ‘base dialect’ according to the designations set forth by the Deutsches Spracharchiv (see Pfeffer & Lohnes Reference Pfeffer and Lohnes1984, vol. 1:53). As for regiolect, recall from section 2.1 that the recordings from the Pfeffer Corpus represent a relatively wide range of varieties. I narrowed the pool of potential recordings to only those that are designated in the database catalogue (Pfeffer & Lohnes Reference Pfeffer and Lohnes1984, vol. 1) as Umgangssprache ‘colloquial speech’ with the subcategorizations landschaftliche Bildungssprache ‘regional educated speech’, allgemeine Umgangssprache ‘general colloquial speech’, and landschaftlich gefärbte Umgangssprache ‘regionally colored colloquial speech’, excluding all those coded as Hochsprache ‘standard language’ or Mundart ‘dialect’. Of those recordings selected for the study, 20% (6/30) are classified as landschaftliche Bildungssprache or landschaftlich gefärbte Umgangssprache (in one case both), while 80% (24/30) are assigned to the category allgemeine Umgangssprache alone or together with one of the other ‘colloquial’ designations. Although a complete homogenization of the Pfeffer data with respect to variety was not feasible because of restrictions on age and gender (see below), this narrowing maximizes the comparability of data across linguistic areas and varieties.Footnote 12
To control for the possible influence of age on patterns of postpositioning, I aimed to limit the inclusion of recordings to those from what Spiekermann (Reference Spiekermann2008:92–93) terms the “old generation” of speakers above 50 years old. This goal proved particularly difficult in the selection of recordings from the Pfeffer Corpus since relatively few recordings also meet the above criteria for geographic area and variety. As a result, it was necessary to include nine Pfeffer recordings from speakers in their 40s, and in one case, late 30s. However, the patterns of frequency with respect to the analyses in section 3 remain the same both with and without the data from the younger regiolect speakers. Thus, I contend that the inclusion of these data is unproblematic for the present study. Table 1 shows the average age per linguistic area/variety.
Still more difficult to achieve was an even distribution of male and female speakers by linguistic area and variety. It was not possible to find suitable recordings of five males and five females for each linguistic area/variety combination, but the distribution of speakers by gender is the same for both varieties (19 males and 11 females), and for each linguistic area, I aimed to keep the ratio of male to female speakers in regiolect and dialect as close to identical as possible. As with the factor “linguistic area,” the patterns of frequency with respect to the analyses in section 3 for male and female speakers alone reflect those for speakers of both genders combined. Therefore, the genders are treated in aggregate in the present study.
2.3. Criteria for Analysis
As has been discussed in the literature (for example, Zahn Reference Zahn1991:68, Patocka Reference Patocka1997:318, 329, Ágel Reference Ágel, Werner Besch, Reichmann and Sonderegger2000:1873–1874), studies on postpositioning vary widely in how the unit of measurement is defined and what counts as a case of postpositioning, which greatly complicates the comparison of results. In the present work, the analysis of the data according to predetermined criteria allows for the direct juxtaposition of findings by variety. I lay out these criteria in the following paragraphs.
The theoretical framework for the present study is the Topological Field Model, an early formulation of which dates back to Drach (Reference Drach1937). A revised version of this model, presented in figure 2, is commonly employed in modern treatments of the German clause (for example, Patocka Reference Patocka1997; see Wöllstein Reference Wöllstein2014 for a detailed introduction to the model; see also Dalmas & Vinckel Reference Dalmas, Vinckel, Dębski and Fries2006 and Vinckel Reference Vinckel2006:57–62 for an alternative view).
The clause may be divided into three fields (prefield, inner field, postfield), which are separated by the left and right brackets. The left bracket houses finite verbs and conjunctions, while the right bracket is overtly filled by (non-)finite verbs and separable elements such as prefixes. In main clauses, the right bracket may also be left empty, represented by a null element. The prefield is typically occupied in main clauses by subjects, adverbial phrases, and embedded clauses and is the position for relative and interrogative words/phrases in subordinate clauses.Footnote 13 When clausal material ends at the right bracket, the remaining constituents occupy the inner field. The primary focus of the present study is material that appears in the postfield,Footnote 14 which I understand here as the slot after the right bracket where postpositioned constituents that are syntactically linked to the matrix clause are located.Footnote 15 By way of illustration, (6) and (7) demonstrate how a verb-second (V2) and verb-final (VF) matrix clause are parsed within the Topological Field Model:
Most important for the current investigation are the postpositioned clauses die er bereits kennt ‘whom he already knows’ and wo die Hühner tüchtig legen ‘when the hens are busy laying eggs’; both constituents occur in the postfield of the matrix clause to which they are syntactically linked, namely er wird dabei andere Jugendliche finden ‘in doing so, he will find other young people’ and wenn die Zeit ist ‘when it’s the time’, respectively.
To determine unequivocally whether a constituent is postpositioned, it is necessary to identify the right boundary of the matrix clause’s inner field. This is clearly marked by an overtly filled right bracket, which forms a bracket construction in conjunction with the element in the left bracket (or prefield in the case of relative and interrogative clauses). This boundary can also be signaled by a constituent in the inner field itself. However, as Patocka (Reference Patocka1997:235–243) discusses, there is no clear consensus regarding which such elements may serve this purpose.
After consulting five works that deal in part or exclusively with postpositioning in modern standard German or German regional language (Brode Reference Brode1971, Lambert Reference Lambert1976, Zahn Reference Zahn1991, Filpus Reference Filpus1994, Patocka Reference Patocka1997), I selected seven V2, three non-infinitival VF, and two infinitival VF bracket types for inclusion in the catalog of bracket constructions (henceforth BCs), the closing element of which denotes the end of the inner field.Footnote 16
I begin with the V2 constructions, which are illustrated in (8)–(16) below.Footnote 17 The elements that open and close the BC are underlined.Footnote 18
In the first five BC types, the construction is opened and closed by an element in the left and right bracket, respectively. Light verb constructions, by contrast, are closed not by the right bracket, but by a non-verbal constituent at the right periphery of the inner field.Footnote 22 Finally, the adverbial BC, formed by a discontinuous pronominal adverb in a stranding or doubling construction, differs from all the other types in that neither the left nor the right bracket is part of the BC; instead, the split components of the pronominal adverb that form the BC occupy the prefield and inner field.Footnote 23 Although this bracket type is not included in the inventory of BCs in any of the works on postpositioning that I surveyed, it is considered a BC elsewhere in the literature (for example, Ronneberger-Sibold Reference Ronneberger-Sibold, Norbert Boretzky, Jessing and Stolz1991:216–218, Otte-Ford Reference Otte-Ford2016:291) and functions analogously to the other constructions by separating the inner field from the postfield.Footnote 24 Thus, I include it in the present study.
In contrast to the V2 BCs, non-infinitival VF BCs display a right bracket filled exclusively by a finite verb form (including a fused separable prefix, as the case may be). The constructions are opened by an element in the left bracket (conjunction) or prefield (relative or interrogative word/phrase):
Finally, in infinitival VF BCs, the infinitive with zu forms the right bracket, analogously to the finite verb in non-infinitival VF BCs.Footnote 25 The left bracket may be filled by a null element or a conjunction:
In all of the BCs just introduced, some overt constituent serves to close the construction, thereby clearly marking the right boundary of the inner field and facilitating the identification of postpositioned constituents. With regard to V2 clauses in which the right bracket is not overtly filled, Patocka (Reference Patocka1997:52) questions whether any constituent that occurs after the finite verb (after the left bracket) would necessarily be located in the inner field. He argues, following Reis (Reference Reis1980), that postpositioning is, indeed, possible and can be clearly identified even when the BC is closed by a null element (pp. 52–54). In the following, I summarize Patocka’s argumentation, beginning with the following examples:
In (22a,b), the adverbial phrases wegen des Tadels ‘because of the reprimand’ and sorgfältig ‘carefully’ appear unambiguously in the inner field, as indicated by the presence of the past participle geschrieben ‘written’ in the right bracket. At first blush, the grammaticality difference between these sentences would appear to follow from a violation of the rule “causal before modal,” that is, the requirement that a causal adverbial phrase (here: wegen des Tadels) appear before the modal one (here: sorgfältig). However, (23a,b) seem to be counterevidence to this account because both sentences are grammatical, regardless of the ordering of the adverbial phrases.
To explain the different patterns of grammaticality in (22a,b) vs. (23a,b), Patocka contends that wegen des Tadels in (23b) must be postpositioned, appearing after the right bracket, which is filled by a null element. This structure directly parallels corresponding sentences in the present perfect in which the right bracket is overtly realized:
While this argumentation is sound given the assumption that the order of adverbials is fixed, it is well known that German word order is sensitive to information structure and prosody and that constituents in the inner field may undergo scrambling and/or appear in multiple positions (for example, Frey & Pittner Reference Frey and Pittner1998). Therefore, it is possible that the grammaticality of (23b) is attributable not to the postpositioning of wegen des Tadels, but to the variable placement of the adverbial phrases—both of which remain in the inner field—for information structural reasons. Similarly, for all four native German speakers whom I consulted, (22b) is grammatical given the proper information structural context and prosodic pattern.Footnote 26
Patocka (Reference Patocka1997:54) makes an additional argument for postpositioning in the absence of an overtly filled right bracket, based on clausal data such as the following:
In (26a), the phrase was los war ‘what was wrong’ is clearly postpositioned, as indicated by the presence of an overtly filled right bracket, gewußt ‘known’, in the matrix clause. Patocka contends that it is implausible to assume that the phrase was los war in (26b) is located in the inner field of the matrix clause as it may at first appear, but rather that it is also postpositioned as in (26a), in this case after a right bracket filled by a null element.
However, recall from section 1 that one clause may be embedded in the inner field of another, as the following example from the present dataset shows:
Example (27) corresponds to a theoretical (unattested) sentence like (28) in which the verb übernehmen ‘to take over’ is in the preterite. In this case, the right bracket of the matrix clause is not overtly filled.
If the argument is made that the dependent clause in (26b) should be considered postpositioned in alignment with (26a), then a parallel argument can be made taking the center-embedded clause as a starting point: Since in (27) the dependent clause appears unambiguously in the inner field of the matrix clause, then for consistency’s sake, it could be considered to be located in the inner field in (28) as well. In other words, the assignment of the dependent clause to the inner field or postfield of the matrix clause in cases such as (26b) or (28) depends on the construction to which it is compared.
I agree with Reis (Reference Reis1980) and Patocka (Reference Patocka1997:52–54) that the right bracket need not be overtly filled for a constituent to occupy the postfield and, furthermore, that the adverbial phrase wegen des Tadels in (23b) and the clause was los war in (26b) may well be located in the postfield after a right bracket filled by a null element. However, as I have demonstrated, this is not necessarily the case. To avoid the problem of ambiguity with respect to the position of constituents in clauses without an overtly realized right bracket, I therefore limit my analyses, unless otherwise noted, to structures in which the BC is overtly closed, following, for example, Engel (Reference Engel1970).
With the catalog of BCs in place, it is necessary to establish what constitutes a unit of measurement in the current work and further related studies of postpositioning (see Lambert Reference Lambert1976:51–52, Zahn Reference Zahn1991:31–60, and Patocka Reference Patocka1997:33–38 for discussion). Defining this unit is not entirely straightforward because of the possibility of clausal embedding in the prefield, inner field, or postfield. This can lead to ambiguity regarding the number of units at hand.Footnote 27 Consider the following examples:
In (29), the subordinate clause weil Emden nicht genug bieten könnte ‘because Emden couldn’t offer enough’ appears after the right bracket of the matrix clause to which it is syntactically linked and is therefore clause-finally embedded. On the other hand, the subordinate clause nachdem wir alle größer waren ‘after we were all bigger’ in (30) is center-embedded within its associated matrix clause. It is possible to count both cases as one unit of measurement corresponding to the matrix clause only, the first with postpositioning and the second without. However, a constituent may be postpositioned in a subordinate clause that itself occurs in the prefield, inner field, or postfield of its matrix clause, as in (31)–(34):
The postpositioning of a constituent in the subordinate clauses—non-clausal in (31)–(33), clausal in (34)—indicates that these clauses should be counted as units of measurement in their own right. Therefore, I count each instance of a clause as one unit regardless of its relationship to other clauses and whether or not the right bracket is overtly filled. Accordingly, (29)–(33) each contain two units of measurement, while (34) contains three.
So far, I have referred to constituents located in the postfield as “postpositionings.” In the remainder of this subsection, I aim to give a more precise definition of this concept as utilized in the present article.
Various terms appear in the literature to describe the types of material that occur after the right boundary of the inner field according to their grammatical and functional status as well as the degree of prosodic integration into the preceding structure. As Vinckel (Reference Vinckel2006:26–30, 53–55) points out, however, there is a lack of terminological consistency; different terms are used to denote one particular type, while at the same time, a single term may be used as a label for different types, according to criteria that vary by author. Therefore, it is imperative to clarify precisely which constructions are the object of study. To begin, I describe and illustrate the two types of constituents that are the foci of my investigations of non-clausal and clausal postpositioning.
With respect to non-clausal material, I examine constituents, for example PPs (see Dubenion-Smith Reference Dubenion-Smith and Xing2020), that by default reside in the inner field or, rarely, the prefield but instead occur in the postfield where they are integrated into the prosodic structure of the preceding clausal material (see Auer Reference Auer1991, Schwitalla Reference Schwitalla2012:114–117, Kleemann-Krämer, Kügler & Pötzl Reference Kleemann-Krämer, Kügler, Pötzl and Vinckel-Roisin2015, and Duden 2016:1227–1229 for discussion of the prosodic integration of non-clausal postfield constituents), as the following examples illustrate:
In (35), the default placement of the PP in den letzten Jahren ‘in recent years’ is the inner field, while in (36), the PP von acht Kindern ‘of eight children’ is expected to appear in the prefield as an attribute to the DP die Mutter ‘the mother’.
As for clausal constituents, I include in the analyses of postpositioning in the present article exclusively VF clauses without a preceding pronominal or adverbial correlate that appear in the postfield of a matrix clause. In the case of RCs and clausal nominal attributes, the antecedent or modified noun usually occurs in the inner field, such as Jugendliche ‘youths’ in (37):
Rarely, the RC antecedent or the noun modified by the clausal nominal attribute, for example, Hemmungen ‘inhibitions’ in (38) and Gefahr ‘danger’ in (39), is located in the prefield:
Following Patocka (Reference Patocka1997:105, 116, 118–120, 161, 193–194), I do not consider here as cases of clausal postpositioning those clauses after the right bracket with a correlate in the prefield or inner field of the matrix clause, such as a demonstrative pronoun:
Such clauses function analogously to instances of right dislocated non-clausal constituents (see Altmann Reference Altmann1981:54–55 and Patocka Reference Patocka1997:368–377), as illustrated in (41).
Finally, I also exclude V2 complement clauses, V2 clauses with a paratactic conjunction (for example, weil ‘because’ and obwohl ‘although’), and V2 RCs, which differ from embedded VF clauses in the type and degree of syntactic linkage to the matrix clause (see, for example, Gärtner Reference Gärtner2001, Reference Gärtner2002, Catasso Reference Catasso2014, Freywald Reference Freywald, Finkbeiner and Meibauer2016, Reference Freywald2018). A V2 RC is illustrated in (42).Footnote 28
The types of postpositioned non-clausal and clausal constituents that I have chosen to examine in the current and related studies correspond most closely to the terms Ausklammerung ‘exbraciation’ and Extraposition ‘extraposition’ as described in overviews of terminology in the literature, such as Altmann (Reference Altmann1981:54–72), Auer (Reference Auer1991), Zifonun, Hoffmann & Strecker (Reference Zifonun, Hoffmann and Strecker1997:1644–1675), Bußmann & Gerstner-Link (Reference Bußmann and Gerstner-Link2002:106), Vinckel (Reference Vinckel2006:13–55), and Duden (2016:1227–1229).Footnote 29 For three reasons, however, I have decided to adopt the term “postpositioning,” akin to the German designation Nachfeldbesetzung employed in some studies (for example, Patocka Reference Patocka1997). First, the term “exbraciation” implies the movement of a non-clausal constituent from the inner field, where it is generated, outside of the bracket into the postfield.Footnote 30 Yet, as (36) above demonstrates, a constituent that appears in the postfield may be syntactically linked to the prefield as well. Only one such case occurs in the present dataset, indicating that the postpositioning of non-clausal prefield constituents is very rare. Nonetheless, the phenomenon is possible and not accurately captured by the label “exbraciation.” Second, the designation “postpositioning” is neutral with respect to the syntactic derivation of these structures in the postfield, which has been debated in the literature (for instance, Büring & Hartmann Reference Büring, Hartmann, Lutz and Pafel1995, Inaba Reference Inaba2007, Haider Reference Haider2010, and Frey Reference Frey and Vinckel-Roisin2015). Specifically, it is debated whether they occur in the postfield of the matrix clause as the result of movement or are base-generated there. For present purposes, what is important is the constituents’ appearance in the postfield. Third, this single label can be used to refer to both non-clausal and clausal constituents, highlighting the similarity between both types and unifying them.
The unifying characteristic of both non-clausal and clausal postpositionings, as defined in the current study, is their syntactic linkage to a matrix clause. In the case of non-clausal postpositioning, the linkage arises through a constituent’s potential occurrence in the prefield or inner field of the matrix clause where it fulfills a syntactic function. Such elements include adjuncts, attributive phrases, comparatives, complements, and modal modifiers (see Dubenion-Smith Reference Dubenion-Smith and Xing2020). With clausal postpositionings, the syntactic linkage arises through their dependence on and structural relationship to the matrix clause. Like non-clausal postpositionings, they may fulfill a syntactic function, for example, as an object complement (see Dürscheid Reference Dürscheid2012:59–61 and Duden 2016:1030–1031, 1038–1040). Consecutive clauses and continuative w-word RCs (see (60) and (76) below) do not fulfill a syntactic function in the matrix clause, but nonetheless refer back to the matrix structure. In the present work, adjunct (adverbial) clauses, complement clauses, clausal nominal attributes, comparative clauses, continuative clauses, and RCs are attested (see section 3.2 below).
2.4. Procedure
With all preliminaries in place, I turn now to a description of the procedure for data collection. After selecting the recordings according to the criteria laid out in section 2.2, I listened to each in its entirety to verify the accuracy of the standardized transcription.Footnote 31 I then parsed the transcriptions, excluded all material produced by the interviewers, classified all clauses according to their clausal status (V1/V2 or VF), and identified cases of V1/V2 clauses without an overtly filled right bracket.Footnote 32 I also catalogued all clauses with an empty inner field or only a pronoun or negation, including modified negators such as gar nicht ‘not at all’, which may not be postpositioned, as in (43)–(45).
These include cases in which a PP nominal attribute or RC occurs in the prefield or a clause is postpositioned:
Since the spoken texts were elicited through interview questions and prompts, it is possible that the word order in the first clause uttered by a consultant in response to a question or prompt may have been affected by syntactic priming. That is, the consultant may have reproduced the syntactic structure of the interviewer’s utterance. This is exemplified in (49).
I therefore identified potentially syntactically primed structures in the dataset and excluded them from further analysis.
Finally, I created databases to organize all cases of non-clausal and clausal postpositioning according to their form and function and whether they are postpositioned in a V1/V2 or VF clause, as well as all instances of a clause embedded in the inner field of a matrix clause.
2.5. Presentation of Results
The analyses presented in section 3 are based on the 11,027 clauses in the current dataset, which are drawn from 60 spoken regiolect and dialect texts containing a varying number of clauses.Footnote 33 Although the large size of the current dataset is advantageous, the extraction of multiple units of measurement from 60 different sized units of sampling poses two challenges. First, distributional patterns present in individual units of sampling can be under- or overrepresented in the dataset as a whole, depending on the relative size of those units. Second, the units of measurement, that is, clauses in a given unit of sampling, are not independent of one another since they are produced by the same speaker. However, the assumption of independence of data points underlies many statistical tests that might be applied to the data.
To address these challenges, Baroni & Evert (Reference Baroni, Evert, Lüdeling and Kytö2008:798) propose that the data points for analysis not be the individual units of measurement (in this case, the 11,027 clauses), but rather the units of sampling themselves (in this case, 60 texts), with each data point expressed as a proportion of the tokens that display the linguistic phenomenon in question to the total number of units of measurement in each sample. For the results of the analyses presented in section 3, I therefore do not present pooled percentages, which would be calculated by summing the number of units of measurement that display a particular linguistic phenomenon, across all texts, and dividing the sum by the total number of units of measurement in these texts. Rather, I present equally weighted mean relative frequencies, which I calculate by first determining for each text the percentage of the total number of units of measurement that display a particular linguistic phenomenon (the relative frequency), then averaging the percentages across all texts in a given group, for example, all regiolect texts or all dialect texts. I provide the standard error (SE) for each mean value as a measure of the precision of the sample mean.
Since each unit of sampling (that is, each spoken text) represents one independent data point, the equally weighted mean relative frequencies in different groups of texts can also be compared to determine if there is a statistically significant difference between the means. Following the guidance of Baroni & Evert (Reference Baroni, Evert, Lüdeling and Kytö2008:798), the primary statistical test employed in the current study is Student’s t-test for independent samples.Footnote 34 To ensure the validity of the results, this test requires that three conditions for the data be met, namely the absence of outliers, the normal distribution of data points, and the homogeneity of variances.Footnote 35 Unless otherwise stated, these conditions are met in the analyses to follow. I also utilize a related test, the paired t-test, to compare the equally weighted mean relative frequencies in the same group of texts under two conditions. In cases in which the requirement of the normal distribution of data points is not met, I perform the non-parametric counterpart to Student’s t-test, the Mann–Whitney U test. This test does not allow for the direct comparison of means but determines via mean ranks whether the values in one group (that is, the relative frequencies for the individual texts) are statistically significantly higher or lower than the values in the other group (Laerd 2022).
Finally, to control for false-positive results in the face of multiple pairwise comparisons, I apply the Bonferroni–Holm correction (Holm Reference Holm1979) and present adjusted p-values. Results are considered statistically significant when p < .05.
3. Analyses of Clausal Postpositioning
This section begins with an overview of the distribution of clauses in the dataset (3.1), followed by a typology of clausal postpositionings and analyses of their distribution by form and function in regiolect and dialect (3.2). I then examine the frequency of center-embedding of NRFCs and RCs (3.3). The section concludes with an analysis of the frequency of clause-final embedding and a discussion of restrictions on syntactic subordination (3.4).
3.1. Distribution of Clauses
As table 2 shows, the set of data derived from 60 spoken regiolect and dialect texts comprises 11,027 clauses, 6,868 (62.28%) of which have an overtly filled right bracket. The distribution of the 6,868 clauses with an overtly filled right bracket, according to the presence or absence of constituents in the inner field and postfield, is displayed in table 3.
Finally, table 4 shows the distribution of the 5,923 clauses with (a) constituent(s) other than ø, a pronoun, or negator in the inner field and/or (a) postfield constituent(s) according to clause type (V1/V2 and VF).
For the purpose of comparability to examinations of non-clausal postpositioning (Dubenion-Smith Reference Dubenion-Smith and Xing2020), I determine the equally weighted mean relative frequencies of clausal postpositioning in section 3.4 on the basis of clauses with an overtly filled right bracket that have any constituent besides ø, a pronoun, or negator in the inner field and/or any postfield constituent (5,923), whether non-clausal or clausal. Notably, I do not limit my calculations to only those cases in which a clausal constituent appears in the inner field and/or postfield.
3.2. Typology and Distribution of Clausal Postpositionings
Referring to the classification systems in Engel (Reference Engel2009a, Reference Engel2009b), Patocka (Reference Patocka1997), who bases his categorization primarily on earlier editions of Engel’s works, Zifonun, Hoffmann & Strecker (Reference Zifonun, Hoffmann and Strecker1997:1026–1118, 1448–1473, 2275–2332), and Meibauer, Steinbach & Altmann (Reference Meibauer, Steinbach and Altmann2013), I identified the form and function of the 612 cases of clausal postpositioning in the dataset (351 regiolect, 261 dialect) to address research questions 1 and 2.Footnote 36 These include 610 instances with syntactic linkage to the inner field and two instances of linkage to the prefield, presented in (38) and (39) above.Footnote 37
In the examples below, I present each function in only its most frequent form in the dataset. These are followed by table 5, which shows the equally weighted mean relative frequency for each form in regiolect and dialect.Footnote 38 The relative frequencies in each text are calculated by dividing the number of postpositionings of each type by the total number of postpositionings.Footnote 39
The group “multiple clauses” comprises all cases of postpositioning that involve more than one clause, regardless of whether they are of the same type (for example, two NRFCs as in (64) or one NRFC and one RC as in (65)). Setting these aside, table 5 shows that both regiolect and dialect display the same cline in equally weighted mean relative frequencies with respect to the three forms: NRFC > RC > IC. However, the distribution of these forms differs by variety, with NRFCs occurring at a lower equally weighted mean relative frequency and comprising a smaller proportion of the postpositionings in regiolect than in dialect, and RCs and ICs occurring at higher equally weighted mean relative frequencies and comprising larger proportions of the postpositionings. Mann–Whitney U tests indicate that for NRFCs, the relative frequencies in the individual dialect texts are statistically significantly higher than in the regiolect texts (mean rank dialect 37.00, regiolect 23.23; U = 232, z = 3.09, p = .002, adjusted p = .016), while the opposite holds for RCs (mean rank dialect 24.52, regiolect 35.30; U = 594, z = −2.42, p = .015, adjusted p = .045) and ICs (mean rank dialect 23.38, regiolect 36.40; U = 627, z = −3.05, p = .002, adjusted p = .016).
With respect to the NRFCs, this difference is likely attributable in part to the fact that such clauses are center-embedded at a higher equally weighted mean relative frequency (and consequently postpositioned at a lower frequency) in regiolect than in dialect and therefore comprise a relatively smaller proportion of total postpositionings in the former. I examine center-embedded clauses in detail in section 3.3 below. The difference also follows from a less frequent occurrence of ICs in the postfield in dialect than in regiolect as a result of the tendency for dialect speakers to produce functionally equivalent clauses in the form of NRFCs, as Patocka (Reference Patocka1997:162) points out in the context of adjunct ICs with final semantics. Patocka (Reference Patocka1997:160) notes that incoherent ICs, that is, ICs that form their own predicate in the postfield, are only roughly 1/50 as frequent as NRFCs in his corpus of Bavarian dialects spoken in Austria. In the current dialect data, the ratio of NRFCs to ICs is roughly 12:1 (74.23% vs. 6.08%), though in EUG dialect alone, the ratio is similar to Patocka’s finding: NRFCs and ICs comprise 75.82% and 1.25% of postpositionings, respectively, yielding a ratio of approximately 60:1.
The smaller proportion of postpositioned RCs in dialect than in regiolect likely follows from the greater use of V2 RCs (see (42) above) as a relativization strategy by dialect speakers than by regiolect speakers.Footnote 43 These RCs occur in the current dataset nearly 2.5 times more frequently in dialect (n = 73) than in regiolect (n = 30).Footnote 44 As discussed in section 2.3 above, such RCs are excluded from the counts of clausal postpositioning; the VF RCs alone make up a relatively smaller proportion of postpositionings in dialect.
The greater tendency among dialect speakers to use V2 RCs instead of their embedded VF counterparts is also reflected in the overall distribution of the 11,027 clauses in the dataset according to clause type (V1/V2 vs. VF), as seen in table 6. Note that these include V1/V2 clauses without an overtly filled right bracket. The equally weighted mean relative frequency of VF clauses is 12.56 percentage points (ppts) lower in dialect than in regiolect; accordingly, the degree of VF embedding is also lower. This difference is statistically significant as assessed by Student’s t-test, t(58) = 5.38, p < .0005, adjusted p < .0005, [95% CI, 7.89 to 17.25].Footnote 45
I now turn to the distribution of clausal postpositionings by function. The relative frequencies in each text are calculated by dividing the number of postpositionings of each type by the total number of postpositionings. The attested functions and their equally weighted mean relative frequencies are presented in table 7.
The group “multiple functions” comprises only those cases of postpositioning that consist of multiple clauses, each with a different function. I assign cases of multiple adjunct and complement clauses to their respective categories. Furthermore, I group non-continuative RCs separately and classify clausal nominal attributes, such as (62) above, as attributive, in contrast to Patocka (Reference Patocka1997:137), who considers only RCs to be attributive and nominal attributive clauses to be “Übergangsformen zwischen Ergänzungs- und Attributsätzen” ‘transitional forms between complement and attributive clauses’. Finally, I include in the category “continuative clauses” continuative RCs and adverbial clauses that are traditionally classified as consecutive, such as (60) above, since these occur obligatorily in the postfield.
Table 7 indicates that in both regiolect and dialect, adjunct clauses occur at the highest equally weighted mean relative frequency and comprise the largest proportion of postpositionings, followed by complement clauses. This pattern also holds for Dubenion-Smith’s (2020:244) analysis of non-clausal postpositioning in dialect. In regiolect, the equally weighted mean relative frequency for complement clauses is only slightly higher than that for RCs. However, if the attributive clauses are categorized as complements, the latter group comprises 28.09% and 26.01% of clauses in regiolect and dialect, respectively, 8.39 and 9.93 ppts higher than the next largest group in these varieties, respectively.
Patocka (Reference Patocka1997:117) includes in his counts of clausal constituents after the right bracket cases with a correlate in the matrix clause, which he considers right dislocations; nonetheless, a comparison of the more detailed breakdown of clausal postpositionings presented in table A2 of the appendix of the present study to Patocka’s results reveals several similarities. First, in both the current dialect data and Patocka’s data for Bavarian dialects spoken in Austria, accusative complement clauses occur as the most frequent complement type by far. Second, causal, final, and temporal adjuncts are the three most common adjunct clause types in both corpora. Third, locative adjunct clauses are not attested in either dataset. This is consistent with Pittner’s (2013:502) assessment of such clauses as marginal structures (Randerscheinungen).
Returning now to the equally weighted mean relative frequencies in table 7, for all functions except for RCs, the frequencies in regiolect vs. dialect are close, within between .68 and 4.52 ppts of each other. The greater difference for RCs here—7.02 ppts—mirrors the difference for RCs in the context of clausal forms, namely 6.78 ppts.
3.3. Center-embedding
In the previous subsection, I proposed that the lower equally weighted mean relative frequency of postpositioned NRFCs in regiolect than in dialect is at least partly attributable to a higher degree of center-embedding of these clauses in regiolect. In this subsection, I take a closer look at the embedding of clauses in the inner field, addressing research question 3.
As I discuss in section 1, there is disagreement in the literature regarding the placement of clausal constituents in the inner field, for example, whether adverbial clauses are barred from occurring in this position (Dürscheid Reference Dürscheid2012:103) or may readily appear there (Duden 2016:1067–1069). Eisenberg (Reference Eisenberg1994:415, Reference Eisenberg1999:391) notes that adverbial and complement clauses may appear in the inner field if they are considered parenthetical, that is, if a small pause occurs before and after the clause that sets it apart from the surrounding material of the matrix clause. However, Patocka (Reference Patocka1997:129) argues that such pauses do not run counter to the embedding of clauses in the matrix structure. He considers constructions as parenthetical only if they cannot be interpreted as embedded, meaning that they are not syntactically linked to the matrix clause, a stance that I adopt in the current work. The goal in what follows is to determine, for the current dataset, what types of clausal constituents occur in the inner field and to what extent. Before moving into the analysis, I shall note that I aim to contrast the placement of clausal constituents in the inner field with placement in the postfield, the primary focus of the current study. As such, I do not consider in this subsection the placement of clauses in the prefield.
To measure the degree of center-embedding in the dataset, it is necessary in an initial step to take stock of where embedded clauses occur, whether only in the postfield, or in both the postfield and inner field.Footnote 46 First, all clauses introduced by the complementizer dass ‘that’, regardless of their function (for example, final adjunct or object complement as in (54) and (58) above), as well as comparative and complement clauses, are attested only in the postfield. Second, some matrix verbs such as glauben ‘to believe’, versuchen ‘to try’, and wagen ‘to dare’, allow for both coherent and incoherent ICs, that is, constructions that are integrated into the predicate of the matrix verb or that form their own predicate (Durrell Reference Durrell2017:283, Rapp & Wöllstein Reference Rapp, Wöllstein, Meibauer, Steinbach and Altmann2013). Those ICs that could occur as either coherent or incoherent constructions are attested only as the latter, in the postfield:
Center-embedded ICs appear only with matrix verbs that require the coherent construction, such as brauchen ‘need’ and haben ‘have’ in the sense of müssen ‘must’, which Durrell (Reference Durrell2017:286–289) terms “semi-auxiliaries”:
Third, a subset of NRFCs, namely adjunct clauses with causal, conditional, and temporal semantics as well as attributive clauses, are attested not only in the postfield (see examples in section 3.2 above), but also in the inner field, as illustrated in (69)–(72).
Finally, non-continuative RCs occur both in the postfield (see (63) above) and the inner field:
Now that I have established in which fields clausal constituents are attested in the dataset, I may examine the frequency of center-embedding. First, I calculate equally weighted mean relative frequencies taking all NRFCs into account, those that occur only in the postfield and those that occur both in the inner field and postfield. I determine the relative frequency in each text by dividing the number of clauses with a center-embedded NRFC (across all texts: 16 regiolect, 4 dialect) by the total number of clauses with an NRFC in the inner field or postfield. I exclude cases in which a clause in the inner field can be interpreted as parenthetical, such as (74).Footnote 47
This yields low equally weighted mean relative frequencies of center-embedding, as shown in table 8.
In a second step, I consider only those types of NRFCs that display both center-embedding and embedding in the postfield and exclude from analysis all texts (3 regiolect, 6 dialect) in which such NRFCs are attested in neither the inner field nor the postfield. When only those constructions for which center-embedding is clearly optional are included in the analysis, the equally weighted mean relative frequencies, based on the relative frequencies in the individual texts calculated as above, are higher in both regiolect and dialect, with a combined frequency of 13.42% (see table 9). Mann–Whitney U tests indicate that for these NRFCs, the relative frequencies in the individual regiolect texts are statistically significantly higher than in the dialect texts (mean rank regiolect 29.69, dialect 21.85; U = 423.5, z = 2.24, p = .025, adjusted p = .045).
The relatively low frequency of embedding and low frequency of occurrence in dialect (four cases) is consonant with Patocka’s (1997:129–131) findings for Bavarian dialects spoken in Austria; in his dataset, four instances are attested, one temporal and conditional adjunct clause each as well as a second temporal clause and a subject complement clause, both with a pronominal correlate in the inner field.
To conclude, the present findings support the proposal in section 3.2 that the smaller proportion of NRFCs, among all postpositioned clauses, in regiolect than in dialect is partly attributable to the fact that these are center-embedded more frequently in the former than in the latter.
I now turn to the center-embedding of RCs, of which 38 cases are attested in regiolect and 7 in dialect. Here I exclude from analysis all texts (1 regiolect, 13 dialect) in which embedded RCs are attested in neither the inner field nor the postfield. I also eliminate all cases in which the antecedent to the relative pronoun occurs in the postfield, known in the literature as the Attraktionsprinzip ‘attraction principle’ (Kromann Reference Kromann1974:67, Zahn Reference Zahn1991:217, Filpus Reference Filpus1994:157–159, Patocka Reference Patocka1997:352–353, Vinckel Reference Vinckel2006:79–81), which contributes to what Uhlig (Reference Uhlig1972) terms blockbildendes Sprechen ‘block-forming speech’ and Zifonun, Hoffmann & Strecker (Reference Zifonun, Hoffmann and Strecker1997:1669–1671) call Informationsentflechtung ‘disentanglement of information’, illustrated in (75) (antecedent das Leinenzeug ‘the linen’, relative pronoun was ‘that’).
Likewise, I exclude cases of RC postpositioning with an antecedent in the prefield, as in (38) above, as well as continuative RCs such as was eigentlich schade ist ‘which is too bad, actually’ in (76), which occur obligatorily in the postfield.
The results show that the equally weighted mean relative frequency of RC embedding is 28.17% in regiolect and dialect combined and that regiolect displays a higher equally weighted mean relative frequency than dialect (see table 10). A Mann–Whitney U test indicates furthermore that the relative frequencies in the individual regiolect texts are statistically significantly higher than in the dialect texts (mean rank regiolect 26.86, dialect 17.76; U = 344, z = 2.31, p = .021, adjusted p = .045).
Considered together with the cases of clausal postpositioning in the dataset, the center-embedded NRFCs and RCs comprise 9.93% of all embeddings. The analysis has revealed that certain clause types, for example, complement clauses, appear to be fully grammaticalized in the postfield. However, the overall equally weighted mean relative frequencies of center-embedding for NRFCs that exhibit optional center-embedding and for RCs, 13.42% and 28.17%, respectively, indicate that such embedding is not a marginal phenomenon in the dataset. Furthermore, the findings contradict claims by some scholars regarding the placement of embedded clauses, such as Dürscheid’s (2012:103) assertion that conjunctional clauses may only appear in the pre- or postfield and Rath’s (1965:231) remark that RCs and conjunctional clauses “‘fast immer’ ausgeklammert werden” ‘are almost always postpositioned’. The results also indicate that the relative frequencies of center-embedding in the individual texts, for both NRFCs and RCs, are statistically significantly higher in regiolect than in dialect. In the case of NRFCs, this finding provides explanatory evidence for the fact that these clauses constitute a lower proportion of all postpositionings in regiolect than in dialect.
3.4. Final Embedding
To address research question 4, I examine more closely the frequency of final embedding, that is, embedding in the postfield.
Recall from section 3.1 that the dataset contains 5,923 clauses with (a) constituent(s) other than ø, a pronoun, or negator in the inner field and/or (a) postfield constituent(s). To determine for each text the relative frequency of postpositioning in V1/V2 clauses, I divide the number of VF clauses that are embedded in the postfield of a V1/V2 clause with an overtly filled right bracket by the total number of such V1/V2 clauses in the text (3,684 clauses across all texts). The calculations of final embedding in VF clauses, on the other hand, are less straightforward since a VF clause that embeds a VF clause may be embedded in a VF clause itself. To control for this, I calculate for each text the relative frequency of postpositioning in VF clauses by dividing the number of VF clauses that are embedded in a VF clause by the total number of VF clauses (2,239 across all texts) minus the number of VF clauses that are embedded in another VF clause (164 across all texts). In other words, the denominator for the calculations reflects only the number of VF clauses that are syntactically linked to a V1/V2 clause. Finally, to calculate the overall relative frequencies, I divide the sum of the number of VF clauses that are embedded in the postfield of a V1/V2 clause with an overtly filled right bracket and the number of VF clauses that are embedded in the postfield of a VF clause by the total number of V1/V2 and VF clauses minus the number of VF clauses that are embedded in another VF clause. That is, I sum the numerators from the separate calculations for final embedding in V1/V2 and VF clauses alone and divide this value by the sum of the denominators for these calculations.
The overall equally weighted mean relative frequencies in regiolect and dialect are presented in table 11. The equally weighted mean relative frequency is 4.24 ppts higher in regiolect than in dialect, a difference that is statistically significant as assessed by Student’s t-test, t(57) = 3.09, p = .003, adjusted p = .018, [95% CI, 1.50 to 7.00].Footnote 48
Table 12 shows that the higher equally weighted mean relative frequency of postpositioning in regiolect than in dialect also holds for V1/V2 and VF clauses separately. Mann–Whitney U tests indicate that the relative frequencies of postpositioning in both V1/V2 and VF clauses are statistically significantly higher in the individual regiolect texts than in the dialect texts (V1/V2: mean rank regiolect 36.17, dialect 23.62; U = 250, z = 2.81, p = .005, adjusted p = .025; VF: mean rank regiolect 35.60, dialect 24.21; U = 267, z = 2.57, p = .01, adjusted p = .04).
While the equally weighted mean relative frequency of postpositioning in V1/V2 clauses is higher in regiolect than in dialect, this holds here only for V1/V2 clauses with an overtly filled right bracket, not for V1/V2 clauses on the whole, including those in which the right bracket is not overtly filled. To determine whether the finding holds for all V1/V2 clauses, I examine in a subsequent step the occurrence of VF clauses after V1/V2 clauses without an overtly filled right bracket to which they are syntactically linked. Granted, the status of such VF clauses as center- or finally embedded is ambiguous, as I discussed in section 2.3 above, yet embedded in some way nonetheless. For each text, I calculate the relative frequency of postpositioning by dividing the number of VF clauses that occur after V1/V2 clauses without an overtly filled right bracket to which they are syntactically linked by the number of such V1/V2 clauses in each text. The equally weighted mean relative frequencies in regiolect and dialect are shown in table 13. As with the V1/V2 clauses with an overtly filled right bracket, the equally weighted mean relative frequency is higher in regiolect than in dialect. The 11.75 ppt difference is statistically significant as assessed by Welch’s t-test, t(46.61) = 5.55, p < .0005, adjusted p < .0005, [95% CI, 7.49 to 16.00].Footnote 49
Taken together, the findings presented in tables 11–13, in tandem with the results shown in table 6 above, suggest that dialect displays more parataxis (or less hypotaxis) than regiolect, as the frequency of (final) VF embedding, in both V1/V2 and VF clauses, is lower in dialect. A tendency toward parataxis in dialect has been noted in the literature, for example, Löffler (Reference Löffler1990, Reference Löffler2003) (see also Patocka Reference Patocka1997:89–91). Löffler (Reference Löffler2003:110) contends that some syntactic features that have been described as specific to dialect, such as a tendency toward parataxis, are, rather, characteristics of spoken (as opposed to written) language (for related discussion, see Lötscher Reference Lötscher2004, Louden Reference Louden2005, and Schwitalla Reference Schwitalla2012:26–35, 129–135). While this is accurate to a certain degree, as Patocka (Reference Patocka1997:89) remarks, the present study indicates, through the juxtaposition of spoken regiolect and dialect data, that the observed frequency of VF clause embedding in dialect is not solely a consequence of its spoken nature but is partly attributable to variety. Löffler (Reference Löffler2003:5) also attributes differences between dialect and the standard language with respect to parataxis versus hypotaxis to differences in the frequency of use of various grammatical constructions that are available to the speaker, an assessment consistent with Patocka (Reference Patocka1997:91). One such example in the current dataset is dialect speakers’ more frequent use of V2 RCs instead of VF RCs, discussed in section 3.2. As noted there, however, the exact syntactic status of V2 RCs and nature of the linkage to the matrix clause is a matter of debate in the literature. In addition, certain V2 clauses such as complement clauses, for example, ich glaube, er hat recht ‘I believe he’s right’, can be considered semantically subordinate, regardless of what their precise syntactic relationship to the matrix clause may be. Therefore, to make a more definitive statement regarding the degree of parataxis in dialect vs. regiolect in the current dataset, it would be necessary to account for and quantify such cases. This must be left to future work; for now, it may be noted that the frequency of embedding of VF clauses is lower in dialect than in regiolect.
The results in table 12 above also indicate for both regiolect and dialect that the equally weighted mean relative frequencies of final embedding are higher in V1/V2 clauses with an overtly filled right bracket than in VF clauses. This holds for all three clause forms: NRFCs, RCs, and ICs. The values for V1/V2 clauses and VF clauses, for regiolect and dialect combined, are presented in table 14. A paired t-test indicates that the 7.47 ppt difference in equally weighted mean relative frequencies is statistically significant, t(58) = 7.60, p < .0005, adjusted p < .0005, [95% CI, 5.50 to 9.44].Footnote 50
One possible explanation for this finding is that VF clauses, which can serve themselves as matrix clauses for further VF embedding, may appear not only in the postfield of a V1/V2 clause, but also in the inner field, as discussed in section 3.3 above, or in the prefield of a V2 clause, such as nachdem die Gerüste gefallen sind ‘after the scaffolding fell’ in (77).
Material from the (V1)/V2 clause immediately follows a VF clause embedded in its prefield or inner field, but this is typically not the case when the VF clause is embedded in its postfield.Footnote 51 Therefore, final embedding in VF clauses that are themselves embedded in the prefield or inner field of a (V1)/V2 clause may be more restricted than in VF clauses that are embedded in the postfield of a V1/V2 clause. This restriction could lead to an overall lower relative frequency of embedding in VF clauses vs. V1/V2 clauses. Indeed, the dataset contains only 8 cases (5 dialect, 3 regiolect) of a VF clause finally embedded in a VF clause that, in turn, is located in the prefield of a V2 clause, as illustrated in (78).
Cases of a VF clause finally embedded in a VF clause that, in turn, is located in the inner field of a V1/V2 clause, are unattested.
Although only 8 cases such as (78) occur in the dataset, VF clauses occur infrequently in the prefield of V2 clauses in the first place. Since the focus of the current study is postpositioning, I have not yet examined clausal placement in the prefield exhaustively; however, a preliminary analysis of the prefield in 24 randomly selected texts from the dataset (12 regiolect, 12 dialect) reveals that the equally weighted mean relative frequency of clausal embedding in the prefield is 4.91% in regiolect and 2.98% in dialect. Therefore, matrix VF clauses in this position would be expected to have a minimal effect on the overall frequency of postpositioning in VF clauses.
A more likely explanation for the difference between V1/V2 and VF clauses with respect to the frequency of final embedding lies in restrictions on syntactic subordination. A VF clause postpositioned in a V1/V2 clause results in a single level of final embedding, as illustrated in the following example:
A VF clause postpositioned in another VF clause, on the other hand, typically entails two levels of final embedding since the higher VF is syntactically linked to a V1/V2 clause, as in (80).
Laury & Ono (Reference Laury, Ono and van der Hulst2010:76–79) demonstrate that cross-linguistically, the frequency of embedding in spoken language drops off steeply after a single level, which they attribute to processing constraints and the lack of a functional need for multiple embeddings (see also Chafe Reference Chafe1994, Gibson Reference Gibson1998, and Cowan Reference Cowan2000). Their findings align with those presented in Karlsson (Reference Karlsson2010), an analysis of final embedding. Less frequent clausal postpositioning in VF clauses than in V1/V2 clauses would therefore be expected since minimally two levels of embedding are usually involved in the former but only a single level in the latter.
The levels of embedding in the current dataset clearly reflect this tendency. If only cases of final embedding in both V1/V2 clauses with an overtly filled right bracket and VF clauses are considered, regiolect exhibits up to four levels of syntactic embedding and dialect up to two.Footnote 52 The four levels are illustrated in the following examples:
The equally weighted mean relative frequencies of these levels of embedding are shown in table 15. As the table indicates, both regiolect and dialect exhibit final embedding predominantly at the single level, consistent with Laury & Ono’s (2010) findings. These results support the hypothesis that the higher degree of postpositioning in V1/V2 than in VF clauses is attributable to constraints on syntactic embedding.
Patocka (Reference Patocka1997:149–158) presents the results of an analysis of clausal embedding in his dataset to which the current dataset may be compared. Since he includes VF clauses that appear after V1/V2 clauses without an overtly filled right bracket in his analyses, I also examine such cases to ensure better comparability. However, it must be noted that the exact status of VF clauses occurring after a V1/V2 clause without an overtly filled right bracket is ambiguous, as discussed in section 2.3 above. If V1/V2 clauses without an overtly filled bracket are also taken into account, maximally five levels of embedding are attested in regiolect and two in dialect. These levels of embedding after such V1/V2 clauses are illustrated in (85)–(88).Footnote 54
The equally weighted mean relative frequencies of these levels of embedding are shown in table 16. As the results indicate, the equally weighted mean relative frequencies of levels of embedding, including cases after a V1/V2 clause without an overtly filled right bracket, are nearly identical to the frequencies when embedding in V1/V2 clauses only in which the right bracket is overtly filled is taken into account.
For his Bavarian dialect data, Patocka (Reference Patocka1997:149) finds that 82.3% of constructions with at least one embedded subordinate clause involve one clause, 13.4% involve two, 3.1% three, and 1.2% more than three. It is important to note that the number of subordinate clauses does not necessarily correspond directly to the level of embedding as Patocka counts multiple clauses at the same level of embedding as individual instances and remarks that in roughly 60% of cases that involve two or three subordinate clauses, these occur at the same level of embedding (p. 150). Furthermore, Patocka includes in his analysis right dislocations without further embedding. As such, his results are not directly comparable to those for the current dataset, even after inclusion of cases of embedding in V1/V2 clauses without an overtly filled right bracket. Nonetheless, in both studies, the vast majority of cases of embedding involve one clause, which corresponds to a single level, a finding consistent with Laury & Ono (Reference Laury, Ono and van der Hulst2010).
4. Summary and Outlook
In this final section, I summarize the main findings related to the four research questions presented at the outset and discuss some avenues for further research.
RQ1: Which forms and functions of clausal postpositioning are attested in the dataset?
RQ2: How are postpositioned clauses distributed in dialects and regiolects according to form and function?
With respect to research questions 1 and 2, the results in section 3.2 showed that in both regiolect and dialect, clausal postpositionings most frequently take the form of NRFCs, followed by RCs then ICs. However, NRFCs comprise a smaller proportion of postpositionings in regiolect than in dialect, while RCs and ICs make up a smaller proportion in dialect than in regiolect. I attributed the lower equally weighted mean relative frequency of NRFCs in regiolect and higher frequency in dialect to the fact that these clauses are center-embedded more frequently (and consequently postpositioned less frequently) in regiolect than in dialect, which I demonstrated in section 3.3, and that dialect speakers tend to produce functionally equivalent NRFCs in place of ICs, thereby resulting in a greater proportion of NRFCs among all postpositioned clauses in dialect. As for the larger proportion of RCs in regiolect than in dialect, I argued that the observed difference follows from a greater tendency among dialect speakers to produce V2 RCs rather than embedded VF RC structures. The analysis of postpositionings by function revealed that adjunct, attributive, comparative, complement, continuative, and relative clauses are attested in the dataset, with adjunct clauses occurring at the highest equally weighted mean relative frequency in both regiolect and dialect, followed by complement clauses. In contrast to the forms, the equally weighted mean relative frequencies for the functions in regiolect vs. dialect, except for RCs, are close, within 4.52 ppts or less.
RQ3: To what extent can clausal constituents that appear in the postfield also appear in the inner field?
The analysis of center-embedding in section 3.3 showed that all clauses introduced by the complementizer dass ‘that’, comparative and complement clauses, and all ICs that can occur as either coherent or incoherent constructions are attested only in the postfield. By contrast, a subset of NRFCs, namely adjunct clauses with causal, conditional, and temporal semantics as well as attributive clauses, and non-continuative RCs appear both finally and center-embedded in the dataset. For those NRFCs that display optional center-embedding and for RCs, the overall equally weighted mean relative frequencies of center-embedding are 13.42% and 28.17%, respectively. These findings indicate that embedding in the inner field is possible and not a marginal phenomenon, contradicting some claims in the literature.
RQ4: What is the frequency of clausal postpositioning according to variety?
To address research question 4, I calculated equally weighted mean relative frequencies of postpositioning in both V1/V2 clauses and VF clauses. The results indicated that clausal postpositioning occurs more frequently in regiolect than in dialect, in both V1/V2 and VF clauses. In addition, an analysis showed that postpositioning is more frequent in V1/V2 clauses with an overtly filled right bracket than in VF clauses. I proposed that this finding is attributable to restrictions on syntactic subordination and to the cross-linguistic tendency, demonstrated in Laury & Ono (Reference Laury, Ono and van der Hulst2010), for the frequency of embedding to drop off steeply after a single level; while final embedding in V1/V2 clauses involves a single level, embedding in a VF typically involves minimally two levels since the VF matrix clause is usually embedded in a V1/V2 clause itself. Detailed examinations of levels of embedding demonstrated that regiolect exhibits up to five levels and dialect up to two and that the vast majority of embeddings (c. 90–95%) exhibit a single level, consistent with the findings in Laury & Ono (Reference Laury, Ono and van der Hulst2010) and with Patocka’s (1997) analysis of levels of clausal embedding in Bavarian dialects spoken in Austria.
The present study contributes to our knowledge of the clausal syntax of spoken German varieties. It also serves to document the clausal postpositionings that occur in German regional language and lays the methodological groundwork for ongoing investigations of non-clausal postpositioning that build on Dubenion-Smith (Reference Dubenion-Smith and Xing2020) and for future related studies. For example, since the present article focuses primarily on postpositionings and secondarily on center-embedded constituents, the investigation could be expanded to include right- and left-dislocated constituents and a detailed analysis of the prefield. Given the high degree of variation in the placement of RCs, particularly in regiolect, a more detailed, multi-causal analysis of RC placement that is based on a larger set of regiolect data and that takes into account various RC subtypes (for example, restrictive vs. non-restrictive RCs) would be a fruitful avenue for research. Finally, since the data utilized in this study were collected over 60 years ago, it would be beneficial to compare the results with examinations based on more recently collected data to assess the continued validity of the findings and to what extent they hold true. Indeed, as Schmidt (Reference Schmidt2011) and Schmidt & Herrgen (Reference Schmidt and Herrgen2011) demonstrate (see also, for example, Auer & Schmidt Reference Schmidt, Auer and Erich Schmidt2010, Herrgen & Schmidt 2019, and Christen et al. Reference Christen, Ganswindt, Herrgen and Schmidt2020), German regional varieties have long undergone significant reevaluation processes and changes, which continue today. Thus, diachronic examinations may yield valuable new insights.
Appendix