Hostname: page-component-6bf8c574d5-rwnhh Total loading time: 0 Render date: 2025-02-22T14:01:57.410Z Has data issue: false hasContentIssue false

The three degrees of metrical strength in Strict CV metrics, a theory without parsing

Published online by Cambridge University Press:  20 February 2025

Noam Faust*
Affiliation:
Université Paris, Paris, France
Shanti Jeremiah Ulfsbjorninn
Affiliation:
Memorial University of Newfoundland, St. John’s, NL, Canada
*
Corresponding author: Noam Faust; Email: [email protected]
Rights & Permissions [Opens in a new window]

Abstract

Typology establishes three degrees of metrical strength. Foot-based theories designate the intermediate degree as that of unparsed syllables, i.e. syllables that are not part of a foot. However, this denotation of parsing mispredicts massively; moreover, there is no real reason why such unparsed syllables should be of intermediate prosodic strength (as opposed to the weakest or strongest). This paper presents an alternative account in Strict CV metrics (Ulfsbjorninn 2014, Faust & Ulfsbjorninn 2018). The correct three-way hierarchy follows from the basic operation of the theory, namely incorporation, whereby one nucleus becomes prominent by incorporating metrical significance from another nucleus. Examples come first from the more classical cases of Dutch and English and then from three test-cases provided by unrelated languages: St’át’imcets (Lillooet Salish), Burmese, and Tiberian Hebrew. No appeal is made to the notion of parsing.

Type
Research Article
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (http://creativecommons.org/licenses/by/4.0), which permits unrestricted re-use, distribution and reproduction, provided the original article is properly cited.
Copyright
© The Author(s), 2025. Published by Cambridge University Press

1. Introduction

The concept of parsing is almost ubiquitous in mainstream analyses of stress assignment. Syllables, moras, and segments can be either parsed into feet or not, and the latter option is regarded as undesirable.

For instance, Becker (Reference Becker2022) discusses the input /muʃkila/ ‘problem’ in Egyptian Arabic. Two candidates are compared:[(múʃ)kila] and [(muʃ)(kíla)] (brackets stand for feet, acute accent for stress). The first candidate violates a constraint Parse-σ (McCarthy & Prince Reference McCarthy and Prince1993, Prince & Smolensky Reference Prince and Smolensky2004), because the last two syllables are not footed. It is therefore dispreferred. Elsewhere in that paper, and indeed in the metrical literature on optimality theory, candidates with unparsed syllables, moras, or segments do emerge as optimal. The claim in this literature is therefore that while segmental and prosodic material may remain unparsed in some circumstances, all other considerations being equal, it should be fully parsed into feet.

At the same time, the literature on the interaction of stress and segment strength designates unparsed syllables as stronger than parsed ones, when the latter are foot dependents. Possibly the most famous example of this is the following data point from Dutch, which van Oostendorp (Reference van Oostendorp2000) attributes to Booij (Reference Booij1976). The orthographic fonologie is always pronounced with initial and final stresses, the latter being more prominent. The two intervening, unstressed syllables can be fully pronounced [fònoloɣí] or fully reduced [fònələɣí]. Another option is for only the second vowel to be reduced [fònəloɣí]. The fourth, conceivably possible, option *[fònoləɣí] is nevertheless ungrammatical. van Oostendorp (Reference van Oostendorp2000: 147) cites Kager (Reference Kager1989: 313) as providing the following footing /(fòno)lo(ɣí)/; the second syllable is a foot dependent, the third unparsed. Kager proposes the three-way hierarchy of metrical strength in (1), which we will dub the hierarchy of prosodic strength (HPS):

It follows from this hierarchy that the reduction of the unfooted syllable implies the reduction of the foot dependent, ruling out the unattested *[(fòno)lə(ɣí)].

The HPS and Parse-σ seem to make different predictions. If the preference is for prosodic material to be footed, then unfooted syllables are somehow non-optimal. Given an input like Arabic /katabu/ ‘they wrote’ that must have initial stress (as in many Arabic dialects), avoidance of unfooted syllables – that is, Parse-σ – should be able to result in an output without the unfooted syllable [(kátab)]. The HPS in turn suggests that such a result should never be possible, because a process of deletion targets an unfooted syllable but not a dependent one. Instead, [(kát)bu] is expected. The former option – deletion of the vowel of unparsed syllables – does not clearly occur in any Semitic language, whereas the latter is common. Parse-σ mispredicts.

To be sure, the problem is of course much wider than Semitic. Parse-σ predicts all kinds of weakening effects in unparsed syllables, which simply do not seem to occur. For instance, one can imagine a language in which all consonants in unfooted syllables, medial or final, must be placeless; no such language exists (to the best of our knowledge). Other mispredictions of Parse-σ are (i) a language in which all words, complex or simplex, have an even number of syllables on the surface, thereby avoiding unparsed syllables (or degenerate feet) anywhere in the language; (ii) a language with final epenthesis only in words with odd number of syllables; and (iii) a language in which entire syllables are systematically deleted when unparsed (/katabu/ => [káta], /katabunimi/ => [kàtabúni]; etc.).

While the HPS seems to be more on the right track, it too suffers from being partially unmotivated. Consider the representation of the Italian word [pɛ́kora] ‘sheep’ in (2), taken from Krämer (Reference Krämer2009). The final syllable is unfooted – it is associated directly to the level of the phonological word. Both [ra] and [ko] are dependents, and neither is stressed, what in this representation makes [ko] weaker than [ra]?

While the moraic representation of unfooted syllables does point to them being different from syllables in the foot-dependent position, it does not express which of the two is stronger than the other.

Given the above, a model of the relevant phenomena must do two things: (a) express the HPS and (b) do so in a way that follows from the representation. Preferably, this goal must be achieved without massively mispredicting like Parse-σ.

This is what the present paper aims to do. The next subsection outlines two attempts that have been made in the past to capture the HPS. It then introduces our own proposal, within the framework of Strict CV metrics. The rest of the paper then consists of three test cases, each in its own subsection. The first two subsections cover cases for which the difference between footed and unfooted syllables has been proposed in the past. Section 3 looks at (some) final syllables in St’át’imcets (Lillooet Salish) (Caldecott Reference Caldecott2009). It is shown that our proposal presents the best fit for the data. Section 4 moves on to Burmese minor syllables (Green Reference Green and Watkins2005), which feature in the type of prosodic shape called sesquisyllables. The third case (Section 5) is that of Tiberian Hebrew (TH) [i]~[ə]~Ø alternation, which we believe is also best understood using this distinction – though no study known to us has implemented this in the past. In all three cases, the HPS holds, and it is derived in a non-arbitrary manner from the representations of Strict CV metrics. The concept of parsing is shown to be unnecessary in this approach, which therefore does not carry its mispredictions. Section 6 concludes.

2. Motivating the hierarchy of prosodic strength

2.1. Two previous attempts

van Oostendorp (Reference van Oostendorp2000:147 et passim) discusses the Dutch data reviewed above, which Kager analyzed as involving unfooted syllables /(fòno)lo(ɣi)/. van Oostendorp, in turn, provides the fully parsed representation in (3), in which SFt refers to superfoot.

Crucially for the present purpose, the proposed structure does express the part of the HPS that remains unexplained by Kager: whereas the weaker /no/ is a foot dependent, the stronger /lo/ heads its own non-branching foot. This difference is used in order to derive the higher resistance of /lo/ to weakening, in an analysis whose details are too intricate to present here. However, why /lo/ is weaker than /fo/ is now not so clear – both are foot heads and neither carries the main stress (indeed, the assignment of secondary and main stresses to the first and last syllables respectively is not accounted for by the structure). We will return to this proposal in the next subsection; as we will see, it is not a good fit for the data of that subsection.

Balogné Bérces (Reference Balogné Bérces2011) addresses a similar asymmetry in English, this time regarding the famous consonant lenition process known as flapping. It has long been noted (see references in the original paper) that /t/ is more likely to lenite into a flap immediately after the stressed vowel than when separated from that vowel by an unstressed syllable. The chief example is orthographic competitive, which can be pronounced [kʰəmpʰéɾəɾɪv] or [kʰəmpʰéɾəɪv], but not *[kʰəmpʰéəɾɪv]. Like in the Dutch example, lenition in the second syllable after the stress implies lenition in the immediately posttonic syllable.

Balogné Bérces, like the present paper, uses Strict CV (Lowenstamm Reference Lowenstamm, Durand and Laks1996, Scheer Reference Scheer2004). The basic tenet of this framework is the following: all words are represented with a skeletal tier, and that skeletal tier consists of a single, repeated unit, the CV unit. Thus, the English word [pɪ́stəl] ‘pistol’ in (6) involves an empty V-slot between the consonants [s] and [t] and also ends in an empty V-slot. Every contentful V-slot entertains two lateral relations with preceding positions: government and licensing. Government inhibits realization; licensing enhances it. Contentful V-slots preferably govern preceding empty V-slots (full arrow), allowing them to remain empty. They also license their onsets, if there is one (dashed arrow), allowing them to be unlenited. According to Scheer (Reference Scheer2004), both government and licensing must be dispensed.

Balogné Bérces proposes that stressed vowels in open syllables require licensing from a following vowel. In (5), /ə/ licenses [é]. This leaves /t1/, the onset of licensor /ə/, unlicensed. Conversely, /t2/ is both governed and licensed from the following nucleus. Both /t/s are weak, since both are governed, but /t2/ is stronger because it is also licensed. Balogné Bérces further assumes that stressed vowels do not govern their onsets; /p/ is consequently only licensed, not governed, and is therefore always realized as fortis [pʰ] – stronger than either of the /t/s. A three-way strength distinction is derived: (onset of) foot head > unfooted > foot dependent.

Based on the two assumptions above, this account derives the three-way strength distinction in consonants mainly from lateral relations. A major limitation that we find is that, while the parallel between the Dutch and English facts is certainly real, the account does not straightforwardly carry over to vocalic strength. As we will see below, our own proposal can cover both vocalic and consonantal strength.

2.2. Strict CV metrics

The first systematic study examining stress assignment in Strict CV was Scheer & Szigetvári (Reference Scheer and Szigetvári2005). It proposed that depending on the language, empty nuclei could be metrically significant, as signaled by the grid mark in the Italian word in (6).Footnote 1

Scheer & Szigetvári equated metrically significant empty nuclei to moraic codas. And since empty V-slots are independently necessary in Strict CV, they concluded that within this approach, only V-slots, empty or full, project.Footnote 2 Mora-projecting consonants are redundant in Strict CV. With these tools, Scheer & Szigetvári showed that many weight-sensitive languages are in fact simpler cases of fixed-stress systems. However, there remained many truly weight-sensitive languages that resist such a treatment.

True weight-sensitivity was first given a principled treatment in this framework by Ulfsbjorninn (Reference Ulfsbjorninn2014). Ulfsbjorninn assumed that universally, nuclei occupied by full vowels project to L2; metrically significant empty nuclei, in turn, project to L1, as in (7a) (L1, L2 etc. are projection levels). Then, in a process dubbed incorporation, the metrical potential of the empty nucleus (*α) is amassed with that of the nucleus to its left, as depicted in (7b). Incorporation results in a prominent initial syllable.Footnote 3

As later proposed by Faust (Reference Faust2023), the process of incorporation is motivated, in this case, by the impossibility of realizing metrical significance on an empty position. This logic featured prominently in Faust & Ulfsbjorninn (Reference Faust and Ulfsbjorninn2018), an account of the quantity-sensitive systems of Cairene and Palestinian Arabic. Other studies that use and develop the mechanism of incorporation include Ulfsbjorninn (Reference Ulfsbjorninn2021), Faust (Reference Faust2023), Faust & Ulfsbjorninn (Reference Faust and Ulfsbjorninn2024) and Shikunova (Reference Shikunova2024).

Since the only metrically significant constituent is the nucleus (full or empty), Strict CV likens heavy syllables (CVV, CVC), which always involve an empty V-slot dependent, to the classic two-syllable, binary foot (Ulfsbjorninn Reference Ulfsbjorninn2014). Faust (Reference Faust2023) formally unites these two configurations under the mechanism of incorporation, which he proposes can hold between two contentful nuclei. For instance, in words with three open syllables and no lexical stress, such as Rioplatense Spanish [kaméʃo] ‘camel’, the underlying representation corresponds to (8a). This representation lacks prominence – and every content word in Spanish must have at least one prominent syllable. In order to achieve prominence, by default one grid mark from the final vowel is incorporated into the penultimate one as in (8b), making the latter prominent.

Thus, incorporation underlies both the prominence of closed syllables over open ones and the arrangement of syllables into alternating <weak, strong> pairs.

Strict CV metrics differs from the mainstream moraic theory on several grounds. First, feet (syllabic or moraic) are not a primitive of the system: they are derived by incorporation.Footnote 4 Second, metrical significance is more fine-tuned than with moras. The higher a vowel projects, the more prominent it is; in moraic theory, a segment either projects a single mora or it does not. Third, and most importantly, incorporation is a zero-sum process: what one nucleus wins in prominence through incorporation is taken from the other.

Because of the last two properties, the system derives three types of realized nuclei – in (8b), unincorporated [ka], incorporator [me] and incorporatee [ʃo].Footnote 5 These correspond to what moraic theory identifies as unfooted, foot head, and foot dependent, respectively. However, unlike in moraic theory, the respective strengths of the three types of nuclei follow from the representation. In (8b), incorporated nuclei project to L1 and are therefore weaker than unincorporated ones, which project to L2. These in turn are naturally weaker than incorporators, which project to L3. Strict CV metrics therefore manages to do exactly that which was defined as necessary in the introduction: (a) express the HPS and (b) do so in a way that follows from the representation.

Strict CV metrics easily derives the Dutch case. Assume all vowels except the stressed one start out projecting to L2 (9a; like van Oostendorp Reference van Oostendorp2000, we abstract away from the assignment of main stress to the final vowel). Incorporation in Dutch proceeds from left to right to resolve lapse, defined as two adjacent nuclei projecting to the same level. /no/ is incorporated, assigning prominence to /fo/ (9b). Since /ɣi/ bears main stress, /lo/ does not pose the problem of a lapse and remains unincorporated.

The phonetic interpretation of this strength asymmetry follows causally from the representation in (9b). Either there is no schwa reduction: [fònoloɣí], or speakers may reduce only /o/’s that project only to L1 (such as V2): [fònəloɣí]. They may also reduce /o/’s that project even to L2 (V3), but if they do, they must simultaneously reduce the /o/’s at L1 (V2) also: [fònələɣí]. The inverse, reducing V3 but not V2 *[fònoləɣí], would be incoherent with (9b).

Incorporation-based metrics can derive the consonantal asymmetry of English in the same manner. The stressed vowel /e/ incorporates one of the grid-marks of the following V-slot, becoming prominent. The next vowel /ɪ/ in V3 is not an incorporator because it is final (and the final empty nucleus is not metrically significant in English [Burzio Reference Burzio1994]), nor does it need to be incorporated because it does not pose the problem of a lapse. One may now say that onsets are obligatorily weakened before nuclei that project to L1 (V2), and optionally so before nuclei that project to L2 (V3). Again, the latter configuration clearly implies the former but not vice versa.

It seems to us that incorporation-based Strict CV metrics transparently derives both asymmetries.

Before we continue, it is important to add that in this system, there is no need at all for the concept of parsing. This, it seems to us, is a welcome result, as we saw that Parse-σ mispredicts massively, implicating the existence of whole phenomena that are unattested. Instead, unincorporated nuclei – the equivalent of unfooted or unparsed syllables in moraic theory – are not a problem in principle, because there is no requirement to parse anything: only to create prominence asymmetries.

The rest of the paper further illustrates the application of this approach to data sets from three unrelated languages with very different phonological systems, all requiring the three-way distinction above. The first of the three is a Salish language.

3. The extrapod, the ternary foot, and incorporation

In this subsection, we show that Strict CV metrics is a better fit for the unfooted syllable of St’át’imcets (Lillooet Salish) than any of the previously proposed formalizations.

Caldecott (Reference Caldecott2009) carefully examines the prosodic structure of St’át’imcets. Descriptively, the language has a trochaic pattern and – in words with more than one foot – the head of the rightmost foot bears primary stress (Caldecott Reference Caldecott2009:115). Codas do not contribute weight.

Much of Caldecott’s dissertation is devoted to a particular and regular structure in the language, namely final syllables after a foot, e.g. [(čún-tu)muɬ] ‘order us’. Caldecott refers to these as extrapods and shows that they behave unlike either foot heads or foot dependents.

Extrapods are distinct from foot heads in their prosodic behaviour. The language has a strict prosodic condition by which the final foot of a phrase gets a H(igh) boundary tone followed by a L(ow) tone. If that foot is disyllabic and aligned to the right edge, the H tone rests on the foot head and the low tone on the foot dependent (12a). If the final word is monosyllabic, the only foot must be a degenerate foot, and the entire HL contour rests on the only syllable (12b). In contrast, if the last syllable in the phrase is an extrapod, the HL prosody ignores it and is realized on the preceding foot (12c). Caldecott concludes that the extrapod cannot be the head of a degenerate foot, or it would behave like the only syllable of phrase-final monosyllabic words. The footing differences between (12a) and (12c) is due to preceding material.

Recall that van Oostendorp’s proposal, (3) above, designated the second syllable after stress as a degenerate foot under a superfoot. That also cannot be the correct analysis for the St’át’imcets data, or the HL prosody would appear on the extrapod.

If extrapods are not degenerate feet, maybe they are foot internal, i.e. they are somehow a dependent of the foot that precedes them? There are two ways in which this can be expressed: (a) the ordinary ternary foot of Nespor & Vogel (Reference Nespor and Vogel1986) (13a) or (b) with an internally layered foot (ILF) (13b) (cf. Ladd Reference Ladd1986; Martínez-Paricio Reference Martínez-Paricio2013; Martínez-Paricio & Kager Reference Martínez-Paricio and Kager2015, Reference Martínez-Paricio and Kager2016; Kager & Martínez-Paricio Reference Kager and Martínez-Paricio2018; Martínez-Paricio & Torres-Tamarit Reference Martínez-Paricio and Torres-Tamarit2018).

Using the diagnostic of vowel reduction, Caldecott shows that neither of the two ternary approaches fits the facts. In St’át’imcets, a general process reduces unstressed /a/ to [ə] in all unstressed syllables except the extrapod (Caldecott Reference Caldecott2009:58, 62).

The ordinary ternary foot does not distinguish structurally between the two non-heads, wrongly predicting that both should be equally weak; the vowels of both syllables should be reduced, contrary to facts. Similarly, in the ILF both unstressed V-slots are foot final, so both should exhibit non-head status; while only the extrapod is foot final in the maximal foot, that difference is not straightforwardly translatable into a difference in strength. To this one might add that both approaches are further challenged if one considers that prosodic strength is licensed by the foot head (Itô Reference Itô1988, Harris Reference Harris1997), and the further away from the foot head, the weaker a position should be. If anything, approaches that admit ternary feet should predict that the extrapod would be even weaker than the non-head of the minimal foot.

Since the extrapod is different from both the foot dependent and the foot-head, Caldecott concludes that it is neither. It is simply an extrapod – an unfooted syllable. Since this unfooted syllable is more resistant to reduction than the foot dependent, one may conclude that Caldecott has arrived at the HPS: Foot head > Unfooted > Foot dependent. And like any other work in moraic theory that arrived at this conclusion, while the conclusion fits the facts, it is not straightforwardly expressed by the theory. Why would an unfooted syllable associated to the prosodic word be stronger than the foot dependent? Why is it not weaker than the foot dependent? The intermediate strength of the extrapod does not follow from its representation in such accounts.

In stark contrast, the strength of extrapods as compared to foot dependents is straightforwardly derived by Strict CV metrics. In St’át’imcets, as mentioned, codas do not contribute weight. In Strict CV metrics, this means that empty nuclei are all metrically insignificant. Consequently, a sequence like [šùpkin’úšam] ‘I’m scratching my forehead’ is underlyingly represented as in (15a), with all and only full nuclei projecting, all to L2. The lapses have to be resolved, and this is done through incorporation. Incorporation proceeds from left to right, such that a grid mark of every even vowel is incorporated into the preceding vowel, resulting in (15b). The final /a/ projects to L1. An /a/ that projects to L1 is reduced to [ə]. The HL contour can be construed to target the last L3 projector.

But if the vowel /a/ is in an odd syllable (counting from the left edge), it will not be incorporated, as is shown in (16) (only the post-incorporation stage is shown). Accordingly, it will remain an L2 projector, and therefore, it will not reduce to [ə]. Again, no lapse is incurred by leaving this final vowel unchanged, since the previous nucleus has been incorporated. In other words, the final syllable is not unparsed, unfooted, or associated to the prosodic word; it is only untouched by incorporation.Footnote 6

To summarize, extrapods are unincorporated, non-incorporator nuclei. Strict CV metrics automatically derives not only the existence such nuclei but also their intermediary position in the HPS. Mainstream moraic accounts, while they do identify this relative strength, fail to motivate it in their representations.

4. Sesquisyllables

Many languages, especially in Southeast Asia, exhibit a distinction between minor and major syllables. Words in these languages usually combine one minor and one major syllable, with virtually all examples exhibiting the order minor–major. This combination has previously been referred to as quasi-disyllables, or sesquisyllables, ‘a syllable and a half in length’ (Matisoff Reference Matisoff and Hyman1973:86). It is particularly recurrent in Austroasiatic, where it constitutes the canonical word structure (Michaud Reference Michaud, Nau, Stolz and Storch2012:2). However, it is found in other language families too, such as the Sino-Tibetan language Burmese, which we will consider here. The data and foot-based analysis below come from Green (Reference Green and Watkins2005).

Green proposes that the differences between the two syllables depend on parsing: minor syllables are unfooted and unparsed; major syllables are footed and parsed. In what follows, after presenting the relevant facts and Green’s take on them, we show that Strict CV metrics derives the differences between major and minor syllable without reference to feet or parsing.

Burmese has eight monophthongs [a, i, u, e, ɛ, o, ɔ, ə], four diphthongs [aj, ej, ow, aw], and four tones [á, à, a̰, aʔ] (high, low, creaky, and checked). Major syllables may contain any vowel except [ə]; they may be open or closed, but only [ʔ] and placeless N can close a syllable; they must bear tone; and they may have simple (C) or complex (CG) onsets (G = glide).Footnote 7 Minor syllables in Burmese contain only [ə]. They never involve codas, tone, or complex onsets. A word may contain only a major syllable but not only a minor syllable.

Monophthongs may occur with or without N or ʔ, but diphthongs are always followed by either [ʔ] or [N]. Green analyzes this as a result of two interacting requirements. First, a major syllable must be bimoraic. Second, the right edge of a syllable may not bear independent place features. Monophthongs may occupy two moras without violating this ban (17a), because the features in the second mora are dependent on those of the first; but bimoraic diphthongs violate it (17b). Only monomoraic diphthongs are therefore allowed in major syllables, and they always have to be followed by a placeless coda (17c; codas are moraic in Burmese).

The best illustration of the restrictions on minor syllables is found in the so-called reducing compounds. These combine two major syllables, leading to the reduction of the first of the two into a minor one. As shown in (18), that syllable loses its tone, its vowel quality, its coda (18a, c) and the glide of its branching onset (18d).

Green argues that the restrictions on minor syllables follow from their monomoraic, unfooted status as well as the place ban mentioned above. Simplex words and reducing compounds may not involve more than one right-aligned foot, and feet in Burmese must be monosyllabic. In (19), the major syllable involves a bimoraic monophthong, which makes for the monosyllabic foot. Accordingly, the first syllable /ʧaN/ may not keep its coda, because there is no mora to host it. Consequently, its vowel is at the right edge of the syllable and is subject to the place ban, such that it loses all features and is reduced to [ə].Footnote 8

The explanation for tone loss is less insightful, involving conflicting ad hoc constraints. One constraint militates for all feet to carry a tone (‘a foot is associated with tone’ [Zec Reference Zec1999]), and another for there not to be tone at all (*Tone); the result is that only the tone of the footed vowel survives.

The loss of the glide of initial clusters is accounted for in the following manner. There is a high-ranked constraint against syllable-initial clusters. When a syllable is footed, the first C can be ‘associated directly to the foot level’ (20a), such that the cluster is not really syllable-initial, so the constraint is not violated. But a minor syllable is not footed, so such extrasyllabicity is not possible (20b). One of the two consonants must be left unassociated and cannot be pronounced, and it is systematically the glide (20c):

Extrasyllabicity, in our opinion, only sidesteps the problem and introduces other issues. First, one would like to see phonological reflexes that correspond to this status – for instance, a C associated directly to the foot being realized as weaker or stronger than one associated to the syllable. No such effects are reported – for Burmese or for any other language to the best of our knowledge. Moreover, the problem in (20b) can in fact be solved by another type of extrasyllabicity: like unparsed syllables, the /θ/ can be associated directly to the prosodic word.Footnote 9

Nevertheless, we do understand why Green, working within a foot-based approach, designates the minor syllable as unfooted. As for St’át’imcets, other variants of this approach cannot capture the facts. A ternary foot, whether flat or layered, will designate the first and third moras as equally weak, leading to incorrect predictions; and an approach that regards the first syllable as a degenerate foot would be unable to explain the reduced contrast in such feet.

Consider now the following account in Strict CV metrics. Assume that all vowels project only to L2 lexically, but words must contain an L3 projection. The representation in (21a) is thus ill-formed, whereas that in (21b) is well-formed, because the vowel reaches L3 through incorporation. The representation in (21c), in turn, contains the three types of nuclei of Strict CV metrics: unincorporated, incorporator, and incorporatee. It is also well-formed in terms of the above principles. To understand why V0 in (21c) can only host [ə], whereas V1 may host any vowel, it suffices to look at their projection levels, and state that only V-slots projecting to L3 may host full vowels.

Reducing compounds and other non-compound words can be defined as entries having a single L3 projection. Similarly to the reasoning for vowel reduction, one may state that only V-slots projecting to L3 may carry tone. This principle is indeed found in the many languages where functional items – which can be construed as projecting only to L2 – may not carry independent tone (see for instance Duanmu Reference Duanmu2007:72 for Mandarin).

The distribution of consonants in the word follows from previously established principles of Strict CV. It is well-known in Strict CV that there is a hierarchy between nuclei in terms of their licensing potential (Charette Reference Charette1990, Reference Charette1992; Cyran Reference Cyran2010; Ulfsbjorninn Reference Ulfsbjorninn and Samuels2017; Cavirani & van Oostendorp Reference Cavirani and van Oostendorp2019; Balogné Bérces & Ulfsbjorninn Reference Balogné Bérces, Ulfsbjorninn, Breit, Yoshida and Youngberg2023). V-slots with full vowels license more features than V-slots with reduced vowels, which in turn license more than empty nuclei:

In the Strict CV metrics analysis of Burmese, this hierarchy can be motivated by the level to which each nucleus projects. The second V-slot of the major syllable (V2) does not project at all and therefore can only license placeless occupiers of the preceding C-slot, /ʔ/ or /N/. It also cannot license a glide, which would be a placeful segment associated to the C (23a). Diphthongs are therefore monopositional (23b); consequently, the C-slot following them must be occupied, or the entire CV would have no motivation, incorporation would not occur, and the word would be subminimal.

In contrast, the first consonant of the minor syllable in (21c) above is licensed by a stronger V-slot, one that projects to L2, and therefore may host any of the consonants of the language. At the same time, V0 has less licensing power than V1, which projects to L3: V0 may not license branching onsets.

To summarize, sesquisyllabic structures like those of Burmese are easily explainable using Strict CV metrics. The three projection levels underlie the different licensing strengths of their nuclei, accounting for the distribution of consonants and tone in the word as well as for possible and impossible word shapes. No appeal is made either to otherwise unmotivated concepts, such as unparsedness or extrasyllabicity, or to constraints such as counter-factual *Tone.

5. Schwa in Tiberian Hebrew

In this last subsection, we show that the three-way distinction between incorporator, unincorporated, and incorporatee is handy in understanding the distribution and realization of Tiberian Hebrew (TH) schwa. The facts of this language are interesting in that, unlike in the two previous test-cases, the position of the unincorporated nucleus is not dependent on the position of the prominent, stressed vowel of its word.

5.1. The facts

The orthography of the Hebrew of the Jewish Bible initially lacked many symbols that would instruct one regarding the precise pronunciation of the written text. After Hebrew ceased to be spoken (around the second century CE), several traditions emerged in order to establish a stable pronunciation of that text. By far the most widespread today is the system developed by a group of scholars in the city of Tiberias between the eighth and tenth centuries CE, hundreds of years later. The pronunciation that these scholars aimed to establish is called TH. Thus, TH is not a spoken language; it is a pronunciation tradition of an annotated script, preserved through minute annotations. TH phonology depends on interpretations of these annotations. Despite these historical complications, Bat-El (Reference Bat-El2023) convincingly argues that a great many of the phonological patterns attested in TH are natural linguistic patterns.

The most up-to-date phonological interpretation of TH annotation seems to be Khan (Reference Khan2020). The facts in this subsection follow that study (with some minor deviations). The analysis, in turn, owes much to Bat-El (Reference Bat-El2023).

According to Khan, vowels were lengthened in open unstressed syllables and in any stressed syllable (stress is generally final). Here we will be concerned only with the vowel transliterated with the symbol schwa. This symbol takes the form of two perpendicular dots under the letter, e.g. אְ. Based on Khan’s description, the vowel that it stands for can be defined first and foremost by what it does not do. It is never stressed, and it is never lengthened.

The realization of schwa is less straightforward. Two additional aspects of that realization are illustrated by the forms in (24), all concerning the CC-initial stem /ʃmor/ ‘keep.nonpst’. The first C of the stem in (24a–c) is annotated with a schwa symbol (henceforth, all consonants annotated with schwa are in bold). In (24a), the stem is preceded by a prefix. In this case, the following cluster is not broken – the schwa is not pronounced – and the vowel of the prefix is in a closed syllable. When the same stem occurs without a prefix in (24b), the first two consonants are separated by [ə], creating a short initial open syllable (TH did not allow for initial clusters). One may conclude that the schwa symbol is pronounced [ə] in open syllables.Footnote 11 Before a V-initial suffix, for reasons that will not concern us here, the stem loses its vowel (24c, d). The second stem consonant, here /m/, is then also annotated with a schwa. If that loss creates an internal triconsonantal cluster (24c), the schwa symbol on the middle consonant is realized [ə] (here, too, it is in an open syllable). But if the cluster resulting from the syncope is triconsonantal and word-initial, the stem vowel is completely syncopated (24d), and the first letter is not annotated with a schwa symbol, but with the vowel symbol called hirek, which elsewhere corresponds to the vowel [i].

To summarize, the schwa symbol is not realized unless it has to be. In such cases, it corresponds to [ə] in an open syllable, and in closed syllables, it alternates with the symbol otherwise pronounced [i].

A fact that Bat-El does not discuss is the following. The prefix in (24) surfaces as [i] and is in a closed, unstressed syllable, like the first vowel of the stem in (24d). It is therefore ambiguous between a lexical /i/ and a schwa in a closed syllable. The facts in (25) argue for the latter view: when the prefix occurs on a CV-initial stem and is therefore in an open syllable, it is annotated with schwa and realized [ə] (25b, d).

The next subsection discusses Bat-El’s take on the data.

5.2. Bat-El (Reference Bat-El2023)

According to Bat-El (Reference Bat-El2023), schwas represent ‘empty moras’, i.e. moras that are unassociated to a vocalic nucleus (she cites Anderson Reference Anderson1982 on French schwa for this idea). These empty moras are realized through epenthesis, unless they can be associated to a consonant in coda position. To illustrate, the first consonant of the CC-initial stems carries such an empty mora. In (26a), since the consonant is initial, it cannot be a coda, and epenthesis ensues in order to realize the mora. But in (26b), since the consonant can be a coda, the mora is associated to it and no epenthesis is called for. (Note, again, that we are not concerned here with the representation of the final, stressed syllable.)

The metrification of the first empty mora as a coda is true for the stem-initial mora in (27a), too. But this form also carries a suffix, and as shown above, in such cases the stem loses its vowel. According to Bat-El, the original mora of that vowel is nevertheless retained and becomes an empty mora. Since the preceding consonant /ʃ/ is already a coda, the /m/ cannot be syllabified as a coda, and epenthesis ensues in order to realize the empty mora. Finally, the form in (27b) has two consecutive empty moras. The first must be realized through <i> epenthesis – this quality will be discussed presently – and therefore the second can be syllabified as a coda.

In order to explain the different qualities of epenthesis in (26a) and (27b), Bat-El puts forth a principle according to which [ə] should be followed by a mora associated to a full vowel. If, as in (27), it is followed by a moraic coda, the empty mora must be realized as [i].

While we believe the empty mora analysis is definitely on the right track, we also think it presents two shortcomings. First, empty moras are not arbitrarily distributed in the language, so they should be derivable in a more principled manner. Bat-El asserts that all empty moras originate in syncopated full vowels, like the second stem mora above. However, this is clearly not so for the first mora of the CC-initial stem: if it originated in a full vowel, we would expect that vowel, not [ə], in (26). Second, the principle proposed for the realization of the empty mora as [i] is also arbitrary. Why should the realization [ə] be followed by a mora associated to a full vowel? Note, that simply banning [ə] from closed syllables would have the same effect but would be as arbitrary.

We submit that these two shortcomings can be eliminated, and the facts better understood, if examined in Strict CV – specifically through the lens of the three-way distinction of incorporator, incorporate, and unincorporated.

5.3. Tiberian Hebrew schwa in Strict CV metrics

Bat-El’s empty moras do not require independent motivation in Strict CV: they are the empty V-slots of the theory, present after any consonant that is not followed by a realized vowel. The forms [ʃəmóːr, ʃimríː] are therefore represented as in (28): in the former V1 is empty and V2 is full; in the latter both are empty (the syncopated stem vowel is represented in parentheses and unassociated, principally as a mnemonic).

As in Burmese, empty nuclei are metrically significant and project to L1, while associated full vowels project to L2. The representations in (29) show the situation before the application of incorporation. Final empty nuclei do not project.

As discussed above, incorporation is motivated by two considerations: (a) the creation of prominence and (b) the expression of metrical significance of empty nuclei. In (29a), there is no problem of prominence. The first V-slot (V1) is empty and significant, yet it cannot be incorporated, as there is no nucleus to its left. This, we submit, is the reason for its realization as [ə], as shown in (30a). In (29b), in turn, both problems arise: there are two metrically significant empty nuclei (V1 and V2), and they are consecutive, creating a lapse – a lack of prominence. Incorporation solves both problems: the second nucleus is incorporated into the first (30b). The result is an empty nucleus projecting to L2. In other words, V1 projects to the same level as a lexical vowel. Accordingly, we submit, it is realized with a lexical vowel quality /i/, as opposed to the non-lexical quality [ə].

The difference in quality between epenthetic <ə> and <i> correlates with their projection level. Arguably, the lexical quality [i] is used because the nucleus projects to L2, like a lexical vowel; and the non-lexical quality [ə] is used when the nucleus projects to L1, unlike a lexical vowel. Possibly, the nuclei are realized in different stages of the derivation, too: [i] at the lexical level and [ə] at the post-lexical level.

Now, recall that the prefix vowel also alternates between [ə] and [i] – in Strict CV terms, it has an empty nucleus, too. Therefore, a form like [tiʃməríː] is in fact /t_ʃ_m_ri/, involving 3 consecutive empty nuclei in a row. As shown in (31), the metrical potential of the second one is incorporated into the first, and the third remains unincorporated. Their realizations follow the principles stated above.

The initial form /t_ʃ_m_ri/ could in principle be realized as *[təʃimri]. We assume that this is not the case because resolution of lapses proceeds from left to right: a second consecutive empty nucleus is incorporated, without regard of the status of the following nucleus.

Finally, of course, not all sequences of empty nuclei are left-aligned. The form [tətargəmíː], (25) above, would in Strict CV involve a medial sequence of two empty nuclei /t_-tar_g_mi/ (alongside an initial empty nucleus). The first of the two can be incorporated; thus, the second must be realized.

It emerges from the analysis in Strict CV metrics that the schwa symbol represents empty nuclei – the main feature of Strict CV representations, amply motivated independently. In this, the analysis stands in stark contrast to Bat-El’s, whose empty moras are far from uncontroversial in moraic theory. More importantly, for our present purpose, the realization of empty nuclei in our account follows the three-way distinction we argued for in St’át’imcets and Burmese: incorporator empty nuclei are realized as stronger [i], unincorporated empty nuclei as weaker [ə], and incorporated empty nuclei remain unrealized. This result provides further support for our proposal. Moreover, the TH case enriches the typology in showing how our conception of the HPS is useful in understanding phenomena that are independent of the position of the prominent, stressed vowel of the word.

Finally, let us return to the larger issue of parsing. Although parsing does not figure in Bat-El’s account, we assert that the TH facts above can be analyzed using this notion, more or less in a manner parallel to what we have done here. It could, for instance, be claimed that unparsed empty moras – empty moras that have not found a coda – are realized through epenthesis and associated to the prosodic word. But, as in the other languages examined here, the fact that unparsed syllables get realized at all, as opposed to simply being deleted, only follows from their association to the prosodic word, which is little more than an ad hoc way of saving them from deletion. In contrast, the three-way distinction derived by incorporation renders the notion of parsing redundant. Our account of TH does not require it and otherwise follows independently established principles of realization. The fate of parsing in Strict CV metrics is sealed: it is an unnecessary tool of phonological analysis.

6. Conclusion

Typology compels one to distinguish at least three types of nuclei: strong, weak and intermediary. At the descriptive level, this three-way distinction can lead to different terms: syllables with intermediary strength have been called extrapods in St’át’imcets and Salish languages, whereas in Southeast Asian languages they constitute the minor syllables in the common syllable-and-a-half or sesquisyllabic structure.

Foot-based accounts, because they operate with binary feet, can express the three types of syllables as (i) foot head, (ii) foot dependent, and (iii) neither. The latter category is most commonly referred to as unfooted or unparsed; because the syllable is still pronounced, it is claimed to be associated to the prosodic word. But while this view indeed expresses a three-way distinction, it requires additional assumptions in order to model the intermediary strength of the unfooted syllable. In other words, it does not straightforwardly capture the HPS, repeated in foot-based terms in (33).

Other attempts within foot-based approaches are not successful either. Ternary feet, flat or layered, cannot distinguish between dependents in a way that expresses the HPS. Approaches that eschew unparsed syllables by making the otherwise unfooted syllable into a degenerate foot (associated to a superfoot) do not fit the facts of either extrapods or sesquisyllables.

Another, more general problem has to do with the way the HPS in (33) is termed, namely the designation of the syllable with intermediary strength as unfooted. Most, if not all foot-based approaches regard parsing as preferable to non-parsing, thus characterizing the unfooted status as somehow problematic or marked. Indeed, a common markedness constraint in the mainstream framework of OT is Parse-σ, which demands that all syllables be parsed into feet. But language after language, one sees that unfooted syllables are not gotten rid of, as marked configurations are expected to be. Parse-σ mispredicts.

We therefore began this paper with a challenge, namely to express the HPS in a way that follows from the representation and does not share the mispredictions of Parse-σ. We claimed that Strict CV metrics, a theory that was developed with the goal of formalizing weight-sensitive stress within Strict CV, provides the goods. The HPS was recast in the terms of that theory:

Through the zero-sum property of incorporation, which is shared by no other metrical device to the best of our knowledge, nuclei become prominent by taking strength from other nuclei. Dependent positions are derivationally weakened by incorporation, a fate that does not equate with merely being unstressed. Unlike a foot-based model, which encodes a strength/weakness relation between head and dependent with the diacritics ‘strong’ and ‘weak’, in Strict CV metrics the mechanism which assigns prominence can only operate via phonological weakening of one V-slot in favor of the other. Unincorporated nuclei, in turn, are those that have neither incorporated a grid mark from another nucleus nor lost one to another nucleus. Therefore, they carry intermediary strength. Thus, the HPS is derived for free: it is a prediction of the approach, not something that requires additional assumptions.

In addition, unincorporated nuclei are not marked in any way. The theory does not have a constraint (or a principle) against unincorporated nuclei. These only pose a problem – the familiar problem of lapse – next to another unincorporated nucleus. Thus, our approach expresses the HPS in its representations, without appealing to parsing or lack thereof, and therefore it does not suffer from the mispredictions of the notion. There is no parsing in Strict CV metrics.

In Strict CV metrics, the nuclei of both extrapods and minor syllables share the property of being unincorporated – neither incorporators nor incorporatees. The two structures in (35) might superficially look dissimilar in terms of word shape, but the only significant difference is the position of the unincorporated syllable with respect to the incorporator–incorporatee sequence. The location of extrapods is to the right of that sequence, whereas the minor syllable precedes it.

The position of the unincorporated syllable in these languages is dependent on that of the prominent syllable. But the HPS is also useful in understanding phenomena that are not related to prominence; in TH, the three-way distinction accounts for the three different realizations of empty nuclei: lexical [i] in incorporator nuclei, non-lexical [ə] in unincorporated nuclei, and Ø in empty, incorporated nuclei.

Our approach has captured different types of data from a typologically diverse selection of languages – we have cast the theoretical net wide. It remains to be seen whether other sesquisyllabic patterns, other lenition facts, and other particulars of individual languages have also been caught in it.

Acknowledgments

We would like to thank the students who attended our class of NYI 2024 as well as Alexandra Shikunova and Andrew Lamont for their interest in our ideas. Andrew is also to be thanked for pointing out to us the St’át’imcets data. We also thank Outi Bat-El and three anonymous reviewers for their remarks. Thanks also go to a linguist and their challenge ‘Well, how do you do Burmese?’ and an anonymous reviewer of Faust (Reference Faust2023), writing, ‘perhaps the author would like to generally do without a distinction between unparsed syllables and the weak branch of a foot, but some day Strict CV prosody will have to confront the Dutch data.’

Footnotes

1 For important precursors of this idea see Yoshida (Reference Yoshida1999), Harris & Gussmann (Reference Harris and Gussmann2002) and Charette (Reference Charette2008).

2 According to Alexei & Ulfsbjorninn (Reference Alexei and Ulfsbjorninn2022), the fact that the system operates on CV-units rather than syllables is crucial to understanding languages like Romanian, where the metrical window is defined not as three standard syllables but three CV units.

3 Such subscripts on grid marks are not diacritics; they are used only for graphic purposes.

4 Incorporation distinguishes Strict CV metrics from the family of grid-based approaches to meter (Halle & Vergnaud Reference Halle and Vergnaud1987; Idsardi Reference Idsardi1992, Reference Idsardi, Raimy and Cairns2009; Halle & Idsardi Reference Halle, Idsardi and Goldsmith1995; van der Hulst Reference van der Hulst, Raimy and Cairns2009).

5 We use unincorporated only for non-incorporators.

6 The insignificance of empty nuclei in such languages sheds additional doubt on the possibility of extending the account in Balogné Bérces (2011) to vocalic weakness, since that account relies on lateral relations.

7 For more on the phonetics of Burmese, see Watkins (Reference Watkins2001).

8 We abstract away from several aspects of Green (Reference Green and Watkins2005), such as the existence of major, footed syllables before the final major syllable in non-reducing compounds.

9 Incidentally, like syllables or segments associated directly to the prosodic word, the parsing in (20) further relaxes the Strict Layering Hypothesis (Selkirk Reference Selkirk1984), to the point of undermining the predictive power of the entire Prosodic Hierarchy. For an example of this relaxation taken to its logical conclusion, see the multifarious representations in Kiparsky (Reference Kiparsky, Féry and van de Vijver2003).

10 This hierarchy is crucial, for instance, in stress assignment. In many languages, schwa can be stressed but only when it is the only vowel-type in the word; incidentally, this is so in St’át’imcets (Caldecott Reference Caldecott2009:19). Such cases can either be attributed to ‘sonority driven stress’ (although see discussion in Rasin Reference Rasin2017), or the schwa can be analyzed as the phonetic implementation of an unsilenced V-slot (Kaye et al. Reference Kaye, Lowenstamm and Vergnaud1990, Charette Reference Charette1991, Scheer Reference Scheer2004). Accordingly, in these systems the important distinction is between filled vs. empty V-slots, with a further subdivision for realized vs. silenced empty V-slots.

11 Khan argues that the schwa symbol in such cases was pronounced as a short [ǎ]. However, so was /a/ in a closed syllable, which was lengthened in open syllables. For clarity, we follow Bat-El (Reference Bat-El2023) in transcribing the realized schwa as [ə].

References

Alexei, Sofia & Ulfsbjorninn, Shanti. 2022. Metrical exceptionality and stress shift in Romanian nouns and adjectives. Isogloss: Open Journal of Romance Linguistics 8.1: 129.CrossRefGoogle Scholar
Anderson, Stephen. 1982. The analysis of French shwa: or, how to get something for nothing. Language 58, 534573.CrossRefGoogle Scholar
Balogné Bérces, Katalin. 2011. Weak and semiweak phonological positions in English. Journal of English Studies 9, 7596.CrossRefGoogle Scholar
Balogné Bérces, Katalin & Ulfsbjorninn, Shanti. 2023. Prevocalic tenseness in English, binarity and the typology of long vowel distributions. In Breit, Florian, Yoshida, Yuko & Youngberg, Connor (eds.), Elements, government & licensing, 7788. London: UCL Press.CrossRefGoogle Scholar
Bat-El, Outi. 2023. Biblical Hebrew phonology from a theoretical perspective. Radical: A Journal of Phonology 5, 145.Google Scholar
Becker, Michael. 2022. Cairene Arabic stress is local. Radical: A Journal of Phonology 4, 211247.Google Scholar
Booij, Geert. 1976. Klinkerreductie in het Nederlands. Leuvense Bijdragen 65, 461469.Google Scholar
Burzio, Luigi. 1994. Principles of English stress. Cambridge, UK: Cambridge University Press.CrossRefGoogle Scholar
Caldecott, Marion Gerda. 2009. Non-exhaustive parsing: phonetic and phonological evidence from St’át’imcets. Ph.D. dissertation, University of British Columbia.Google Scholar
Cavirani, Edoardo & van Oostendorp, Marc. 2019. Empty morphemes in Dutch dialect atlases: Reducing morphosyntactic variation by refining emptiness typology. Glossa: A Journal of General Linguistics 4.1, 88.Google Scholar
Charette, Monik. 1990. License to govern. Phonology 7, 233253.Google Scholar
Charette, Monik. 1991. Conditions on phonological government. Cambridge, UK: Cambridge University Press.CrossRefGoogle Scholar
Charette, Monik. 1992. Mongolian and Polish meet government licensing. SOAS Working Papers in Linguistics and Phonetics 2, 275291.Google Scholar
Charette, Monik. 2008. The vital role of the trochaic foot in explaining Turkish word endings. Lingua 118.1, 4665.CrossRefGoogle Scholar
Cyran, Eugeniusz. 2010. Complexity scales and licensing in phonology. Berlin & New York: De Gruyter Mouton.CrossRefGoogle Scholar
Duanmu, San. 2007. The phonology of Standard Chinese (2nd ed.). Oxford, UK: Oxford University Press.CrossRefGoogle Scholar
Faust, Noam. 2023. Metrically-conditioned syncope in Strict CV metrics. Glossa: A Journal of General Linguistics 8.1.Google Scholar
Faust, Noam & Ulfsbjorninn, Shanti. 2018. Arabic stress in strict CV, with no moras, no syllables, no feet and no extrametricality. The Linguistic Review 35.4, 561600.CrossRefGoogle Scholar
Faust, Noam & Ulfsbjorninn, Shanti. 2024. A footless stroll through Italian stress. The Linguistic Review 41.4, 741763.CrossRefGoogle Scholar
Green, Anthony D. 2005. Word, foot and syllable structure in Burmese. In Watkins, Justin (ed.), Studies in Burmese linguistics, 124. Canberra, Australia: Pacific Linguistics.Google Scholar
Halle, Morris & Idsardi, William. 1995. General properties of stress and metrical structure. In Goldsmith, John (ed.), The handbook of phonological theory, 403443. Oxford, UK: Blackwell.Google Scholar
Halle, Morris & Vergnaud, Jean-Roger. 1987. An essay on stress. Cambridge, MA: MIT Press.Google Scholar
Harris, John. 1997. Licensing inheritance: an integrated theory of neutralisation. Phonology 14, 315370.CrossRefGoogle Scholar
Harris, John & Gussmann, Edmund. 2002. Word-final onsets. UCL Working Papers in Linguistics 14, 142.Google Scholar
Idsardi, William J. 1992. The computation of prosody. Ph.D. dissertation, MIT.Google Scholar
Idsardi, William J. 2009. Calculating metrical structure. In Raimy, Eric & Cairns, Charles (eds.), Contemporary views on architecture and representations in phonology, 191212. Cambridge, MA: MIT.CrossRefGoogle Scholar
Itô, Junko. 1988. Syllable theory in prosodic phonology. New York: Garland Publishing.Google Scholar
Kager, René. 1989. A metrical theory of stress and destressing in English and Dutch. Dordrecht, the Netherlands: Foris.Google Scholar
Kager, René & Martínez-Paricio, Violeta. 2018. The internally layered foot in Dutch. Linguistics 56.1, 69114.CrossRefGoogle Scholar
Kaye, Jonathan, Lowenstamm, Jean & Vergnaud, Jean-Roger. 1990. Constituent structure and government in phonology. Phonology 7, 193231.CrossRefGoogle Scholar
Khan, Geoffrey. 2020. The Tiberian pronunciation tradition of Biblical Hebrew, vol. 1. Open Book Publishers.Google Scholar
Kiparsky, Paul. 2003. Syllables and moras in Arabic. In Féry, Caroline & van de Vijver, Ruben (eds.), The syllable in optimality theory, 147182. Cambridge, UK: Cambridge University Press.CrossRefGoogle Scholar
Krämer, Martin. 2009. The phonology of Italian. Oxford, UK: Oxford University Press.CrossRefGoogle Scholar
Ladd, D. Robert. 1986. Intonational phrasing: the case for recursive prosodic structure. Phonology Yearbook 3, 311340.CrossRefGoogle Scholar
Lowenstamm, Jean. 1996. CV as the only syllable type. In Durand, Jacques & Laks, Bernard (eds.), Current trends in phonology. Models and methods, 419441. Manchester, UK: European Studies Research Institute, University of Salford.Google Scholar
Martínez-Paricio, Violeta. 2013. An exploration of minimal and maximal metrical deet. Ph.D. dissertation, University of Tromsø.Google Scholar
Martínez-Paricio, Violeta & Kager, René. 2015. The binary-to-ternary rhythmic continuum in stress typology: layered feet and non-intervention constraints. Phonology 32.3, 459504.CrossRefGoogle Scholar
Martínez-Paricio, Violeta & Kager, René. 2016. Metrically conditioned pitch and layered feet in Chugach Alutiiq. Loquens 3.2, 114.CrossRefGoogle Scholar
Martínez-Paricio, Violeta & Torres-Tamarit, Francesc. 2018. Trisyllabic hypocoristics in Spanish and layered feet. Natural Language & Linguistic Theory 37, 659691.CrossRefGoogle Scholar
Matisoff, James A. 1973. Tonogenesis in southeast Asia. In Hyman, Larry M. (ed.) Consonant types and tones (Southern California Occasional Papers in Linguistics 1), 7196. Los Angeles: University of Southern California.Google Scholar
McCarthy, John J., and Prince, Alan. 1993. Prosodic morphology: Constraint interaction and satisfaction. Rutgers University Center for Cognitive Science Technical Report no. 3.Google Scholar
Michaud, A. 2012. Monosyllabicization: Patterns of evolution in Asian languages. In Nau, Nicole, Stolz, Thomas & Storch, Cornelia (eds.), Monosyllables: from phonology to typology, 115130. Berlin: Akademie Verlag.CrossRefGoogle Scholar
Nespor, Marina & Vogel, Irene. 1986. Prosodic phonology. Dordrecht, the Netherlands: Foris.Google Scholar
Prince, Alan & Smolensky, Paul. 2004. Optimality theory: Constraint interaction in generative grammar. Malden, MA: Blackwell.CrossRefGoogle Scholar
Rasin, Ezer. 2017. The stress-encapsulation universal and phonological modularity. Presented at GLOW 2017, Leiden University, March 14–18, 2017.Google Scholar
Scheer, Tobias. 2004. A lateral theory of phonology. Vol 1: What is CVCV, and why should it be? Berlin: Mouton de Gruyter.CrossRefGoogle Scholar
Scheer, Tobias & Szigetvári, Peter. 2005. Unified representations for the syllable and stress. Phonology 22, 3775.CrossRefGoogle Scholar
Selkirk, Elisabeth O. 1984. Phonology and syntax: The relation between sound and structure. Cambridge, MA: MIT Press.Google Scholar
Shikunova, Alexandra. 2024. Quality-conditioned stress as length: glide epenthesis in Moksha. Radical: A Journal of Phonology 6, 3173.Google Scholar
Ulfsbjorninn, Shanti. 2014. A field theory of stress. Ph.D. dissertation, SOAS, University of London.Google Scholar
Ulfsbjorninn, Shanti. 2017. Formalising the markedness of syllable structure. In Samuels, Bridget (ed.), Beyond markedness in formal phonology, 153190. Amsterdam: John Benjamins Publishing Company.CrossRefGoogle Scholar
Ulfsbjorninn, Shanti. 2021. Lenition and metathesis in Hawu: a quantity-sensitive language. SOAS Working Papers in Linguistics 20, 125.Google Scholar
van der Hulst, Harry. 2009. Brackets and grid marks or theories of primary accent and rhythm. In Raimy, Eric & Cairns, Charles (eds.), Contemporary views on architecture and representations in phonology, 225245. Cambridge, MA: MIT.CrossRefGoogle Scholar
van Oostendorp, Marc. 2000. Phonological projection: A theory of feature content and prosodic structure. Berlin: Mouton de Gruyter.CrossRefGoogle Scholar
Watkins, Justin W. 2001. Burmese. Journal of the International Phonetic Association 31.2, 291295.CrossRefGoogle Scholar
Yoshida, Yuko. 1999. On pitch accent phenomena in Standard Japanese. The Hague: Holland Academic Graphics.Google Scholar
Zec, Draga. 1999. Footed tones and tonal feet: rhythmic constituency in a pitch-accent language. Phonology 16: 225264.CrossRefGoogle Scholar