Hostname: page-component-586b7cd67f-g8jcs Total loading time: 0 Render date: 2024-11-28T02:21:19.211Z Has data issue: false hasContentIssue false

The acoustic characteristics of implosive and plosive bilabials in Shimaore

Published online by Cambridge University Press:  30 January 2023

Miki Mori*
Affiliation:
CUFR de Mayotte & Praxiling UMR 5267 CNRS [email protected] [email protected]
Rights & Permissions [Opens in a new window]

Abstract

Implosive consonants have drawn the attention of researchers over time, partially due to their relative rarity in the world’s languages, and partially due to their unique ingressive air flow. This sound category has varying complex features from an articulatory and acoustic perspective. This study explores the sound category by analyzing the acoustic features of a language whose implosives have yet to be acoustically considered: Shimaore, a Bantu-Sabaki language spoken in Mayotte. Specifically, it compares the bilabial plosive /b/ and the bilabial implosive /ɓ/ in terms of Voice Onset Time (VOT), fundamental frequency, amplitude, and voice quality via H1*–H2*, harmonics-to-noise ratios (HNR), and cepstral peak prominence (CPP). Analyses show that VOT is shorter for implosives than for plosives. At oral closure and vowel onset, amplitude and f0 are higher than plosives. H1*–H2* values read alongside HNR and CPP values suggest that implosives in Shimaore have glottal constriction. Some individual differences are explored for question of variation in implosives. Implications regarding sociophonetic studies in Mayotte as well as general implications for implosives are discussed.

Type
Research Article
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted re-use, distribution and reproduction, provided the original article is properly cited.
Copyright
© The Author(s), 2023. Published by Cambridge University Press on behalf of The International Phonetic Association

1 Introduction

Implosive consonants have drawn the attention of researchers over time, partially due to their relative rarity in the world’s languages, and partially due to their unique acoustic properties. These consonants have been described as ‘problematic’ (McLaughlin Reference McLaughlin2005: 201), due to challenges in describing the articulatory and acoustic properties involved. In fact, this category of sounds consists of several subgroups with various distinguishing properties, including at times a lack of ingressive air flow, once thought to be a criterial dimension (Ladefoged Reference Ladefoged1971). There appears to be an array of variation across languages for implosives (Lindau Reference Lindau1984). Considering these observations, studies are needed that look at how these sounds are realized in specific languages and dialects.

One language whose implosives have been understudied is Shimaore, a Bantu-Sabaki language variety spoken on the island of Mayotte in the Mozambique Channel. In descending order of frequency,Footnote 1 Shimaore has bilabial /ɓ/, alveolar /ɗ/ and palatal [ʄ] voiced implosives, the first two in phonemic contrast with their plosive counterparts /b/ and /d/, respectively. Only a handful of contrastive pairs exist, such as /bibi/ ‘insect’ and /ɓiɓi/ ‘young girl’, and all are bilabial. For the alveolar sounds, /dipe/ is ‘bread’ (from the French phrase du pain) whereas /ɗago/ means ‘city’ and ‘home’. While rare, the palatal implosive [ʄ] has been documented as a sound of the languageFootnote 2 (Blanchy Reference Blanchy1996, Association SHIME 2016), even though other language documentation texts only mention bilabial and alveolar implosives (Rombi Reference Rombi1983, Johansen Alnet Reference Johansen Alnet2009). That is, the palatal is mostly observed with the verb ‘to eat’ and derivations of it, including [uʄa], [ula], or [uɟa]Footnote 3. What is more, documentation of these sounds varies, and it appears that realization may differ by speaker, village, and context. It is not clear what role, if any, contact with French and Kibushi – the island’s other principal languages, which lack implosives – has on variation in sound production.

A close acoustic analysis of implosive sounds is needed, particularly for bilabials, which are the most common not only in Shimaore but also in many other Bantu languages (Greenberg Reference Greenberg1970, Maddieson Reference Maddieson, Nurse and Philippson2003). Indeed, bilabial implosives most often contrast with plosives. The purpose of this study is to acoustically describe the bilabial implosives in Shimaore and their plosive counterparts. The study is guided by two questions: How do voiced bilabial implosives and voiced bilabial plosives differ in terms of VOT, fundamental frequency, amplitude and voice quality, via H1*–H2*, HNR and CPP values? What are some qualitative individual differences between the two sound categories, /ɓ/ and /b/? The first question is to better understand which acoustic properties are the most informative for implosives in Shimaore. The second question is asked in order to explore possible variation.

1.1 An overview on implosives

Implosives are rare in the world’s sound systems. Estimates vary from 10% (Ladefoged & Maddieson Reference Ladefoged and Maddieson1996) to 13% (Maddieson Reference Maddieson, Dryer and Haspelmath2013), to 20% (Clements & Osu Reference Clements, Osu, Gussenhoven and Warner2002) and to 23% (Cun Reference Cun2009). A general understanding is that implosives are ‘stops that are produced with a greater than average amount of the lowering of the larynx during the time that the oral closure for the stop is maintained’ (Ladefoged & Maddieson Reference Ladefoged and Maddieson1996: 82). Note that the ‘greater than average’ caveat comes from the fact that for voiced stops in some languages, such as Maidu, Thai and Zulu, the larynx lowers slightly during sustained vocal fold vibration (Ladefoged & Maddieson Reference Ladefoged and Maddieson1996: 51, 78). Besides larynx lowing, ingressive airflow is a common property of implosives, like in Sindhi, but it is possible to have little to no ingressive airflow, as has been observed in Hausa (Nihalani Reference Nihalani1986). Ingressive airflow is due to the negative pressure that is created during oral closure when the larynx lowers (see Cun Reference Cun2009 for a more recent analysis of airflow). In fact, Catford (Reference Catford1939) describes implosives as ‘suction stops’. Researchers agree that implosives do not have egressive airflow (Lex Reference Lex1994, as cited in Clements & Osu Reference Clements, Osu, Gussenhoven and Warner2002: 304), which may be why they are sometimes called ‘nonexplosive stops’ (Clements & Osu Reference Clements, Osu, Gussenhoven and Warner2002).

While these definitions may suffice for many languages, they are also problematic (Clements & Osu Reference Clements, Osu, Gussenhoven and Warner2002) because they are based on Catford’s (Reference Catford1939) foundational argument that implosives have four distinguishing components: glottal closure, larynx lowering, rarefaction, and implosive release. While glottal closure is observed for implosives, implosives with modal voice also exist (Ladefoged & Maddieson Reference Ladefoged and Maddieson1996). As discussed above, larynx lowering and rarefaction are not always found in implosives. For this reason, Clements & Osu (Reference Clements, Osu, Gussenhoven and Warner2002: 10) argue that implosives should be considered nonobstruent stops, as the property ‘distinguishing implosives from plosives is the absence of air pressure buildup in the oral cavity’ (emphasis added). Ashby (Reference Ashby1990) argues that clarification needs to be made regarding implosives as an auditory class, which could help move beyond the problem of variation in sound production. Nevertheless, given these considerations, it is reasonable to define implosives as nonexplosive stops that tend to involve larynx lowering.

There are various other aspects to take into consideration. For one, the most common implosive is the bilabial, and implosives become rarer the more posterior the place of articulation is (Greenberg Reference Greenberg1970, Ladefoged & Maddieson Reference Ladefoged and Maddieson1996). This can be seen for example in Xhosa, which only has /ɓ/. As already mentioned, the bilabial is the most common implosive in Shimaore. In addition, voiced implosives tend to increase in amplitude during oral closure (Lindau Reference Lindau1984, Demolin Reference Demolin, Connell and Arvanati1995) as well as shorter negative VOT when compared to obstruent stops (Hussain Reference Hussain2018). Hainan Min (Wenchang Hainanese) has implosives that increase in amplitude during oral closure (Cun Reference Cun2009). Voiceless implosives (with full glottal closure) are observed in some languages such as Owerri Igbo (Ladefoged, Kemeny & Brackenridge Reference Ladefoged1976) and Seerer-Siin (McLaughlin Reference McLaughlin2005). Furthermore, Ladefoged & Maddieson (Reference Ladefoged and Maddieson1996: 89) hypothesize that auditory characteristics of implosives may be due to effects on formant frequencies brought on by larynx lowering.

Furthermore, laryngeal settings (voice quality) for implosives vary, as some languages have modal voicing (Xhosa) and others creaky (Hausa), such that the degree of glottal constriction varies. Indeed, creaky phonation has been noted to co-occur with implosives in several languages (Ladefoged & Maddieson Reference Ladefoged and Maddieson1996, Gordon & Ladefoged Reference Gordon and Ladefoged2001). However, it is argued that most Bantu languages do not have creaky phonation or glottal constriction (Maddieson & Sands Reference Maddieson and Sands2019). In addition, it has been claimed that some implosive sounds do not include larynx lowering but create cavity space by cheek expansion, as in the case of Gyeli (Grimm Reference Grimm2019). The use of cheek expansion has also been noted in a Hendo unexploded palatal implosive (Demolin, Ngonga-Ke-Mbembe & Soquet Reference Demolin, Ngonga-Ke-Mbembe and Soquet2002). Some languages contrast these sounds according to free variation such as with Bajele (Renaud Reference Renaud1976), the following vowel type, as seen in Mpiemo (Thornell & Nagano-Madsen Reference Thornell and Nagano-Madsen2004), or word-initial position, as seen in Bekwel (Cheucle Reference Cheucle2014, as cited in Grimm Reference Grimm2019: 143).

Thus, these sounds vary across languages around the world in terms of their phonetic and phonemic characteristics, even if they tend to be voiced (McLaughlin Reference McLaughlin2005). Defining and identifying implosives can therefore prove to be challenging at times due to these various articulatory and acoustic realizations (Grimm Reference Grimm2019). In fact, implosive sounds of languages spoken in the same region can have varying differences in acoustic properties (Lindau Reference Lindau1984). We can see how distinguishing between plosives and implosives in this sense can be at times difficult, and that there is indeed ‘a gradient between one form of voiced plosive and what may be called a true implosive’ (Ladefoged & Maddieson Reference Ladefoged and Maddieson1996: 82). Given the possibility that implosives may exist on a gradient, one goal of this study is to better understand where along this gradient certain sound pairs exist and to understand their phonetic properties via acoustic analyses. Specifically, Voice Onset Time, fundamental frequency, amplitude and spectral tilt and noise measures are analyzed in order to better understand this sound category for bilabials in Shimaore.

1.2 Acoustic measurements for implosives

Various measures exist for identifying implosives and plosives. Recent research stress the need to analyze multiple acoustic properties, as contrasts appear to be distinguished by a combination of parameters, for example, not just Voice Onset Time (VOT), since fundamental frequency and spectral tilt of the following vowel may differ according to consonant type (Kirby Reference Kirby2018, Cho, Whalen & Docherty Reference Cho, Whalen and Docherty2019). In fact, all three phonetic properties have been used to effectively analyze stops in various places of articulation, including bilabial and dental, in Burushaski (Hussain Reference Hussain2021). Since we consider implosives to be nonexplosive stops, it may be that a combination of correlates is employed for contrast discrimination in bilabial (im)plosives. For spectral tilt, in the case of implosives, looking at laryngeal settings via H1*–H2* alongside harmonics-to-noise ratio (HNR) and cepstral peak performance (CPP) (itself a measure of HNR) may prove useful for analyzing voice quality and establishing the co-occurrence of creaky voice or the presence of glottal constriction. The HNR of < 500 Hz was used because as a measure of low-frequency noise, it is most closely related to irregular voicing. Each measure in addition to amplitude as related to implosives is discussed below.

1.2.1 Voice Onset Time (VOT)

First, VOT is a measurement used with stop consonants for measuring the time between the release of the stop and the start of voicing. Voiced consonants have negative VOT since the vocal folds start vibrating before the release. This measurement is an acoustic staple for identifying contrast (Lisker & Abramson 1964, Cho et al. Reference Cho, Whalen and Docherty2019) in laryngeal settings, including implosives (Hussain Reference Hussain2018). Implosives tend to have short −VOT, with a continuous build-up of amplitude, whereas plosives have longer −VOT and a decrease in amplitude before the release (Demolin Reference Demolin, Connell and Arvanati1995, Ladefoged & Maddieson Reference Ladefoged and Maddieson1996, Hussain Reference Hussain2018). This is the case for Sindhi and other Indo-Iranian languages (Nihalani Reference Nihalani1986, Hussain Reference Hussain2018). One small-scale study found that Swahili implosives have shorter VOT measures than plosives (Coburn & Hjortnaes Reference Coburn and Hjortnaes2019). However, this may not be the case in other languages. For Mpiemo, plosives tend to have shorter VOT than implosives, but this could be due to speaker variation and speaking rate (Nagano-Madsen & Thornell Reference Nagano-Madsen and Thornell2012). Qualitative analyses of Zulu suggest shorter VOT for the plosive /b/ than for the implosive /ɓ/ (Naidoo Reference Naidoo2010). In addition, speech rate may factor into VOT, with Zulu implosives having the shortest VOT when occurring in rapid speech (Midtlyng Reference Midtlyng2011). It is yet to be understood how VOT functions for implosives and stops in Shimaore.

1.2.2 Fundamental frequency (f0) and CF0

Besides VOT, fundamental frequency is an important measure for looking at implosive consonant dimensions, both during oral closure and in the following vowel. Focus has been made on the latter as it is thought to be a perceptual cue for the preceding consonant (Wright & Shryock Reference Wright and Shryock1993). This is also called Consonant Induced f0 (CF0) since the fundamental frequency of the following vowel is influenced by the preceding consonant (Kirby Reference Kirby2018, Cho et al. Reference Cho, Whalen and Docherty2019). In addition, during oral closure, implosives may have higher f0 compared to voiced plosives due to the rapid lowering of the larynx (Ohala Reference Ohala1976, as cited by Demolin Reference Demolin, Connell and Arvanati1995: 382; Hombert, Ohala & Ewan Reference Hombert, Ohala and Ewan1979). For the following vowel, Painter (Reference Painter1978) observed that implosives tend to result in overall higher pitch throughout the vowel duration. Furthermore, for SiSwati, Wright & Shryock (Reference Wright and Shryock1993) found vowel f0 following a voiced bilabial implosive to be higher than when following a voiced sonorant. It has been argued that vowels following implosives have pitch similar to voiceless stops (Hombert Reference Hombert and Fromkin1978). CF0 also appears to be higher for vowels following implosives versus voiced plosives for Mpiemo (Nagano-Madsen & Thornell Reference Nagano-Madsen and Thornell2012). It may also be true that f0 values during the oral closure of implosives remain higher than their plosive counterparts. This is yet to be confirmed for Shimaore.

1.2.3 Spectral tilt and amplitude

Measures beyond VOT and f0 are important when considering implosives. Spectral tilt is a relative term used to describe measurements identifying different phonation types and degree of glottal constriction, such as breathy, modal and creaky voice (Jackson et al. Reference Jackson, Ladefoged, Huffman and Antoñanzas-Barroso1986). It is defined as ‘the degree to which intensity drops off as frequency increases’ (Gordon & Ladefoged Reference Gordon and Ladefoged2001: 15). That is, in its essence, it is used to measure ‘whether higher-frequency parts of a spectrum show much lower amplitude than lower frequency parts, only slightly lower amplitude or even greater amplitude’ (Thomas Reference Thomas2011: 231). It is an amplitude ratio of the lowest-frequency harmonic (f0, H1) to other ones including H2 (second harmonic amplitude), A1 (first formant amplitude), and A2 (second formant amplitude) (Jackson et al. Reference Jackson, Ladefoged, Huffman and Antoñanzas-Barroso1985, Thomas Reference Thomas2011, Khan Reference Khan2012). For implosives and the potential co-occurrence of creaky voice, H1–H2 (the amplitude of the first harmonic, H1, minus the amplitude of the second harmonic, H2) is the most interesting. This value tends to be lower for creaky voice with higher glottal constriction, and higher for breathy voice, with modal voice falling somewhere in between the two phonation types (Garellek Reference Garellek, Katz and Assmann2019). As a reminder, the phonation scale in order of increasing glottal constriction is breathy, slack, modal, creaky and glottalized. Certainly, creaky phonation is a term that in fact encompasses a wide range of phenomena, such as vocal fry, laryngealization, glottalization, but in general refers to a sound with higher than modal glottal constriction (Garellek Reference Garellek, Katz and Assmann2019, Davidson Reference Davidson2020). Jessen & Roux (2002) found lower H1*–H2* Footnote 4 of the following vowel for implosives, suggesting that they are not breathy voiced. This may be expected since breathy voice involves egressive airflow.

In addition, HNR measures allow for further clarification of phonation when compared alongside spectral tilt measures, as more glottal constriction results in low HNR values due to the presence of noise (Garellek Reference Garellek, Katz and Assmann2019). This same conclusion can be found for using CPP values in which low values correspond to non-modal phonation or more noise (Seyfarth & Garellek Reference Seyfarth and Garellek2018). The interest in this study is to determine if implosives have a more constricted laryngeal setting and voice quality than plosives. In addition, some studies look at amplitude in and of itself for addressing degree of implosiveness (Naidoo Reference Naidoo2010, Nagano-Madsen & Thornell Reference Nagano-Madsen and Thornell2012, Coburn & Hjortnaes Reference Coburn and Hjortnaes2019, Grimm Reference Grimm2019). Amplitude during oral closure and the following vowel may function as a distinguishing acoustic property for implosives, but little to nothing is known about spectral tilt or amplitude concerning implosives in Shimaore.

Figure 1 Map of the Comorian Archipelago in the Mozambique Channel.

1.3 Shimaore and Mayotte’s linguistic landscape

As for the language in question, Shimaore is a Northeast Coast Bantu language of Sabaki variety (Guthrie Reference Guthrie1948 classification G44d) belonging to the Comorian language group, of which there are four, one for each island of the Comorian Archipelago (Nurse & Hinnebusch Reference Nurse and Hinnebusch1993, Patin, Mohamed-Soyir & Kisseberth Reference Patin, Mohamed-Soyir and Kisseberth2019), as illustrated in Figure 1. The language group subdivides into the western group of Shindgazija and Shimwali and the eastern group of Shindzuani and Shimaore (Ahmed-Chamanga Reference Ahmed-Chamanga2017). The two groups are – for the most part – not mutually intelligible. Inhabitants of Mayotte, the furthest island from the African coast, speak Shimaore. Most recent estimates suggest 41% of the inhabitants speak Shimaore and 31% speak a variety of the three Comorian dialects Shingazidja, Shimwali and Shindzuani (Insee Mayotte Infos 2014), this last dialect having the largest percentage due to high immigration rates from Anjouan, the Shindzuani-speaking island located only 60 kilometers from Mayotte. Shindzuani and Shimaore are mutually intelligible. Modest estimates are around 190,000–220,000 speakers of Shimaore or a Shindzuani-influenced Shimaore in Mayotte. That is, when referring to the Bantu language spoken in Mayotte, this can be in reference to a variety that is influenced by Shindzuani. As seen in Figure 2, certain villages, such as those in the North and East of the island, speak a variety closer to Shindzuani, whereas western and southern villages speak a variety often described as the ‘pure’ Shimaore due to its isolation from contact with other island varieties. Language variation is often village specific, as can be seen in the northwest of the island with Mtsamboro and Hamjago, which are separated by only a few minutes on foot, but which speak two mutually unintelligible languages, Shimaore and Kibushi Kisakalava, respectively. Dense urban areas on the northeast part of the island such as Mamoudzou, prove to be complex in terms of variation and language contact, such that identifying them just as influenced from Shindzuani may be limiting. Furthermore, calling Shimaore its own language versus a Comorian dialect is as much a political choice as a linguistic one, but this discussion goes beyond the scope of this study.

Figure 2 Approximate distribution of the local languages and their varieties in Mayotte.

The Shimaore (and arguably Shindzuani) consonants and vowels are provided in Table 1. In parentheses are sounds from Arabic, which some argue are part of the Shimaore/Shindzuani sound system (Ahmed-Chamanga Reference Ahmed-Chamanga2017), but which speakers rarely use, based on Rombi’s (Reference Rombi1983) and the present author’s observations. Shimaore has prenasalized consonants for the plosives, affricates and implosives as well as the labial-dental fricatives. It has been observed that implosives become plosives in post-nasal environments (Nurse & Hinnenbusch Reference Nurse and Hinnebusch1993). In addition, variation appears to be present in the realization of prenasalized consonants, some of which may be due to regional or dialectal differences. For example, /ntsi/ and /tsi/Footnote 5 are both acceptable for saying ‘earth’ or ‘country’, with some locals arguing that /ntsi/ belongs to the dialects originating from Anjouan (Nurse & Hinnenbusch Reference Nurse and Hinnebusch1993). Stress in Shimaore is penultimate (Rombi Reference Rombi1983, Philippson Reference Philippson1988, Ahmed-Chamanga Reference Ahmed-Chamanga2017), while tones are absent (Rombi Reference Rombi1983).

Table 1 Consonants and vowels of Shimaore, with marginal consonant phonemes shown in parentheses. Grey areas indicate articulations judged impossible.

aThe post-alveolar retroflex plosives and have also been observed as retroflex affricates and by Rombi(Reference Rombi1983) and by the author.

Concerning (im)plosives, Bantu terms in Shimaore retain their implosive sounds. Like Swahili, Shimaore contains many loanwords from Arabic, adapted to the local phonology, such that the voiced bilabial implosive is privileged over the voiced bilabial plosive (Rombi Reference Rombi1983). For example, the word for ‘love’ is mahaa, from the Arabic word mahabbah. That is, bilabial plosives from Arabic loanwords become implosive whereas those from French loanwords remain plosive (as well as loanwords from other languages not having implosives such as Malagasy), presumably because they were introduced into the language more recently. This appears to be in contrast to Shingazidja, where Arabic loanwords maintain their plosive quality (Patin et al. Reference Patin, Mohamed-Soyir and Kisseberth2019). That is, literature on Shimaore (Rombi Reference Rombi1983, Johansen Alnet Reference Johansen Alnet2009) and Shingazidja (Rombi Reference Rombi1989, Patin et al. Reference Patin, Mohamed-Soyir and Kisseberth2019) and observations on the status of implosives show that their realization varies. At the very least, further inquiry into these sounds is needed, particularly prior to conducting research that investigates sociophonetic variation. Considering Swahili, a recent study comparing voiced stops showed individual variation, such that some speakers produced stops as implosives, whereas others produced plosives, thus implicating the existence of variation for these sounds (Coburn & Hjortnaes Reference Coburn and Hjortnaes2019).

Among other languages in Mayotte, Kibushi is spoken by about 15% of the inhabitants (Jamet Reference Jamet2016). It is an Austronesian language closely related to Sakalava Malagasy. There are two main varieties, Kibushi Kisakalava and Kibushi Kiantalautsi, the latter being spoken by only a few thousand people. The French language has an increasing presence in Mayotte, a French department since 2011 whose schooling and administrative activities occur exclusively in French from an official standpoint. Younger generations of Maore (people from Mayotte) and migrants speak French, although fifteen years ago, more than a third of inhabitants over 14 years of age reported not speaking any French (Insee 2007). Even if more recent statistics are not available, with the departmentalization and the influence of the national education system, this number is arguably in decline and may continue to decrease in the coming generations. Furthermore, studies suggest that locals prioritize passing on French to their children, as they believe it is the language of the future for the island (Laroussi Reference Laroussi2015; Mori Reference Mori2018, Reference Mori and Priolet2021). Thus, the influence of French on the local languages is a strong possibility. Neither French nor Kibushi have implosive sounds natively, although some varieties of Kibushi have innovated them, presumably as a result of contact with Shimaore.

2 Materials and method

2.1 Participants

The analyzed recordings come from 28 speakers (seven male, 21 female) aged 16–47 years (mean age = 25 years, standard deviation = 8). All participants were bilingual French/Shimaore speakers, and we exchanged in French during the experiments. Due to the voluntary participation and the nature of the fieldwork, sex and age were not controlled for.

2.2 Materials

Words examined in this study come from a larger wordlist of 50 words in Shimaore with corresponding images (see Appendix for the wordlist and sample images). Drawings were used due to the oral nature of the language and difficulties encountered when asking participants to read words in Shimaore using the Latin alphabet. When possible, images from the International Picture Naming Project were used (see Bates et al. Reference Bates, Federmeier and Herron2000). The recordings of 15 words with /b/ or /ɓ/ were analyzed, as shown in Table 2. Bilabials were chosen because they most commonly exhibit the contrast between implosives and non-implosives. In order to control for factors such as stress and phonetic environment, only words in word initial, stressed position were analyzed. For words containing two instances of a target consonant, such as bibi ‘insect’, only the first consonant was analyzed. Note that speakers produce implosives in stressed intervocalic positions, such as maawa ‘wings’. However, (im)plosives sometime undergo devoicing when in an unstressed, word-final position with the closed back vowel /u/, regardless of loanword origin such as with latabu from French la table ‘the table’ and dahau from Arabic dahab ‘gold’. Loanword status is included as this may be involved in variation, as discussed in Section 1.1 above, though it is beyond the scope of the paper to discuss if these words are recent loanwords or fully integrated vocabulary. Note that six out of 15 words on the list are loanwords from French, all of which contain the non-implosive /b/. As mentioned above, this is not uncommon for the Shimaore vocabulary.

Table 2 Word list.

aIn the figures /ɓ/ = bb and /b/ = b.

b‘No’ is for words presumed to have Bantu origins.

2.3 Procedure

Using a modified sociolinguistic interview method (image list, cartoon strips, storytelling), participants were recorded in a 1m2 soundproof booth on the Centre universitaire de formation et de recherche de Mayotte university campus using an Apex 435B condenser microphone, Presonus audiobox and Clarin’s SpeechRecorder software (Draxler & Jänsch Reference Draxler and Jänsch2014) with a sampling rate of 44,100 Hz. Each word was pronounced twice except for participants 93F22, 94F21, 95F22, 96F20, and 98F22, who pronounced the word only once (for naming, ‘F’ means female and the following number is the age of the participant).

2.4 Annotation and analysis

Recordings were annotated to TextGrids in Praat (Boersma & Weenink Reference Boersma and Weenink2019). Voiced (im)plosives were segmented from the onset of vocal fold vibration (voiced closure) to the start of the release burst (see e.g. Figure 3 in Hussain Reference Hussain2018). For example, for the word /ˈɓiɓi/, there were four boundaries placed for segmentation, the first at the onset of the vocal fold vibration, the second at the start of the release burst, the third at the end of /i/, the fourth at the release burst of the second /ɓ/, and the fourth at the end of the second vowel (see for example Figure 6). VOT and f0 were analyzed using Praat. PraatSauce (Kirby Reference Kirby2020) was used to analyze HNR (0–500 Hz), CPP and H1*–H2* (values discussed are the corrected values, indicated by *). In order to adjust for individual differences, including male versus female voices, frequency was converted from hertz to semitones for each participant, using each participant’s f0 averages (12*(log((zz)/100))/log(2)). Statistical analyses and figures were created using R (R Core Team 2017), such as for means (mean) and standard deviation (SD). Smoothing Spline Analysis of Variance (SSANOVA; Gu Reference Gu2002) was used for looking at f0, amplitude, and H1*–H2* using Gu’s (Reference Gu2021) ‘gss’ package in R (Gu Reference Gu2021). Modeling Gaussian regression, the SSANOVA (ssanova) model is similar to linear regression functions (lm) in R. Because data subsets were non-normally distributed, nonparametric methods were used, specifically the Kruskal-Wallis $\chi^{2}$ test (kruskal.test). Spectrograms and waveforms were made using the phonTools package (Barreda Reference Barreda2015) on R.

Figure 3 VOT (ms) boxplots by word.

3 Results

3.1 Global trends

3.1.1 VOT

Overall, VOT values are significantly different for the two consonant types (Kruskal-Wallis $\chi^{2}$ = 4253.3, df = 1, p < .001), with average implosives having a VOT of −57.9 ms (SD 22.67) and plosives having an average VOT of −105.55 ms (SD 33.32). Plosive VOT values were found to be just under twice as long as the implosives VOT values. Figure 3 shows boxplots of VOT by word with visible intra- and inter-word variation. We can see that durations are similar for certain words, such as bibiro ‘bottle’ and ⁓a ‘kick’. Some words have a wider range of duration values, such as ⁓ao ‘plank/board’ and bibi ‘insect’, whereas others have a narrower range, such as ⁓aa ‘father’. VOT differences are consistent with previous research on VOT in implosives (Ladefoged & Maddieson Reference Ladefoged and Maddieson1996). Figures 47 show spectrograms and waveforms for two participants’ production of ⁓ii ‘young girl’ and bibi ‘insect’, chosen for their typicality of what was observed. Note that the implosive /ɓ/ is indicated with ‘bb’ in graphs and figures. Figures 4 and 5 are for participant 06M23 and Figures 6 and 7 are for participant 12F34. As can be seen VOT are much shorter for word initial implosives than plosives.

Figure 4 Waveform and spectrogram (Frequency from 0–5000 Hz) for ‘bbibbi’ from participant 06M23.

Figure 5 Waveform and spectrogram (Frequency from 0–5000 Hz) for ‘bibi’ from participant 06M23.

Figure 6 Waveform and spectrogram (Frequency from 0–5000 Hz) for ‘bbibbi’ from participant 12F34.

Figure 7 Waveform and spectrogram (Frequency from 0–5000 Hz) for ‘bibi’ from participant 12F34.

3.1.2 Amplitude during oral closure and vowel onset

As for amplitude, Figure 8 shows a smoothing spline ANOVA with 95% Bayesian confidence intervals (indicated by the dotted lines) for amplitude during the last 100 ms of oral closure and the first 100 ms of the following vowel. For implosives, amplitude continuously increases during oral closure and continues to increase until about 25 ms after the release. For plosives, amplitude begins to increase about 20 ms before the release and continues to rise until around 50 ms into the vowel. At the release, average amplitude for implosives is 56.69 Db (SD 10.62), and average amplitude for plosives, 51.95 Db (SD 11.00). This difference of about 5 Db is statistically significant (Kruskal-Wallis $\chi^{2}$ = 410.58, df = 1, p < .01). Figures 4 and 6 show this amplitude increase during oral closure of implosives and Figures 5 and 7 demonstrate steady or diminishing amplitude for plosives.

Figure 8 Amplitude (Db) from 100 ms before to 100 ms after consonant release.

Figure 9 F0 in semitones from 100 ms before to 100 ms after consonant release.

3.1.3 F0 during oral closure and vowel onset

As for f0, Figure 9 shows a smoothing spline ANOVA for pitch in semitones during the last 100 ms of oral closure and the first 100 ms of the following vowel. A noticeable difference is observed during oral closure of implosives in that the f0 is higher than plosives. F0 decreases on release and starts to stabilize around 20 ms into the vowel. For plosives, pitch is low during oral closure but starts to sharply rise 25 ms before release at which it continues to rise, but at a slower pace. The difference in pitch for the first 20 ms of the vowel is statistically significant (Kruskal-Wallis rank sum test $\chi^{2}$ = 42.22, df = 1, p < .01). These findings seem to support theories that the downward movement of the larynx accompanied with glottal constriction during oral closure of implosives results in a higher pitch during closure (Ohala Reference Ohala1976, Hombert, Ohala & Ewan Reference Hombert, Ohala and Ewan1979).

Figure 10 H1*–H2* values (Db) during the first 100 ms of the vowel.

Table 3 Harmonics-to-noise ratio (HNR) and cepstral peak performance (CPP) values for the first 20 ms of vowel.

aHNR-05 = Harmonics-to-Noise ratio at 0–500 Hz (Hillenbrand et al. Reference Hillenbrand, Cleveland and Erickson1994).

3.1.4 Voice Quality via H1*–H2*, HNR and CPP

For H1*–H2* values, averages for the first 20 ms of the vowel are significantly different per consonant type: the implosive mean is 8.10 Db (SD 6.60) and the plosive mean is 11.15 Db (SD 7.11) (Kruskal-Wallis rank sum test $\chi^{2}$ = 137.087, df = 3, p < .01). Figure 10 shows a SSANOVA of the first 100 ms of the vowel. As we can see, H1*–H2* values for the two consonant types run parallel to each other much of the time, with plosive scores remaining higher. Table 3 shows the corresponding Harmonics-to-Noise (HNR) ratio and cepstral peak prominence (CPP) values. As can be seen, the difference between the two sound groups in terms of noise is minimal. Considering these two phonation parameters (H1*–H2* and the HNR/CPP values), it appears that the implosives do have glottal constriction (Garellek Reference Garellek, Katz and Assmann2019). While the constriction is not strong enough to induce irregularity in the noise measures (HNR and CPP), it is strong enough to affect spectral tilt, specifically H1*–H2* values. Thus, the bilabial implosive in Shimaore appears to move away from the category of modal phonation, a finding which does not support Maddieson & Sands’ (Reference Maddieson and Sands2019: 28) claim that Bantu implosives tend to lack glottal constriction, having vocal folds in ‘the normal position for voicing’. That is, the implosive bilabial has more glottal constriction that the plosive but not to the point to qualify it as creaky phonation.

3.2 Individual differences

Looking at individual differences can shed light not only on variation but also on the question of implosives existing on a gradient with plosives. Figures 11, 12 and 13 show VOT, amplitude and f0 values per participant, respectively. For some participants, VOT differences between consonants are quite different, such as participants 06M23 and 93F22, whereas for others the values are closer together, such as participants 30F21 and 05M21. In addition, some have varying VOT lengths (participant 99F23) whereas VOT values for others fall within a short range (21F23). A general observation of Figure 12 is that amplitude values vary among participants, but that many have higher amplitudes at least 10 ms before release. In addition, male speakers tend to have higher amplitude values than female speakers. For f0, we can see in Figure 13 that f0 during oral closure is higher for many but not all participants. Some, like 30F21 and 99F23, have similar f0 curves during closure.

While it is beyond the scope of the paper to analyze each participant, it is worthwhile to look at a couple of them. First, there is participant 06M23, whose production of (im)plosives parallels overall findings in Section 3.1 above. Indeed, this person produces implosives in ways that reflect the averages in Section 3.1. VOT duration is markedly different, where implosive VOT is −41.96 ms (SD = 17.82) and plosive VOT is −112.25 ms (SD = 32.15). Amplitude values are similar to the overall average, in that there is an increase during oral closure and during the first 20 ms of the vowel. At the release, implosive amplitude average is 71.55 Db (SD = 3.88) and for plosives, it is 68.30 Db (SD = 3.07), a difference that is slightly less than overall averages. Like Figure 9, the participant’s f0 curve shows higher values during closure and at the beginning of the vowel, before lowering.

In contrast, participant 30F21 stands out because some of her readings are different than those discussed in Section 3.1. First, VOT is only minimally different: implosive average = −67.43 ms (SD 22.03) and plosive average = −70.48 ms (SD 26.27). Compared to VOT averages in Section 3.1.1, we see the participant tends to have much shorter VOT for plosives and somewhat longer VOT for implosives. Concerning amplitude, 30F21’s SSANOVA curves resemble those of Section 3.1.2 except that amplitude increases much more for plosives around 25 ms into the vowel. At the release, the implosive amplitude average is 49.73 Db (SD 2.70) and the plosive amplitude average is 45.24 Db (SD = 3.79), which is similar to the averages in Section 3.1.2. For f0, as can be seen in Figure 13, the main difference is during oral closure, where the implosive has only a slightly higher pitch than the plosive. This is a departure from Section 3.1.3, where f0 is elevated throughout oral closure. For this participant, it is possible that the larynx is quickly lowered just before release, which could explain the lower f0 values. This participant’s differences and the individual variation suggest that bilabial implosives are manifested in various ways in Shimaore and that there can be variation, regarding VOT, amplitude and f0. It is possible that individuals vary in the degree and timing of larynx lowering, which may influence the degree of oral pressure, oral closure f0, amplitude and VOT.

Figure 11 VOT (ms) boxplots by participant, distinguished by consonant type.

Figure 12 Amplitude (Db) per participant from 100 ms before to 100 ms after consonant release.

Figure 13 F0 in semitones per participant from 100 ms before to 100 ms after consonant release.

In addition, Figures 14 and Figure 15 show another observation concerning consonant release quality. Some participants produced bursts at the release of the implosive. A clear indicator of a burst-like activity is seen with participant 17F20 in Figure 14, whereas 02F46, as seen in Figure 15, does not appear to have a burst. In Figure 14, there is a visible depression in the waveform as well as a transient corresponding to the release in the spectrogram. Some studies suggest that there should be no visible burst for implosives (Clements & Osu Reference Clements, Osu, Gussenhoven and Warner2002), but that does not appear to always be the case for this data and is consistent with observations of bursts in Zulu implosives (Naidoo Reference Naidoo2010) and Hainan Min (Cun Reference Cun2009).

Figure 14 Waveform and spectrogram (Frequency from 0–5000 Hz) for ‘bbueni’ from participant 17F20.

Figure 15 Waveform and spectrogram (Frequency from 0–5000 Hz) for ‘bbueni’ from participant 02F46.

Finally, it is worth considering which acoustic dimension may be the most salient for each participant in distinguishing implosives on the plosive–implosive cline. Looking individually at these correlates may reveal variation among speakers and give insight into the distinguishing role of the measurements analyzed above. This was done by calculating standardized differences via Cohen’s d values between the implosive and plosive measurements for each dimension, allowing for a standardized comparison (see Seyfarth & Garellek Reference Seyfarth and Garellek2018 for a thorough explanation). In conformity with the measurements in Section 3.1, f0, amplitude and spectral tilt measures were taken from the last 20 ms of oral closure through the first 20 ms of the vowel, for a total of 40 ms. Figure 16 illustrates the standardized differences for the five acoustic dimensions analyzed. As can be seen, VOT stands out as the property that separates these sound categories (average value = 2), followed by amplitude (average value = .82) and f0 (average value = .66). Looking at individual differences, it can be observed that for participant 18M16 amplitude and f0 are the more salient dimension for distinguishing the sound pairs, particularly amplitude. This is in contrast with participant 30F21, where the acoustic dimensions do not have much contrast for the two sounds. Notably, for participant 02F46, besides VOT, voice quality measures (H1*–H2* and CPP) appear to be more salient than amplitude or f0. For others, such as participant 28F22 and 29F21, VOT stands out as the most distinguishing measurement for differentiating between bilabial implosives and bilabial plosives.

Figure 16 Standardized differences between implosives and plosives per acoustic dimension for each participant.

4 General discussion

This study sought to explore bilabial implosives in an understudied Bantu language, Shimaore, via various acoustical analyses. First, /ɓ/ has significantly shorter VOT than /b/. For the implosive, amplitude and f0 are also higher during oral closure and at the beginning of the vowel. For voice quality, H1*–H2* is higher in plosives than in implosives, but there is little difference between the sound pairs for HNR or CPP values, which suggests the presence of glottal constriction. Individual differences were also observed regarding these acoustic correlates. Standardized differences between correlates show VOT as the principal factor for distinguishing the two sounds, but that amplitude and f0 are also factors to consider.

In terms of how these results compare to other languages, for VOT and amplitude during oral closure, Shimaore implosives appear to behave similarly to those in Hausa (Lindau Reference Lindau1984) and Sindhi (Nihalani 2006), with short VOT and gradual amplitude increase before release (Ladefoged & Maddieson Reference Ladefoged and Maddieson1996). Second, high f0 values during oral closure and at the beginning of the vowel are like Bantu Mpieme (Nagano-Madsen & Thornell Reference Nagano-Madsen and Thornell2012). These high f0 values during oral closure support the theory that larynx lowering induces higher pitch (Ohala Reference Ohala1976, Hombert, Ohala & Ewan Reference Hombert, Ohala and Ewan1979). Thus, it appears that, for some participants, pitch may be used in sound pair discrimination (Kirby Reference Kirby2018, Cho et al. Reference Cho, Whalen and Docherty2019). Regarding phonation, results do not support the idea that bilabial implosives in Shimaore are accompanied by creaky voice, as seen in other languages (Ladefoged & Maddieson Reference Ladefoged and Maddieson1996, Gordon & Ladefoged Reference Gordon and Ladefoged2001). That is, while H1*–H2* values are higher for implosives, which suggest non-modal phonation, HNR and CPP readings do not suggest creaky voice. It appears that Shimaore implosives have a phonation somewhere between modal and creaky voice when considering the glottal states. That is, the spectral tilt and noise values support the argument that bilabial implosives have more glottal constriction than plosives in Shimaore.

One question asked in this study concerns considering the argument that implosiveness should be considered on a gradient. If these sounds exist on a cline, then it is important to know how individuals distinguish these sound pairs. Whereas VOT stands out as the most important correlate, it appears that several acoustic dimensions factor into creating this distinction. Where VOT is less distinguishing, amplitude and f0 contribute to sound differentiation. That is, implosives with longer VOT values may still be perceived as implosives via higher amplitude and f0 values during the oral closure and the beginning of following vowel. Considering that f0 and amplitude differences are signs of larynx lowing, it appears that acoustic correlates of larynx lowering are important for distinguishing implosives from plosives in Shimaore. Likewise, for some participants, smaller differences between f0 and amplitude values suggest the realization of implosives that would put them closer to plosives on the implosive–plosive gradient in terms of larynx lowering. Further studies are needed to address the argument that implosives and plosives exist on a gradient, including how this might affect IPA conventions and the need for diacritic marks to indicate the degree of implosiveness.

Furthermore, while glottal constriction and voice quality appear to be less salient dimensions, they may still be important for some individuals, such as participants 02F46 and 95F22. Indeed, it is important to consider individual differences. For example, there is the possibility that some speakers may produce implosives that are much closer to plosives, as was the case for participant 30F21. This individual may in fact be in the process of losing this distinction, and the most salient dimension of loss may be the lack of distinction for VOT. It is speculative at best to discuss why this individual produces these sounds so similarly and further research is needed in order to become informed about the phenomenon. Nevertheless, the language contact factor comes to mind.

That is, since the departmentalization of the island, the French language is becoming more and more dominant in the Maore society, with children spending time in classrooms learning French from the age of three. Many Maore parents privilege French in the household, as they associate the language with academic and economic success. Indeed, it is the language used in administrative, educational and many economic contexts. This shift in language use could be a factor in the realization of implosives in Shimaore. Studies have shown a delay in developing implosive sounds in child language acquisition and in second language learning (Lewis & Roux Reference Lewis and Roux1999, Cissé, Demolin & Vallée Reference Cissé, Demolin and Vallée2011, Alerechi Reference Alerechi, Jason Kandybowicz, Torrence and Duncan2019). If learned late in childhood, it is possible that the realization of implosives may change in terms of larynx lowering, glottal constriction and/or VOT. Concerning plosives, studies have also shown that language contact situations lead to variation in VOT, for example with /b/ in the English (Scobbie Reference Scobbie, Goldsetin, Whalen and Best2006). This may be observed with Shimaore and the differentiation between plosive and implosives in terms of VOT.

Finally, these findings raise many sociophonetic questions, specifically concerning variation and language contact. Indeed, Cho et al. (Reference Cho, Whalen and Docherty2019) stress the need to better understand sociophonetic aspects of sounds. Beyond effects of language learning and language contact, it is possible that implosives and their markedness are implicated in language use and identity (Eckert Reference Eckert2012, Hay & Drager Reference Hay and Drager2007). For example, speakers may associate the use of plosives rather than implosives in Shimaore as an indicator of being more French than Maore. Likewise, implosive use may index an identity limited in its connections to French, such that speakers may employ the sound when aligning themselves with a non-French local identity. Research is needed to explore variation in the realization of implosives on several levels, including the phonetic context, age, gender, village, SES (socioeconomic status), among other factors. Analyses of sound production in informal situations are needed, particularly considering that f0 production may vary by context, such as reading versus colloquial speech (Kirby Reference Kirby2014). Time and sociophonetic research may tell how these sounds change, remain stable and index meaning on the island. This is important not only for bilabial implosives, but also alveolar implosives, for which there is little to no research on their realization in various contexts. In addition, identification of which factors provoke the use of the palatal implosive is also for the moment unknown. Further studies are needed to understand the sociophonetic context of all implosives in the Bantu languages spoken in Mayotte.

Acknowledgements

I would like to thank Marc Garellek, Kelly Berkson and Alexei Kochetov for their thorough and thoughtful feedback, as well as Qandeel Hussain for his initial feedback on analyzing implosives and James Kirby for feedback on my poster presentation of implosives in Shimaore during the The 5th International Workshop on Sound Change. The publication of the present article was funded by CUFR de Mayotte.

Appendix. Wordlist and sample images

The study’s wordlist and some corresponding images from the International Picture Naming Project (https://crl.ucsd.edu/experiments/ipnp/).

Footnotes

1 For example, for the only printed Shimaore-French dictionary, entries show about 165 entries for ɓ/b versus 80 entries for ɓ/d (Blanchy Reference Blanchy1996). Among these entries, a small fraction is plosive, based on the author’s observations, because the dictionary does not differentiate between the two sounds. Note that the dictionary is not exhaustive and is missing loanwords from French, such as ‘bibiro’ (‘bottle’).

2 Rombi (Reference Rombi1989) describes a palatal implosive for the Grand Comorian language Shingazidja. Ahmed-Chamanga (Reference Ahmed-Chamanga2017) does not mention the palatal implosive in his description of Shindzuani. Both language varieties are present in many villages in Mayotte.

3 In the sample from this study, 11 participants produced [ula], 12 produced [uɟa], three produced [uʄa] and two produced what appeared to be something between a plosive and implosive voiced palatal. The /l/ is arguably used in Shindzuani.

4 The asterisk indicates corrected values, which are done to correct for formant frequencies and bandwidths. It is often standard to use corrected values when analyzing spectral tilt in order to have a better estimate of it (from the voice source) and allow for cross-vowel comparisons. The approach used in this paper, PraatSauce (Kirby Reference Kirby2020), adopts the formula from Iseli, Shue & Alwan (Reference Iseli, Shue and Alwan2007).

5 Recordings for these words are not provided

References

Ahmed-Chamanga, Mohamed. 2017. Introduction à la grammaire structural du comorien, vol. II: Le shiNdzuani. Moroni: KomEDIT.Google Scholar
Alerechi, Roseline. 2019. Consonant substitution in child language (Ikwere). In Jason Kandybowicz, Travis Major, Torrence, Harold & Duncan, Philip T. (eds.), African linguistics on the prairie: Selected papers from the 45th Annual Conference on African Linguistics, 135155. Berlin: Language Science Press. doi:10.5281/zenodo.1251724.Google Scholar
Ashby, Michael. 1990. Articulatory possibilities for implosives. Journal of the International Phonetic Association 20(2), 1518.CrossRefGoogle Scholar
Association, SHIME. 2016. Apprenons le shimaore. Narifundrihe shimaore. Dzoumogne, Mayotte: Association SHIME.Google Scholar
Barreda, Santiago. 2015. phonTools: Functions for phonetics in R. R package version 0.2-2.1Google Scholar
Bates, Elizabeth, Federmeier, Kara D., Herron, Dan, Gowri Krovi Iyer, Thomas Jacobsen, Thomas Pechmann, Simona D'Amico, Antonella Devescovi, Nicole Wicha, Araceli Orozco-Figeuroa, Kathryn Kohnert, Gabriel Gutierrez, Ching Ching Lu, Daisy Hung, Jean Hsu, Ovid Tzeng, Elena Andonova, Irina Gerdjikova, Teodora Mehotcheva, Anna Székely & Csaba Pléh. 2000. Introducing the CRL International Picture-Naming Project (CRL-IPNP). Center for Research in Language Newsletter 12(1), 114.Google Scholar
Blanchy, Sophie. 1996. Dictionnaire mahorais–français, français–mahorais. Paris: L’Harmattan.Google Scholar
Boersma, Paul & Weenink, David. 2019. Praat: Doing phonetics by computer (version 6.1.01). http://www.praat.org, 6 September 2022.Google Scholar
Catford, J. C. 1939. On the classification of stop consonants. Le Maître Phonétique 65, 25. [Reprinted in Journal of the International Phonetics Association 40(3), 287–291, 2010.]Google Scholar
Cheucle, Marion. 2014. Étude comparative des langues makaa–njem (bantu A80): Phonologie, morphologie, lexique. Vers une reconstruction du proto-A80. Ph.D. dissertation, Université Lumière Lyon 2.Google Scholar
Cho, Taehong, Whalen, D. H. & Docherty, Gerard. 2019. Voice onset time and beyond: Exploring laryngeal contrast in 19 languages. Journal of Phonetics 72, 5265.CrossRefGoogle Scholar
Cissé, Ibrahima Abdoul Hayou, Demolin, Didier & Vallée, Nathalie. 2011. The acquisition of plosives and implosives by a Fulfulde-speaking child aged from 5 to 10;29 months. Proceedings from 17th International Congress of Phonetic Sciences (ICPhS XVII), Hong Kong, 500–503.Google Scholar
Clements, George N. & Osu, Silvester. 2002. Explosives, implosives, and nonexplosives: The linguistic function of air pressure differences in stops. In Gussenhoven, Carlos & Warner, Natasha (eds.), Laboratory Phonology, vol. 7, 299350. Berlin: Mouton de Gruyter.Google Scholar
Coburn, Jeremy & Hjortnaes, Nils. 2019. A phonetic study of Swahili voiced stops. The Journal of the Acoustical Society of America 145(3), 1928.CrossRefGoogle Scholar
Cun, Xi. 2009. A phonetic study on implosives in China. Ph.D. dissertation, Hong Kong University of Science and Technology. doi:10.14711/thesis-b1044466.CrossRefGoogle Scholar
Davidson, Lisa. 2020. The versatility of creaky phonation: Segmental, prosodic, and sociolinguistic uses in the world’s languages. Wiley Interdisciplinary Reviews: Cognitive Science, e1547.Google Scholar
Demolin, Didier. 1995. The phonetics and phonology of glottalized consonants in Lendu. In Connell, Bruce & Arvanati, Amalia (eds.), Phonology and phonetic evidence (Papers in Laboratory Phonology IV), 368385. Cambridge: Cambridge University Press.CrossRefGoogle Scholar
Demolin, Didier, Ngonga-Ke-Mbembe, Hubert & Soquet, Alain. 2002. Phonetic characteristics of an unexploded palatal implosive in Hendo. Journal of the International Phonetic Association 32, 115.CrossRefGoogle Scholar
Draxler, Christoph & Jänsch, Klaus. 2014. SpeechRecorder, LINDAT/CLARIAH-CZ digital library at the Institute of Formal and Applied Linguistics (ÚFAL), Faculty of Mathematics and Physics, Charles University. http://hdl.handle.net/11372/LRT-307, 6 September 2022.Google Scholar
Eckert, Penelope. 2012. Three waves of variation study: The emergence of meaning in the study of sociolinguistic variation. Annual Review of Anthropology 41, 87100.CrossRefGoogle Scholar
Garellek, Mark. 2019. The phonetics of voice. In Katz, William F. & Assmann, Peter F. (eds.), The Routledge handbook of phonetics, 75106. Oxford: Routledge.CrossRefGoogle Scholar
Gordon, Matthew & Ladefoged, Peter. 2001. Phonation types: A cross-linguistic overview. Journal of Phonetics 29(4), 383406.CrossRefGoogle Scholar
Greenberg, Joseph H. 1970. Some generalizations concerning glottalic consonants, especially implosives. International Journal of American Linguistics 36, 123145.CrossRefGoogle Scholar
Grimm, Nadine. 2019. Implosives in Bantu A80? The case of Gyeli. In Emily Clem, Peter Jenks & Hannah Sande (eds.), Theory and description in African linguistics: Selected papers from the 47th Annual Conference on African Linguistics, 135–153. Berlin: Language Science Press.Google Scholar
Gu, Chong. 2002. Smoothing Spline ANOVA models. New York: Springer.CrossRefGoogle Scholar
Gu, Chong. 2021. Package ‘gss’. https://cran.r-project.org/web/packages/gss/gss.pdf, 6 September 2022.Google Scholar
Guthrie, Malcolm. 1948. The classification of the Bantu languages. London: Oxford University Press for the International African Institute.Google Scholar
Hay, Jennifer & Drager, Katie. 2007. Sociophonetics. Annual Review of Anthropology 36, 89103.CrossRefGoogle Scholar
Hillenbrand, James M., Cleveland, Ronald A. & Erickson, Robert L.. 1994. Acoustic correlates of breathy vocal quality. Journal of Speech and Hearing Research 37, 769778.CrossRefGoogle ScholarPubMed
Hombert, Jean-Marie. 1978. Consonant types, vowel quality, and tone. In Fromkin, Victoria (ed.), Tone: A linguistic survey, 77111. New York: Academic Press.CrossRefGoogle Scholar
Hombert, Jean-Marie, Ohala, John J. & Ewan, William. 1979. Phonetic explanations for the development of tones. Language 55, 3758.CrossRefGoogle Scholar
Hussain, Qandeel. 2018. A typological study of Voice Onset Time (VOT) in Indo-Iranian languages. Journal of Phonetics 71, 284305.CrossRefGoogle Scholar
Hussain, Qandeel. 2021. Phonetic correlates of laryngeal and place contrasts of Burushaski. Speech Communication 26, 7189.CrossRefGoogle Scholar
Insee. 2007. Mayotte: les langues en 2007. https://www.insee.fr/fr/statistiques/2569783#consulter, 6 September 2022.Google Scholar
Insee Mayotte Infos. 2014. Les difficultés face à l’écrit en langue française. Quatre jeunes sur dix en grande difficulté à l’écrit à Mayotte. Population 70, 16.Google Scholar
Iseli, Markus, Shue, Yen-Liang & Alwan, Abeer. 2007. Age, sex and vowel dependencies of acoustic measures related to the voice source. The Journal of the Acoustical Society of America 121(4), 22832295.CrossRefGoogle Scholar
Jackson, Michel, Ladefoged, Peter, Huffman, Marie & Antoñanzas-Barroso, Norma. 1985. Measures of spectral tilt. UCLA Working Papers in Phonetics 61, 7278.Google Scholar
Jackson, Michel, Ladefoged, Peter, Huffman, Marie & Antoñanzas-Barroso, Norma. 1986. Automated measures of spectral tilt. UCLA Working Papers in Phonetics 62, 7788.Google Scholar
Jamet, Robert. 2016. Dictionnaire kibushi–français. Dzoumogné, Maotte: Association SHIME.Google Scholar
Jessen, Michael & Justus C. Roux. 2002. Voice quality differences associated with stops and clicks in Xhosa. Journal of Phonetics 30, 1–52.Google Scholar
Johansen Alnet, Aimee. 2009. The clause structure of the Shimaore dialect of Comorian (Bantu). Ph.D. dissertation, University of Illinois at Urbana–Champaign.Google Scholar
Keating, Patricia, Esposito, Christina M., Garellek, Mark, Sameer ud Dowla Khan & Kuang, Jiangjin. 2010. Phonation contrasts across languages. UCLA Working Papers in Phonetics 108, 188202.Google Scholar
Khan, Sameer ud Dowla. 2012. The phonetics of contrastive phonation in Gujarati. Journal of Phonetics 40, 780795.CrossRefGoogle Scholar
Kirby, James P. 2014. Incipient tonogenesis in Phnom Penh Khmer: Acoustic and perceptual studies. Journal of Phonetics 43, 6985.CrossRefGoogle Scholar
Kirby, James P. 2018. Onset pitch perturbations and the cross-linguistic implementation of voicing: Evidence from tonal and non-tonal languages. Journal of Phonetics 71, 326354.CrossRefGoogle Scholar
Kirby, James P. 2020. PraatSauce: Praat-based tools for spectral analysis. https://github.com/kirbyj/praatsauce, 6 September 2022.Google Scholar
Ladefoged, Peter. 1971. Preliminaries to linguistic phonetics. Chicago, IL: The University of Chicago Press.Google Scholar
Ladefoged, Peter, Iris Kemeny & William Brackenridge. 1976. Acoustic Effects of Style of Speech. The Journal of the Acoustical Society of America 59(1), 228–231.Google Scholar
Ladefoged, Peter & Maddieson, Ian. 1996. The sounds of the world’s languages. Oxford: Basil Blackwell.Google Scholar
Laroussi, Foued. 2015. La transmission des langues et cultures à Mayotte. Enjeux identitaires pour la famille et l’école. La Revue Internationale de l’Education Familiale 38, 2748.Google Scholar
Lewis, Phlip & Roux, Justius C. 1999. Xhosa L2 speech intelligibility: Ejectives, implosives and clicks. In John J. Ohala, Yoko Hasegawa, Manjari Ohala, Daniel Granville & Ashlee C. Bailey (eds.), Proceedings of the 14th International Congress of Phonetic Sciences (ICPhS XIV), San Francisco, 1149–1152. Berkeley, CA: University of California Press.Google Scholar
Lex, Gloria. 1994. Le dialecte peul du Fouladou (Casamance – Sénégal): Étude phonétique et phonologique. Ph.D. dissertation, Université de Paris 3 (Sorbonne Nouvelle).Google Scholar
Lindau, Mona. 1984. Phonetic differences in glottalic consonants. Journal of Phonetics 12, 147155.CrossRefGoogle Scholar
Lisker, Leigh & Arthur S. Abramson. 1964. A cross-language study of voicing in initial stops: Acoustical measurements. Word 20(3), 384–422.CrossRefGoogle Scholar
Maddieson, Ian. 2003. The sounds of the Bantu languages. In Nurse, Derek & Philippson, Gerard (eds.), The Bantu languages, 1541. London & New York: Routledge.Google Scholar
Maddieson, Ian. 2013. Glottalized consonants. In Dryer, Matthew S. & Haspelmath, Martin (eds.), The world atlas of language structures online. Leipzig: Max Planck Institute for Evolutionary Anthropology. http://wals.info/chapter/7, 6 September 2022.Google Scholar
Maddieson, Ian & Sands, Bonny. 2019. The sounds of the Bantu languages. In Van de Velde et al. (eds.), 79–127.Google Scholar
McLaughlin, Fiona. 2005. Voiceless implosives in Seerer-Siin. Journal of the International Phonetic Association 35(2), 201214.CrossRefGoogle Scholar
Midtlyng, Patrick. 2011. The effects of speech rate on VOT for initial plosives and click accompaniments in Zulu. In Eyamba G. Bokamba, Ryan K. Shosted & Bezza Tesfaw Ayalew (eds.), Selected Proceedings of the 40th Annual Conference on African Linguistics, 105–118. Somerville, MA: Cascadilla Proceedings Project.Google Scholar
Mori, Miki. 2018. The linguistic future of Mayotte concerning language policy, multilingual education and the written form of the native languages: The attitudes of undergraduate students. Carnets de Recherches de l’océan Indien 1(1), 113130.Google Scholar
Mori, Miki. 2021. L’identité, les langues et l’école: une analyse des attitudes linguistiques chez le professeur des écoles stagiaire mahorais. In Priolet, Maryvonne (ed.), L’école à Mayotte. Approches plurielles, 209258. Paris: Sépia.Google Scholar
Nagano-Madsen, Yasuko & Thornell, Christina. 2012. Acoustic properties of implosives in Bantu Mpiemo. Proceedings from FONETIK 2012, 7376.Google Scholar
Naidoo, Shamila. 2010. A re-evaluation of the Zulu implosive [ɓ]. South African Journal of African Languages 30(1), 110.CrossRefGoogle Scholar
Nihalani, Paroo. 1986. Phonetic implementation of implosives. Language and Speech 29(3), 253262.CrossRefGoogle Scholar
Nihalani, Paroo. 2006. An interface between phonetics and phonology: Evidence from implosives. Proceedings of the 7th International Seminar on Speech Production (ISSP 2006).Google Scholar
Nurse, Derek & Hinnebusch, Thomas J.. 1993. Swahili and Sabaki: A linguistic history (University of California Publications in Linguistics 121). Berkeley, CA: University of California Press.Google Scholar
Ohala, John J. 1976. A model of speech aerodynamics. Report of the Phonology Laboratory 1, 93107.Google Scholar
Painter, Colin. 1978. Implosives, inherent pitch, tonogenesis and laryngeal mechanisms. Journal of Phonetics 6, 249274.CrossRefGoogle Scholar
Patin, Cédric, Mohamed-Soyir, Kassim & Kisseberth, Charles. 2019. Ngazidja G44A. In Van de Velde et al. (eds.), 591–614.Google Scholar
Philippson, Gerard. 1988. L’accentuation du Comorian: essai d’analyse métrique. Etudes Océan Indien 9, 3579.Google Scholar
R Core Team. 2017. R: A language and environment for statistical computing. Vienna: R Foundation for Statistical Computing. https://www.R-project.org/, 6 September 2022.Google Scholar
Renaud, Patrick. 1976. Le bajeli. Phonologie, morphologie nominale, vols. 1–2. Yaounde: Les dossiers de l’ALCAM.Google Scholar
Rombi, Marie-Françoise. 1983. Le shimaore (île de Mayotte, Comores): Première approche d’un parler de la langue comorienne. Paris: Société d’études linguistiques et anthropologiques de France (SELAF).Google Scholar
Rombi, Marie-Françoise. 1989. La nasalité en Ngazija. Revue d’Ethnolinguistique: Cahiers du LACITO 4, 5981.Google Scholar
Scobbie, James. 2006. Flexibility in the face of incompatible English VOT systems. In Goldsetin, Louis, Whalen, D. H. & Best, Chatherine T. (eds.), Laboratory Phonology 8, 367392. Berlin & New York: Mouton de Gruyter.CrossRefGoogle Scholar
Seyfarth, Scott & Garellek, Mark. 2018. Plosive voicing acoustics and voice quality in Yerevan Armenian. Journal of Phonetics 71, 425450.CrossRefGoogle Scholar
Thomas, Erik R. 2011. Sociophonetics: An introduction. New York: Palgrave Macmillan.CrossRefGoogle Scholar
Thornell, Christina & Nagano-Madsen, Yasuko. 2004. Preliminaries to the phonetic structure of the Bantu language Mpiemo. Africa & Asia 4, 163180.Google Scholar
Van de Velde, Mark, Bostoen, Koen, Nurse, Derek & Philippson, Gérard (eds.). 2019. The Bantu languages, 2nd edn. (Routledge Language Family Series). London: Routledge.CrossRefGoogle Scholar
Wright, Richard & Shryock, Aaron. 1993. The effects of implosives on pitch in SiSwati. Journal of the International Phonetic Association 23(1), 1623.CrossRefGoogle Scholar
Figure 0

Figure 1 Map of the Comorian Archipelago in the Mozambique Channel.

Figure 1

Figure 2 Approximate distribution of the local languages and their varieties in Mayotte.

Figure 2

Table 1 Consonants and vowels of Shimaore, with marginal consonant phonemes shown in parentheses. Grey areas indicate articulations judged impossible.

Figure 3

Table 2 Word list.

Figure 4

Figure 3 VOT (ms) boxplots by word.

Figure 5

Figure 4 Waveform and spectrogram (Frequency from 0–5000 Hz) for ‘bbibbi’ from participant 06M23.

Figure 6

Figure 5 Waveform and spectrogram (Frequency from 0–5000 Hz) for ‘bibi’ from participant 06M23.

Figure 7

Figure 6 Waveform and spectrogram (Frequency from 0–5000 Hz) for ‘bbibbi’ from participant 12F34.

Figure 8

Figure 7 Waveform and spectrogram (Frequency from 0–5000 Hz) for ‘bibi’ from participant 12F34.

Figure 9

Figure 8 Amplitude (Db) from 100 ms before to 100 ms after consonant release.

Figure 10

Figure 9 F0 in semitones from 100 ms before to 100 ms after consonant release.

Figure 11

Figure 10 H1*–H2* values (Db) during the first 100 ms of the vowel.

Figure 12

Table 3 Harmonics-to-noise ratio (HNR) and cepstral peak performance (CPP) values for the first 20 ms of vowel.

Figure 13

Figure 11 VOT (ms) boxplots by participant, distinguished by consonant type.

Figure 14

Figure 12 Amplitude (Db) per participant from 100 ms before to 100 ms after consonant release.

Figure 15

Figure 13 F0 in semitones per participant from 100 ms before to 100 ms after consonant release.

Figure 16

Figure 14 Waveform and spectrogram (Frequency from 0–5000 Hz) for ‘bbueni’ from participant 17F20.

Figure 17

Figure 15 Waveform and spectrogram (Frequency from 0–5000 Hz) for ‘bbueni’ from participant 02F46.

Figure 18

Figure 16 Standardized differences between implosives and plosives per acoustic dimension for each participant.