Hostname: page-component-669899f699-7tmb6 Total loading time: 0 Render date: 2025-05-06T02:05:22.968Z Has data issue: false hasContentIssue false

Kam (Rongjiang Variety)

Published online by Cambridge University Press:  28 April 2025

Jie Yang
Affiliation:
School of Liberal Arts, Minzu University of China, Beijing, China
Xiyu Wu
Affiliation:
Laboratory of Language Sciences, Peking University, Beijing, China, Department of Chinese Language and Literature, Peking University, Beijing, China
Jiangping Kong*
Affiliation:
Laboratory of Language Sciences, Peking University, Beijing, China, Department of Chinese Language and Literature, Peking University, Beijing, China
*
*Corresponding author. Email: [email protected]
Rights & Permissions [Opens in a new window]

Abstract

The Rongjiang variety of Kam described in the present study is a southern Dong dialect (ISO 639-3: [kmc]), which belongs to the Tai-Kadai Languages (Edmondson & Luo 2008; Pittayaporn 2021).

Type
Illustration of the IPA
Creative Commons
Creative Common License - CCCreative Common License - BYCreative Common License - ND
This is an Open Access article, distributed under the terms of the Creative Commons Attribution-NoDerivatives licence (https://creativecommons.org/licenses/by-nd/4.0/), which permits re-use, distribution, and reproduction in any medium, provided that no alterations are made and the original article is properly cited.
Copyright
© The Author(s), 2025. Published by Cambridge University Press on behalf of The International Phonetic Association

The Rongjiang variety of Kam described in the present study is a southern Dong dialect (ISO 639-3: [kmc]), which belongs to the Tai-Kadai Languages (Edmondson & Luo Reference Diller, Edmondson and Luo2008; Pittayaporn Reference Pittayaporn, Sidwell and Jenny2021).

Rongjiang (榕江) is located in southeastern Guizhou Province, in the south of Qiandongnan Miao and Dong Autonomous Prefecture, straddling the Pearl River and Yangtze River basins, as shown in Figure 1. It is adjacent to Liping (黎平) and Congjiang (从江) to the east, Libo (荔波) to the south, Sandu Sui Autonomous County (三都水族自治县) and Leishan County (雷山县) in Qiannan Prefecture (黔南州) to the west, and Jianhe County (剑河县) to the north, with a total area of 3,300 square kilometers. Rongjiang has a total population of 379,900, of which ethnic minorities account for about 85 percent, with Dong, Miao, Han, and other ethnic groups dominating the county’s population. The majority of the county can speak the Kam language, with the exception of a few areas.

Figure 1. Distribution of Kam. The areas of the northern dialect are shaded light red and the areas of the southern dialect are shaded blue. The map is drawn from C1–12 of Language Atlas of China (中国语言地图集).

The plosives of Kam in Rongjiang are divided into aspirated and unaspirated, except for /ʔ/. It has a rich tone inventory, with 15 tones, and the tones are closely linked to consonant aspiration and vowel length. Liang (Reference Liang1980), Zheng & Yang (Reference Zheng and Yang1985), Wu (Reference Wu2012), among others, have studied its phonetic system.

The present Illustration is based on data collected with a male native Kam speaker aged 50, an elementary school teacher in former Chejiang Township (now merged into Gucheng Town), Rongjiang County, who has not spent long periods of time outside of the area. The monosyllabic morphemes used in the investigation are from the Dong Han Common Dictionary (侗汉常用词典), and totaled 1,726.

The phonetic characteristics of Kam are analyzed and described on the basis of acoustic data. The devices used during the recording sessions were a laptop computer (Thinkpad X1 Carbon Gen 10, Beijing, China), an external sound card (Sound Blaster X-Fi Surround 5.1 Pro, Creative Labs, Singapore), a mixing console (Behringer XENYX 302USB Premium 5-Input Mixer, Beijing, China), and a unidirectional collar clip microphone (ECM-44B, SONY, Tokyo, Japan). The software was Adobe Audition 2023, recording with a sampling rate of 44.1k Hz and 16-bit resolution. The recordings were carried out in a quiet room. The basic speech parameters are extracted by Praat software (Boersma & Weenink Reference Boersma and Weenink2020). It should be noted that Chao’s five-scale pitch system is applied to transcribe the tones throughout this article (Chao Reference Chao1980).

Consonants

As shown in Consonants table, the consonants of Kam comprise 26 phonemes. Plosives (with the exception of glottal) contrast in terms of aspiration: they may be either aspirated vs. unaspirated. The bilabial plosives, bilabial nasals, and lateral approximants contrast in terms of palatalization, and the velar plosives and velar nasals have labialized counterparts.

The following minimal and near-minimal pairs illustrate the contrasts summarized above:

Plosives

Plosives in Kam appear in 409 monosyllabic morphemes, accounting for 23.7 percent of the total. Kam plosives, all voiceless, show the places of articulation: bilabial, alveolar, palatal, velar, and glottal. In addition to the glottal, all plosives come with a general two-way contrast in aspiration. Figure 2 presents a spectrogram of /ka3⁵⁵/ ‘to pull’ and /kʰa2⁵/ ‘ear’. It intuitively demonstrates the difference in the voice onset time (VOT) between the two. The VOT of unaspirated plosives, that is, /p/, /t/, /c/ and /k/, is shorter with the release and voicing onset occurring within approximately 12 ms. On the other hand, the VOT of aspirated plosives, that is, /pʰ/, /tʰ/, /cʰ/ and /kʰ/, is about 55 ms, as shown in Figure 3.

Figure 2. Waveforms and spectrograms for the minimal pairs of /ka3⁵⁵/ ‘to pull’ vs. /kʰa2⁵/ ‘ear’ in Kam.

Figure 3. VOT of plosives in Kam, the unaspirated and aspirated are indicated by distinct colors. The mean and standard deviation were calculated using 10 tokens of each plosive from the single speaker.

In addition to appearing in the onset position, the unaspirated plosives /p/, /t/, and /k/ can also appear as stop codas following vowels. With the exception of a few modal particles, syllables beginning with a vowel are preceded by the glottal plosive /ʔ/.

Figure 4. FFT spectrum (blue line) and spectral envelope (cepstrally smoothed spectrum, red line) (made with a 23 ms window centred on the peak of noise intensity) of the frication in /sa3⁵⁵/ ‘ford’, /ça3⁵⁵/ ‘to cover’ and /ha22/ ‘only if’.

Figure 5. Waveforms and spectrograms for the minimal pairs /pa3⁵⁵/ ‘fish’ vs. /pja3⁵⁵/ ‘stone’. The red dotted lines represent the formants.

Nasals

Nasals are present in the different places of articulation: bilabial (/m/, /mj/), alveolar (/n/), palatal (/ɲ/), and velar (/ŋ/, /ŋw/). /m/ (as in /ma3⁵⁵/ ‘vegetable’ and /taːm3⁵⁵/ ‘handle’), /n/ (as in /na3⁵⁵/ ‘thick’ and /paːn3⁵⁵/ ‘male’) and /ŋ/ (as in /ŋa22/ ‘to slander’ and /paːŋ3⁵⁵/ ‘straw’) can occur both in onset and coda position, while /mj/ (as in /mja211/ ‘hand’), /ɲ/ (as in /ɲa3⁵⁵/ ‘river’) and /ŋw/ (as in /ŋwa211/ ‘to raise’) occur only in onset position.

Fricatives

There are three fricatives which are produced at three places of articulation: alveolar /s/, as in /sa3⁵⁵/ ‘ford’, palatal /ç/, as in /ça3⁵⁵/ ‘to cover’, and glottal /h/, as in /ha22/ ‘only if’. Fricative spectra (made with a 23 ms window centred on the peak of noise intensity) in three fricative samples of Kam are provided in Figure 4. The energy distribution of the three fricatives is different. /s/ has its energy mainly concentrated between 5000–10000 Hz. The energy of /ç/ is primarily between 4000–8000 Hz. As for /h/, its greatest energy is mainly below 2000 Hz, and it gradually decreases as the frequency rises.

Lateral approximants and approximants

There are two lateral approximants, /l/ and /lj/ and two approximants, /j/ and /w/.

Palatalized consonants

In Kam, the voiceless unaspirated bilabial plosive, voiceless aspirated bilabial plosive, voiced bilabial nasal, and lateral approximant each have a palatalized counterpart, namely, /pj/ (as in /pja3⁵⁵/ ‘stone’), /pʰj/ (as in /pʰje41/ ‘to give’), /mj/ (as in /mja211/ ‘hand’), and /lj/ (as in /lja211/ ‘to shake’). According to the spectra for plain /p/ and palatalized /p/ in Figure 5, vowels after a palatalized consonant begin with a high F2 similar to that of a /i/.

As shown in Figure 6, we statistically analyzed the mean F2 onset of the plain consonants and corresponding palatalized consonants. The mean F2 onset of plain consonants was around 1000 Hz, while that of palatalized consonants was around 2000 Hz. The F2 onset of palatalized consonants is significantly higher than that of plain consonants.

Figure 6. The mean F2 onset after plain vs. palatalized consonants in Kam. The mean and standard deviation were calculated using 10 tokens of each consonant from the single speaker.

Figure 7. Waveforms and spectrograms for the minimal pairs /ka3⁵⁵/ ‘to pull’ vs. /kwa3⁵⁵/ ‘to register’. The red dotted lines represent the formants.

It is worth noting that the consonants which have the palatalized contrast /p pʰ m l/ are slightly velarized, which is especially apparent when followed by /i/, enlarging the physiological and acoustic difference between plain vs. palatalized consonants.

Labialized consonants

Velar plosives and nasals have labialized counterparts: /kw/ (as in /kwa3⁵⁵/ ‘to register’), /kʰw/ (as in /kʰwa12/ ‘tail’ and /ŋw/ as in /ŋwa211/ ‘to raise’ (i.e. these consonants are pronounced with rounded lips), so that the consonants have some pronunciation features similar to /u/, as shown in Figure 7.

Vowels

Monophthongs

Kam has seven vowel phonemes: /a/, /aː/, /ə/, /e/, /i/, /o/ and /u/, among which /a/, /e/, /i/, /o/ and /u/ can be found in open syllables. All seven vowels can occur in closed syllables with nasal codas /m n ŋ/ and stop codas /p t k/. Stops have no audible release when they appear in coda position. It should be noted that, although /a/ is technically a low front vowel in the IPA, we are using this symbol to denote a low central vowel.

The vowel chart is plotted based on the relative F1 and F2 values of vowel phonemes in Kam, as shown in Figure 8. The values of formants are extracted from open syllables whose vowels are /a/, /e/, /i/, /o/ and /u/, and closed syllables whose vowels are /a/ and /ə/. We chose 10 syllables with different consonants for each vowel, then extracted the values of the stable segment of the formants using Praat software, with the default Settings.

Figure 8. Acoustic plots for the vowels of Kam, F1and F2 of each vowel were based on mean formant values of 10 open syllables. The ellipses show the F1 and F2 values to 2 standard deviations.

/a/ is a low central vowel, which can be a monophthong by itself or followed by nasal and stop codas. It has two allophones, [ɐ] and [ɑː]. When /a/ is the monophthong final, it is pronounced similarly to [ɑː] (e.g. /pa3⁵⁵/ ‘fish’, /ta3⁵⁵/ ‘eye’). And when it occurs in closed syllables, it is close to [ɐ] (e.g. /pan3⁵⁵/ ‘bamboo’, /tam3⁵⁵/ ‘pond’).

Table 1. Duration of /aː/ and /a/

Figure 9. Waveforms and spectrograms for the minimal pairs /taːm3⁵⁵/ ‘handle’ vs. /tam3⁵⁵/ ‘pond’.

Like /a/, /aː/ is a low central vowel, and /aː/ is pronounced close to [ɑː] (e.g. /paːn3⁵⁵/ ‘male’, /taːm3⁵⁵/ ‘handle’). The /aː/ and /a/ differ in length when followed by nasal codas or stop codas. As in Table 1, by separately calculating the average duration of /aː/ and /a/ in all the monosyllabic morphemes with nasal codas and stop codas in Kam, the mean duration of /aː/ is identified as 180 ms, and that of /a/ as 58 ms. It is noteworthy that with nasal codas, long nuclei have short codas, while short nuclei have long codas. For example, Figure 9 shows spectrograms of two syllables with a nasal coda, /taːm3⁵⁵/ ‘handle’ and /tam3⁵⁵/ ‘pond’. The duration of the two syllables is almost equal, but /aː/ is significantly longer than /a/, and in /am/, the duration of /m/ is longer than in /aːm/.

/ə/ has two allophones, [ə] and [ɤ]. When the coda is /m/ or /ŋ/, the nucleus is [ɤ] (e.g. /pəm323/ ‘plop’, /təŋ⁵1/ ‘black’). In other cases, the nucleus is pronounced [ə]. As in Figure 10, we calculated the average values of F1, F2 of syllables with /m/, /ŋ/ and with other codas, of which each category was calculated from seven tokens. F2 of /ə/ followed by /m ŋ/ is lower than that of /ə/ followed by other consonantal codas.

Figure 10. Two allophones [ə] and [ɤ] of /ə/ in F1 vs. F2 vowel space.

There are two allophones of /e/ in different contexts. When occurring in open syllables, /e/ is realized as [e]. In other cases, /e/ is pronounced with a tongue position a little higher than /ɛ/ and a little lower than /e/ (e.g. /pen22/ ‘to prepare’). As a high front vowel, /i/ has two allophones and is pronounced [i] when it follows a palatalized consonant (e.g. /lji211/ ‘centimeter’). In other cases, following the plain consonants which have plain vs. palatalized contrast (i.e. /p pʰ m l/, which, as mentioned above, are velarized apparently when followed by /i/), it is pronounced [əi] with a schwa-like onglide (e.g. /pi3⁵⁵/ ‘cup’, /pik42/ ‘crucian’). /u/ is a high back vowel. There are two allophones of /u/. When /u/ is used as a monophthong vowel, it is pronounced [u] (e.g. /pu3⁵⁵/ ‘to pick’, /ku3⁵⁵/ ‘whistle’); when it is a nucleus with a consonantal coda, it is pronounced [ʊ] (e.g. /pup323/ ‘lung’, /hut13/ ‘poor’).

Diphthongs

There are eight diphthongs in Kam, all of which only occur in open syllables. Table 2 presents the mean and standard deviation of F1 and F2 of these diphthongs, and each based on the measurements of 10 tokens at 20 percent and 80 percent of the vowel duration.

Table 2. Mean F1 and F2 of the diphthongs measured at 20 percent and 80 percent of the vowels

When followed by /i/, the F2 of /aː/ is higher than when followed by /u/, indicating that the articulation of /a/ is more fronted. In addition, we transcribe the diphthongs in /aːi/ and /aːu/ with a long /aː/ instead of with a short /a/. This is because we statistically averaged the duration of /aː/ among 10 monosyllabic morphemes containing /aːi/ and /aːu/ and found that the average duration is about 183 ms. The average duration of /aː/ with nasal or stop codas is 180 ms, which is similar to that of /a/ with vowel codas, so /a/ in /aːi/ and /aːu/ should be notated as the long vowel /aː/. In the diphthongs of /eu iu ou au/, the F1 and F2 of the /u/ in /eu/ are larger than the others, close to [ʊ].

Tones

Previous studies have shown that Kam has a total of 15 tones, nine unchecked and six checked, and the latter are split into long and short tones (Liang Reference Liang1980; Zheng & Yang Reference Zheng and Yang1985; Long Reference Long2018).

The pitch trajectory of each tone is illustrated in Figure 11. The f0 values of each tone were extracted using the Praat software, with the default Settings (Boersma & Weenink Reference Boersma and Weenink2020). Each tonal contour is obtained by averaging across five tokens. Tonal values are marked using the five-scale pitch system developed by Chao (Reference Chao1980), where 5 is highest and 1 is lowest. To facilitate observation and comparison, we have charted the pitch trajectory of unchecked and checked tones separately.

Figure 11. f0 contours of Kam tones for the one speaker; (a) unchecked tones; (b) checked tones.

Among the nine unchecked tones, the f0 contour of T355 goes from middle to high and then remains level towards the end. T25 is a low-high rising tone, which has a larger slope than T355. T211 is a low falling tone, and the f0 contour of T211 moves up from high to low and then remains level towards the end. T323 is a falling-rising tone and T12 is a low rising tone. T41 and T51 are both falling tones, but T41 is a mid-falling tone, and T51 is a high-falling tone. T451 is a rising-falling tone. T22 is the only level tone.

Among the six checked tones, there is one level tone, two rising tones, two falling tones and one falling-rising tone. T55 is a high level tone, and T34 is a high rising tone. T43 is a high falling tone. T323 is a falling-rising tone, which is the same contour as unchecked tone T323. T12 is a low rising tone. T42 is a low falling tone, of which the slope of the f0 contour is smaller than T43. It is noteworthy that the duration of T55, T34 and T43 is about 50 ms, while the duration of T323, T12 and T42 is more than 100 ms, which is significantly longer.

Some scholars have simplified the tone categories. Liang (Reference Liang1980) indicated that the distribution of checked and unchecked tones are in complementary distribution depending on the absence vs. presence of a coda, and claimed that the tone value of the six checked tones are the same or similar to the first six unchecked tones, thus simplifying the number of tones to nine. However, for the studies on language change and historical comparison of related languages, it would be more convenient to display all the 15 tone categories of Kam separately. In addition, as investigated in this study, the tone values of the mentioned unchecked and checked tones are not exactly the same. Thus, we keep the 15 tones of Kam separate.

The consonants and tones in Kam have strict cooccurrence restrictions. As shown in Table 3, aspirated plosives can only appear in unchecked tones T25, T12, T451 and checked tones T34 and T12, while unaspirated plosives can only appear in unchecked tones T355, T211, T323, T41, T51, T22 and checked tones T55, T43, T323 and T42, and other consonants can appear in all tones. According to the principle of complementary distribution, the aspirated plosives can be regarded as allophones of unaspirated plosives in the same position, distinguished by tones. However, we are currently unable to determine whether Kam speakers distinguish these phonemes by tone features or aspiration features, this needs to be investigated by perceptual experiments. Therefore, the aspirated and unaspirated plosives are temporarily divided into different phonemes, and the tones are not merged.

Table 3. Onset consonants and tones

There is a close relationship between checked tones and vowel length in Kam. When followed by the stop codas /p/, /t/, /k/, the nucleus vowels /a/ and /ə/ are short and appear only in T355, T34 and T43; when /aː/, /e/, /i/, and /u/ are the nuclei, they are long, and only appear in T323, T12 and T42. /o/ is special in that it can be used as a nucleus in all checked tones. When the coda is /p/, /t/, the vowel /o/ is long and only appears in T323, T12 and T42. When the stop coda is /k/, /o/ is long if it appears in T323, T12 and T42, and short if it appears in T355, T34 and T43. The long vowel [oː] and the short vowel [o] show a complementary distribution, so are merged and notated as /o/, as shown in Table 4.

Table 4. Checked tone distribution of vowels with stop codas

Syllable structure

Kam is a tone language, and all the consonants can be onsets. Vowels are divided into four categories: monophthongs, diphthongs, vowels with stop codas, and vowels with nasal codas. Therefore, there are four combinations of consonants and vowels, namely CV, CVV, CVS, and CVN. Among the 1,726 monosyllables investigated, CV, CVV, CVS, and CVN accounted for 19.5, 20.2, 19.0, and 41.3 percent, respectively. Table 5 shows the syllable structure and examples.

Transcription of the recorded passage

The passage used for the recordings is the story ‘The North Wind and the Sun,’ which is transcribed using the consonants, vowels, and tones of Kam described above. The transcription below is broad phonemic. The symbol ‘ǀ’ marks a pause and ‘ǁ’ marks the end of the sentence. The transcription of each sentence is presented in three versions: the first line is an IPA transcription; the second, in italics, is a Kam writing system transcription; and the third is the interlinear morphemic glossing. Abbreviations used in interlinear glossing follow the Leipzig Glossing Rules (LGR, http://www.eva.mpg.de/lingua/resources/glossing-rules.php). The non-standard abbreviation (not included in the LGR) is: prep = preposition. Additionally, a free English translation is also provided.

Table 5. Syllable structure and examples of Kam

Acknowledgments

This work was supported by the National Social Science Foundation of China (No. 22&ZD213). We are grateful to the journal’s editors, Marc Garellek and Marija Tabain, as well as the two anonymous reviewers, for their detailed comments and insightful suggestions, which have significantly contributed to improving the analysis and presentation of the Illustration. We also extend our sincere thanks to Prof. Youjing Lin, Dr. Changwei Liang and Dr. Jingyao Zheng for their valuable contributions.

References

Boersma, Paul & Weenink, David. 2020. PRAAT: Doing phonetics by computer (Version 6.1.16), http://www.praat.org/ Google Scholar
Chao, Yuanren. 1980. A system of “tone-letters”. Fangyan [Dialects] 2, 8183.Google Scholar
Diller, Anthony V. N., Edmondson, Jerold A., & Luo, Yongxian. 2008. The Tai-Kadai languages (1st ed.). London: Routledge.Google Scholar
Institute of Linguistics and Institute of Ethnology and Anthropology, Chinese Academy of Social Sciences & Research Centre on Linguistics and Language Information Sciences, City University of Hong Kong. 2012. Language atlas of China. Beijing: The Commercial Press.Google Scholar
Liang, Min. 1980. Dongyu Jianzhi [A Sketch of the Kam Language]. Beijing: Nationalities Press.Google Scholar
Long, Runtian. 2018. Dongyu shengdiao yanbian de hebing qushi tanxi [Research on tone merging in the Kam language]. Guizhou minzu yanjiu [Guizhou Ethnic Studies] 9, 194199.Google Scholar
Pittayaporn, Pittayawat. 2021. Typological profile of Kra-Dai languages. In Sidwell, Paul & Jenny, Mathias (eds.), The languages and linguistics of mainland southeast Asia: A comprehensive guide, 433468. Berlin and Boston: De Gruyter Mouton.CrossRefGoogle Scholar
Wu, Yongyi. 2012. A study of phonetic change in Chejiang Kam in the past 50 years. MA. Dissertation, Guizhou Minzu University.Google Scholar
Zheng, Guoqiao & Yang, . 1985. Rongjiang donghua de yuyin [The phonetics of Kam in the Rongjiang area]. Guizhou minzu yanjiu [Guizhou Ethnic Studies] 2, 138151.Google Scholar
Figure 0

Figure 1. Distribution of Kam. The areas of the northern dialect are shaded light red and the areas of the southern dialect are shaded blue. The map is drawn from C1–12 of Language Atlas of China (中国语言地图集).

Figure 1

Figure 2. Waveforms and spectrograms for the minimal pairs of /ka3⁵⁵/ ‘to pull’ vs. /kʰa2⁵/ ‘ear’ in Kam.

Figure 2

Figure 3. VOT of plosives in Kam, the unaspirated and aspirated are indicated by distinct colors. The mean and standard deviation were calculated using 10 tokens of each plosive from the single speaker.

Figure 3

Figure 4. FFT spectrum (blue line) and spectral envelope (cepstrally smoothed spectrum, red line) (made with a 23 ms window centred on the peak of noise intensity) of the frication in /sa3⁵⁵/ ‘ford’, /ça3⁵⁵/ ‘to cover’ and /ha22/ ‘only if’.

Figure 4

Figure 5. Waveforms and spectrograms for the minimal pairs /pa3⁵⁵/ ‘fish’ vs. /pja3⁵⁵/ ‘stone’. The red dotted lines represent the formants.

Figure 5

Figure 6. The mean F2 onset after plain vs. palatalized consonants in Kam. The mean and standard deviation were calculated using 10 tokens of each consonant from the single speaker.

Figure 6

Figure 7. Waveforms and spectrograms for the minimal pairs /ka3⁵⁵/ ‘to pull’ vs. /kwa3⁵⁵/ ‘to register’. The red dotted lines represent the formants.

Figure 7

Figure 8. Acoustic plots for the vowels of Kam, F1and F2 of each vowel were based on mean formant values of 10 open syllables. The ellipses show the F1 and F2 values to 2 standard deviations.

Figure 8

Table 1. Duration of /aː/ and /a/

Figure 9

Figure 9. Waveforms and spectrograms for the minimal pairs /taːm3⁵⁵/ ‘handle’ vs. /tam3⁵⁵/ ‘pond’.

Figure 10

Figure 10. Two allophones [ə] and [ɤ] of /ə/ in F1 vs. F2 vowel space.

Figure 11

Table 2. Mean F1 and F2 of the diphthongs measured at 20 percent and 80 percent of the vowels

Figure 12

Figure 11. f0 contours of Kam tones for the one speaker; (a) unchecked tones; (b) checked tones.

Figure 13

Table 3. Onset consonants and tones

Figure 14

Table 4. Checked tone distribution of vowels with stop codas

Figure 15

Table 5. Syllable structure and examples of Kam