Review of research on applications of speech recognition technology to assist language learning

Rustam Shadiev; Jiawen Liu

doi:10.1017/S095834402200012X

Review of research on applications of speech recognition technology to assist language learning

Published online by Cambridge University Press: 14 July 2022

Rustam Shadiev

and

Jiawen Liu

Show author details

Rustam Shadiev: Affiliation:
Nanjing Normal University, China ([email protected])
Jiawen Liu: Affiliation:
Nanjing Normal University, China ([email protected])

Article contents

Abstract
References

Get access

Rights & Permissions

Abstract

Speech recognition technology (SRT) is now widely used in education because of its potential to aid learning, particularly language learning. Nevertheless, SRT has received only limited attention in earlier review studies. The present research aimed to address this gap in the field. To this end, 26 articles published in SSCI journals between 2014 and 2020 were selected and reviewed with respect to domain and skills, technology and their application, participants and duration, measures, reported results, and advantages and disadvantages of SRT. The results showed that English received much more attention than any other language, and scholars mostly focused on facilitating pronunciation skills. Dragon Naturally Speaking and Google speech recognition were the most popular technologies, and their most frequent application was providing feedback. According to the results, college students were involved in research more than any other group, most studies were carried out for less than one month, and most scholars administered a questionnaire or pre-/posttest to collect the data. Positive results related to gains in proficiency and student perceptions of SRT were identified. The study revealed that improved affective factors and enhanced language skills were advantages, whereas a low accuracy rate and insufficiency (i.e. lack of some useful features to support learning efficiently) of SRT were disadvantages. Based on the results, the study puts forward several implications and suggestions for educators and researchers in the field.

Keywords

review speech recognition technology language learning

Type: Research Article
Information: ReCALL , Volume 35 , Issue 1 , January 2023 , pp. 74 - 88

DOI: https://doi.org/10.1017/S095834402200012X [Opens in a new window]
Copyright: © The Author(s), 2022. Published by Cambridge University Press on behalf of European Association for Computer Assisted Language Learning

Access options

Get access to the full version of this content by using one of the access options below. (Log in options will check for institutional or personal access. Content may require purchase if you do not have access.)

Article purchase

Temporarily unavailable

References

Ahn, T. Y. & Lee, S.-M. (2016) User experience of a mobile speaking application with automatic speech recognition for EFL learning. British Journal of Educational Technology, 47(4): 778–786. https://doi.org/10.1111/bjet.12354 CrossRef Google Scholar

Arcon, N., Klein, P. D. & Dombroski, J. D. (2017) Effects of dictation, speech to text, and handwriting on the written composition of elementary school English language learners. Reading & Writing Quarterly, 33(6): 533–548. https://doi.org/10.1080/10573569.2016.1253513 CrossRef Google Scholar

Baker, E. A. (2017) Apps, iPads, and literacy: Examining the feasibility of speech recognition in a first-grade classroom. Reading Research Quarterly, 52(3): 291–310. https://doi.org/10.1002/rrq.170 CrossRef Google Scholar

Bodnar, S., Cucchiarini, C., de Vries, B. P., Strik, H. & van Hout, R. (2017) Learner affect in computerised L2 oral grammar practice with corrective feedback. Computer Assisted Language Learning, 30(3–4): 223–246. https://doi.org/10.1080/09588221.2017.1302964 CrossRef Google Scholar

Caseiro, N. & Santos, D. (Eds.). (2018). Smart specialization strategies and the role of entrepreneurial universities. Hershey, PA: IGI Global. Available from: https://www.igi-global.com/book/smart-specialization-strategies-role-entrepreneurial/197442 Google Scholar

Cavus, N. & Ibrahim, D. (2017) Learning English using children’s stories in mobile devices. British Journal of Educational Technology, 48(2): 625–641. https://doi.org/10.1111/bjet.12427 CrossRef Google Scholar

Creswell, J. W. (2014). Educational research: Planning, conducting, and evaluating quantitative. Boston, MA: Pearson Education.Google Scholar

Dalim, C. S. C., Sunar, M. S., Dey, A. & Billinghurst, M. (2020) Using augmented reality with speech input for non-native children’s language learning. International Journal of Human-Computer Studies, 134: 44–64. https://doi.org/10.1016/j.ijhcs.2019.10.002 CrossRef Google Scholar

de Vries, B. P., Cucchiarini, C., Bodnar, S., Strik, H. & van Hout, R. (2015) Spoken grammar practice and feedback in an ASR-based call system. Computer Assisted Language Learning, 28(6): 550–576. https://doi.org/10.1080/09588221.2014.889713 CrossRef Google Scholar

Duman, G., Orhon, G. & Gedik, N. (2015) Research trends in mobile assisted language learning from 2000 to 2012. ReCALL, 27(2): 197–216. https://doi.org/10.1017/S0958344014000287 CrossRef Google Scholar

Ehsani, F. & Knodt, E. (1998) Speech technology in computer-aided language learning: Strengths and limitations of a new CALL paradigm. Language Learning & Technology, 2(1): 54–73.Google Scholar

Haug, K. N. & Klein, P. D. (2018) The effect of speech-to-text technology on learning a writing strategy. Reading & Writing Quarterly, 34(1): 47–62. https://doi.org/10.1080/10573569.2017.1326014 CrossRef Google Scholar

Hsu, L. (2016) An empirical examination of EFL learners’ perceptual learning styles and acceptance of ASR-based computer-assisted pronunciation training. Computer Assisted Language Learning, 29(5): 881–900. https://doi.org/10.1080/09588221.2015.1069747 CrossRef Google Scholar

Liakin, D., Cardoso, W. & Liakina, N. (2017) Mobilizing instruction in a second-language context: Learners’ perceptions of two speech technologies. Languages, 2(3): 1–21. https://doi.org/10.3390/languages2030011 CrossRef Google Scholar

MacArthur, C. A., & Cavalier, A. R. (2004). Dictation and speech recognition technology as test accommodations. Exceptional Children, 71(1), 43–58.CrossRef Google Scholar

Matthews, J. & O’Toole, J. M. (2015) Investigating an innovative computer application to improve L2 word recognition from speech. Computer Assisted Language Learning, 28(4): 364–382. https://doi.org/10.1080/09588221.2013.864315 CrossRef Google Scholar

McCrocklin, S. M. (2016) Pronunciation learner autonomy: The potential of automatic speech recognition. System, 57: 25–42. https://doi.org/10.1016/j.system.2015.12.013 CrossRef Google Scholar

McKechnie, J., Ahmed, B., Gutierrez-Osuna, R., Monroe, P., McCabe, P. & Ballard, K. J. (2018) Automated speech analysis tools for children’s speech production: A systematic literature review. International Journal of Speech-Language Pathology, 20(6): 583–598. https://doi.org/10.1080/17549507.2018.1477991 CrossRef Google Scholar PubMed

Mirzaei, M. S., Meshgi, K., Akita, Y. & Kawahara, T. (2017) Partial and synchronized captioning: A new tool to assist learners in developing second language listening skill. ReCALL, 29(2): 178–199. https://doi.org/10.1017/S0958344017000039 CrossRef Google Scholar

Mroz, A. (2018) Seeing how people hear you: French learners experiencing intelligibility through automatic speech recognition. Foreign Language Annals, 51(3): 617–637. https://doi.org/10.1111/flan.12348 CrossRef Google Scholar

Oh, E. Y. & Song, D. (2021) Developmental research on an interactive application for language speaking practice using speech recognition technology. Educational Technology Research and Development, 69(2): 861–884. https://doi.org/10.1007/s11423-020-09910-1 CrossRef Google Scholar

Radha, V. & Vimala, C. (2012) A review on speech recognition challenges and approaches. World of Computer Science and Information Technology Journal, 2(1): 1–7.Google Scholar

Shadiev, R. & Huang, Y.-M. (2020) Investigating student attention, meditation, cognitive load, and satisfaction during lectures in a foreign language supported by speech-enabled language translation. Computer Assisted Language Learning, 33(3): 301–326. https://doi.org/10.1080/09588221.2018.1559863 CrossRef Google Scholar

Shadiev, R., Huang, Y.-M. & Hwang, J.-P. (2017) Investigating the effectiveness of speech-to-text recognition applications on learning performance, attention, and meditation. Educational Technology Research and Development, 65(5): 1239–1261. https://doi.org/10.1007/s11423-017-9516-3 CrossRef Google Scholar

Shadiev, R., Hwang, W.-Y., Chen, N.-S. & Huang, Y.-M. (2014) Review of speech-to-text recognition technology for enhancing learning. Journal of Educational Technology & Society, 17(4): 65–84.Google Scholar

Shadiev, R., Hwang, W.-Y., Huang, Y.-M. & Liu, C.-J. (2016) Investigating applications of speech-to-text recognition technology for a face-to-face seminar to assist learning of non-native English-speaking participants. Technology, Pedagogy and Education, 25(1): 119–134. https://doi.org/10.1080/1475939X.2014.988744 CrossRef Google Scholar

Shadiev, R., Sun, A. & Huang, Y.-M. (2019) A study of the facilitation of cross-cultural understanding and intercultural sensitivity using speech-enabled language translation technology. British Journal of Educational Technology, 50(3): 1415–1433. https://doi.org/10.1111/bjet.12648 CrossRef Google Scholar

Shadiev, R., Wang, X., Wu, T.-T. & Huang, Y.-M. (2021) Review of research on technology-supported cross-cultural learning. Sustainability, 13(3): 1–23. https://doi.org/10.3390/su13031402 CrossRef Google Scholar

Shadiev, R., Wu, T.-T., Sun, A. & Huang, Y.-M. (2018) Applications of speech-to-text recognition and computer-aided translation for facilitating cross-cultural learning through a learning activity: Issues and their solutions. Educational Technology Research and Development, 66(1): 191–214. https://doi.org/10.1007/s11423-017-9556-8 CrossRef Google Scholar

Shadiev, R. & Yang, M. (2020) Review of studies on technology-enhanced language learning and teaching. Sustainability, 12(2): 1–22. https://doi.org/10.3390/su12020524 CrossRef Google Scholar

Tsai, P. (2019) Beyond self-directed computer-assisted pronunciation learning: A qualitative investigation of a collaborative approach. Computer Assisted Language Learning, 32(7): 713–744. https://doi.org/10.1080/09588221.2019.1614069 CrossRef Google Scholar

van Doremalen, J., Boves, L., Colpaert, J., Cucchiarini, C. & Strik, H. (2016) Evaluating automatic speech recognition-based language learning systems: A case study. Computer Assisted Language Learning, 29(4): 833–851. https://doi.org/10.1080/09588221.2016.1167090 CrossRef Google Scholar

Wang, Y.-H. & Young, S. S.-C. (2014) A study of the design and implementation of the ASR-based iCASL system with corrective feedback to facilitate English learning. Journal of Educational Technology & Society, 17(2): 219–233.Google Scholar

Wang, Y.-H. & Young, S. S.-C. (2015) Effectiveness of feedback for enhancing English pronunciation in an ASR-based CALL system. Journal of Computer Assisted Learning, 31(6): 493–504. https://doi.org/10.1111/jcal.12079 CrossRef Google Scholar

Xiao, W. & Park, M. (2021) Using automatic speech recognition to facilitate English pronunciation assessment and learning in an EFL context: Pronunciation error diagnosis and pedagogical implications. International Journal of Computer-Assisted Language Learning and Teaching, 11(3): 74–91. https://doi.org/10.4018/IJCALLT.2021070105 CrossRef Google Scholar

Yu, P., Pan, Y., Li, C., Zhang, Z., Shi, Q., Chu, W., Liu, M. & Zhu, Z. (2016) User-centred design for Chinese-oriented spoken English learning system. Computer Assisted Language Learning, 29(5): 984–1000. https://doi.org/10.1080/09588221.2015.1121877 CrossRef Google Scholar

Yueh, H.-P., Lin, W., Liu, Y.-L., Shoji, T. & Minoh, M. (2014) The development of an interaction support system for international distance education. IEEE Transactions on Learning Technologies, 7(2): 191–196. https://doi.org/10.1109/TLT.2014.2308952 CrossRef Google Scholar

Shadiev and Liu supplementary material

File 41.9 KB

Article contents

Review of research on applications of speech recognition technology to assist language learning

Abstract

Keywords

Access options

Article purchase

Temporarily unavailable

References

Shadiev and Liu supplementary material

Save article to Kindle

Save article to Dropbox

Save article to Google Drive

Reply to: Submit a response

Your details

You have entered the maximum number of contributors

Conflicting interests