Article contents
Rewriting the orthography of SMS messages
Published online by Cambridge University Press: 24 March 2010
Abstract
Electronic written texts used in computer-mediated interactions (emails, blogs, chats, and the like) contain significant deviations from the norm of the language. This paper presents the detail of a system aiming at normalizing the orthography of French SMS messages: after discussing the linguistic peculiarities of these messages and possible approaches to their automatic normalization, we present, compare, and evaluate various instanciations of a normalization device based on weighted finite-state transducers. These experiments show that using an intermediate phonemic representation and training, our system outperforms an alternative normalization system based on phrase-based statistical machine translation techniques.
- Type
- Papers
- Information
- Copyright
- Copyright © Cambridge University Press 2010
References
- 6
- Cited by