Dependency-based n-gram models for general purpose sentence realisation

YUQING GUO; HAIFENG WANG; JOSEF VAN GENABITH

doi:10.1017/S1351324910000288

Published online by Cambridge University Press: 29 November 2010

YUQING GUO: Toshiba (China) Research and Development Center, 5/F., Tower W2, Oriental Plaza, Dongcheng District, Beijing, 100738, China
HAIFENG WANG: Baidu, Inc., Baidu Campus, No. 10, Shangdi 10th Street, Haidian District, Beijing, 100085, China
JOSEF VAN GENABITH: NCLT/CNGL, School of Computing, Dublin City University, Glasnevin, Dublin 9, Ireland

Abstract

This paper presents a general-purpose, wide-coverage, probabilistic sentence generator based on dependency n-gram models. This is particularly interesting as many semantic or abstract syntactic input specifications for sentence realisation can be represented as labelled bi-lexical dependencies or typed predicate-argument structures. Our generation method captures the mapping between semantic representations and surface forms by linearising a set of dependencies directly, rather than via the application of grammar rules as in more traditional chart-style or unification-based generators. In contrast to conventional n-gram language models over surface word forms, we exploit structural information and various linguistic features inherent in the dependency representations to constrain the generation space and improve the generation quality. A series of experiments shows that dependency-based n-gram models generalise well to different languages (English and Chinese) and representations (LFG and CoNLL). Compared with state-of-the-art generation systems, our general-purpose sentence realiser is highly competitive with the added advantages of being simple, fast, robust and accurate.
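The idea of linearising a set of dependencies with an n-gram model over structural labels, rather than over surface words, can be illustrated with a toy example. What follows is a minimal sketch under stated assumptions, not the authors' implementation: the label inventory (`SUBJ`, `OBJ`, `DET`, `ADJ`, `HEAD`), the bigram order, the add-one smoothing, the exhaustive search over permutations and the training sequences are all hypothetical choices made for illustration.

```python
import itertools
import math
from collections import Counter


def train_bigrams(sequences):
    """Count label bigrams, padding each sequence with <s> and </s>."""
    bigrams, contexts, vocab = Counter(), Counter(), set()
    for seq in sequences:
        padded = ["<s>"] + list(seq) + ["</s>"]
        vocab.update(padded)
        for prev, cur in zip(padded, padded[1:]):
            bigrams[(prev, cur)] += 1
            contexts[prev] += 1
    return bigrams, contexts, len(vocab)


def log_prob(labels, model):
    """Add-one smoothed bigram log-probability of a label sequence."""
    bigrams, contexts, vocab_size = model
    padded = ["<s>"] + list(labels) + ["</s>"]
    return sum(
        math.log((bigrams[(p, c)] + 1) / (contexts[p] + vocab_size))
        for p, c in zip(padded, padded[1:])
    )


def linearise(node, model):
    """Order a head and its dependents by the best-scoring label sequence.

    node is (word, [(label, child_node), ...]); the result is a word list.
    Each subtree is linearised recursively and then treated as one unit.
    """
    word, deps = node
    units = [("HEAD", [word])]
    units += [(label, linearise(child, model)) for label, child in deps]
    best = max(
        itertools.permutations(units),
        key=lambda perm: log_prob([label for label, _ in perm], model),
    )
    return [w for _, words in best for w in words]


# Hypothetical training data: label orders observed at local tree levels.
toy_sequences = (
    [["SUBJ", "HEAD", "OBJ"]] * 3
    + [["DET", "HEAD"]] * 3
    + [["DET", "ADJ", "HEAD"]]
)
model = train_bigrams(toy_sequences)

# An unordered dependency tree: dependents are listed in arbitrary order.
tree = (
    "chased",
    [
        ("OBJ", ("cat", [("DET", ("a", []))])),
        ("SUBJ", ("dog", [("DET", ("the", []))])),
    ],
)
print(" ".join(linearise(tree, model)))
```

Scoring label sequences at each local level keeps the search space small, because only the head and its immediate dependents are permuted at any one time. Exhaustive permutation is affordable only for small dependent sets, so a larger system would presumably need a more economical search.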

Type: Articles
Information: Natural Language Engineering, Volume 17, Issue 4, October 2011, pp. 455–483

DOI: https://doi.org/10.1017/S1351324910000288
Copyright: Copyright © Cambridge University Press 2010
