Hostname: page-component-745bb68f8f-lrblm Total loading time: 0 Render date: 2025-01-09T22:11:52.874Z Has data issue: false hasContentIssue false

A framework for enriching lexical semantic resources with distributional semantics

Published online by Cambridge University Press:  15 January 2018

CHRIS BIEMANN
Affiliation:
Language Technology Group, Department of Informatics, Faculty of Mathematics, Informatics, and Natural Sciences, Universität Hamburg, Germany e-mails: [email protected], [email protected]
STEFANO FARALLI
Affiliation:
Data and Web Science Group, School of Business Informatics and Mathematics, Universität Mannheim, Germany e-mails: [email protected], [email protected]
ALEXANDER PANCHENKO
Affiliation:
Language Technology Group, Department of Informatics, Faculty of Mathematics, Informatics, and Natural Sciences, Universität Hamburg, Germany e-mails: [email protected], [email protected]
SIMONE PAOLO PONZETTO
Affiliation:
Data and Web Science Group, School of Business Informatics and Mathematics, Universität Mannheim, Germany e-mails: [email protected], [email protected]

Abstract

We present an approach to combining distributional semantic representations induced from text corpora with manually constructed lexical semantic networks. While both kinds of semantic resources are available with high lexical coverage, our aligned resource combines the domain specificity and availability of contextual information from distributional models with the conciseness and high quality of manually crafted lexical networks. We start with a distributional representation of induced senses of vocabulary terms, which are accompanied with rich context information given by related lexical items. We then automatically disambiguate such representations to obtain a full-fledged proto-conceptualization, i.e. a typed graph of induced word senses. In a final step, this proto-conceptualization is aligned to a lexical ontology, resulting in a hybrid aligned resource. Moreover, unmapped induced senses are associated with a semantic type in order to connect them to the core resource. Manual evaluations against ground-truth judgments for different stages of our method as well as an extrinsic evaluation on a knowledge-based Word Sense Disambiguation benchmark all indicate the high quality of the new hybrid resource. Additionally, we show the benefits of enriching top-down lexical knowledge resources with bottom-up distributional information from text for addressing high-end knowledge acquisition tasks such as cleaning hypernym graphs and learning taxonomies from scratch.

Type
Articles
Copyright
Copyright © Cambridge University Press 2018 

Access options

Get access to the full version of this content by using one of the access options below. (Log in options will check for institutional or personal access. Content may require purchase if you do not have access.)

References

Agirre, E. and Soroa, A. 2007. Semeval-2007 Task 02: evaluating word sense induction and discrimination systems. In Proceedings of the 4th International Workshop on Semantic Evaluations (SemEval'07), Prague, Czech Republic.Google Scholar
Aprosio, A. P., Giuliano, C. and Lavelli, A. 2013. Extending the coverage of DBpedia properties using distant supervision over Wikipedia. In Proceedings of the 2013 International Conference on NLP and DBpedia – Volume 1064 (NLP-DBPEDIA'13), Sydney, Australia.Google Scholar
Banko, M., Cafarella, M. J., Soderland, S., Broadhead, M. and Etzioni, O. 2007. Open information extraction from the web. In Proceedings of the 20th International Joint Conference on Artificial Intelligence (IJCAI'07), Hyderabad, India.Google Scholar
Baroni, M., Bernardi, R., Do, N.-Q. and Shan, C.-C. 2012. Entailment above the word level in distributional semantics. In Proceedings of the 13th Conference of the European Chapter of the Association for Computational Linguistics (EACL'12), Avignon, France.Google Scholar
Baroni, M., Dinu, G. and Kruszewski, G. 2014. Don't count, predict! A systematic comparison of context-counting vs. context-predicting semantic vectors. In Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics (ACL'14), Baltimore, MD, USA.CrossRefGoogle Scholar
Bartunov, S., Kondrashkin, D., Osokin, A. and Vetrov, D. P. 2016. Breaking sticks and ambiguities with adaptive skip-gram. In Proceedings of the 19th International Conference on Artificial Intelligence and Statistics (AISTATS'16), Cadiz, Spain.Google Scholar
Biemann, C. 2005. Ontology learning from text: a survey of methods. LDV Forum 20 (2): 7593.Google Scholar
Biemann, C. 2006. Chinese whispers: an efficient graph clustering algorithm and its application to natural language processing problems. In Proceedings of the First Workshop on Graph Based Methods for Natural Language Processing, TextGraphs-1, Brooklyn, NY, USA.Google Scholar
Biemann, C. and Riedl, M. 2013. Text: now in 2D! A framework for lexical expansion with contextual similarity. Journal of Language Modelling 1 (1): 5595.Google Scholar
Bizer, C., Lehmann, J., Kobilarov, G., Auer, S., Becker, C., Cyganiak, R., and Hellmann, S. 2009. DBpedia – a crystallization point for the web of data. Journal of Web Semantics 7 (3): 154–65.CrossRefGoogle Scholar
Bordea, G., Buitelaar, P., Faralli, S. and Navigli, R. 2015. SemEval-2015 task 17: taxonomy extraction evaluation (TExEval). In Proceedings of the 9th International Workshop on Semantic Evaluation (SemEval@NAACL-HLT'15), Denver, CO, USA.Google Scholar
Bordea, G., Lefever, E. and Buitelaar, P. 2016. SemEval-2016 task 13: taxonomy extraction evaluation (TExEval-2). In Proceedings of the 10th International Workshop on Semantic Evaluation (SemEval@NAACL-HLT'16), San Diego, CA, USA.Google Scholar
Bordes, A., Weston, J., Collobert, R. and Bengio, Y. 2011. Learning structured embeddings of knowledge bases. In Proceedings of the 25th AAAI Conference on Artificial Intelligence (AAAI'11), San Francisco, CA, USA.Google Scholar
Brugmann, K. 1904. Kurze vergleichende Grammatik der indogermanischen Sprachen. Strassburg, France: Karl J. Trubner.Google Scholar
Bryl, V. and Bizer, C. 2014. Learning conflict resolution strategies for cross-language Wikipedia data fusion. In Proceedings of the 23rd International Conference on World Wide Web (WWW'14 Companion), Seoul, South Korea.CrossRefGoogle Scholar
Camacho-Collados, J., Pilehvar, M. T., and Navigli, R. 2015a. A unified multilingual semantic representation of concepts. In Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing of the Asian Federation of Natural Language Processing, Volume 1: Long Papers (ACL'15), Beijing, China.Google Scholar
Camacho-Collados, J., Pilehvar, M. T., and Navigli, R. 2015b. NASARI: a novel approach to a semantically aware representation of items. In Proceedings of the 2015 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (NAACL-HLT'15), Denver, CO, USA.CrossRefGoogle Scholar
Carlson, A., Betteridge, J., Kisiel, R., Settles, B., Hruschka, E. R. Jr., and Mitchell, T. M. 2010. Toward an architecture for never-ending language learning. In Proceedings of the 24th AAAI Conference on Artificial Intelligence (AAAI'10), Atlanta, GA, USA.Google Scholar
Cecchini, F. M., Riedl, M. and Biemann, C. 2017. Using pseudowords for algorithm comparison: an evaluation framework for graph-based word sense induction. In Proceedings of the 21st Nordic Conference on Computational Linguistics (NODALIDA'17), Gothenburg, Sweden.Google Scholar
Chen, X., Liu, Z. and Sun, M. 2014. A unified model for word sense representation and disambiguation. In Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing, A meeting of SIGDAT, a Special Interest Group of the ACL (EMNLP'14), Doha, Qatar.Google Scholar
Chiarcos, C., Hellmann, S. and Nordhoff, S. 2012. Linking linguistic resources: examples from the open linguistics working group. In Chiarcos, C., Nordhoff, S., and Hellmann, S. (eds.), Linked Data in Linguistics – Representing and Connecting Language Data and Language Metadata, pp. 201–16. Heidelberg, Germany: Springer.Google Scholar
Clark, S. 2015. Vector space models of lexical meaning. In Lappin, S., and Fox, C. (eds.), Handbook of Contemporary Semantics, 2nd edition, pp. 493522. New York, NY, USA: Wiley-Blackwell.Google Scholar
Cleuziou, G. and Moreno, J. G. 2016. QASSIT at SemEval-2016 Task 13: on the integration of semantic vectors in pretopological spaces for lexical taxonomy acquisition. In Proceedings of the 10th International Workshop on Semantic Evaluation (SemEval@NAACL-HLT'16), San Diego, CA, USA.Google Scholar
Cuadros, M. and Rigau, G. 2007. SemEval-2007 task 16: evaluation of wide coverage knowledge resources. In Proceedings of the 4th International Workshop on Semantic Evaluations (SemEval'07), Prague, Czech Republic.Google Scholar
Cuadros, M. and Rigau, G. 2008. KnowNet: building a large net of knowledge from the web. In Proceedings of the 22nd International Conference on Computational Linguistics – Volume 1 (COLING'08), Manchester, UK.Google Scholar
Curran, J. R. 2002. Ensemble methods for automatic thesaurus extraction. In Proceedings of the ACL-02 Conference on Empirical Methods in Natural Language Processing – Volume 10 (EMNLP'02), Philadelphia, PA, USA.Google Scholar
De Marneffe, M.-C., MacCartney, B., and Manning, C. D. 2006. Generating typed dependency parses from phrase structure parses. In Proceedings of the 5th International Conference on Language Resources and Evaluation (LREC'06), Genoa, Italy.Google Scholar
Di Marco, A., and Navigli, R. 2013. Clustering and diversifying web search results with graph-based word sense induction. Computational Linguistics 39 (3): 709–54.Google Scholar
Evert, S. 2005. The Statistics of Word Cooccurrences: Word Pairs and Collocations. Ph.D. thesis. Institut für maschinelle Sprachverarbeitung, University of Stuttgart, Germany.Google Scholar
Fader, A., Soderland, S. and Etzioni, O. 2011. Identifying relations for open information extraction. In Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP'11), Edinburgh, UK.Google Scholar
Faralli, S. and Navigli, R. 2012. A new minimally supervised framework for domain word sense disambiguation. In Proceedings of the 2012 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning (EMNLP-CoNLL'12), Jeju Island, South Korea.Google Scholar
Faralli, S. and Navigli, R. 2013. Growing multi-domain glossaries from a few seeds using probabilistic topic models. In Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing, A meeting of SIGDAT, a Special Interest Group of the ACL (EMNLP'13), Seattle, WA, USA.Google Scholar
Faralli, S., Panchenko, A., Biemann, C. and Paolo Ponzetto, S. 2017. The contrast medium algorithm: taxonomy induction from noisy knowledge graphs with just a few links. In Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics, Volume 1: Long Papers (EACL'17), Valencia, Spain.CrossRefGoogle Scholar
Faralli, S., Stilo, G. and Velardi, P. 2015. Large scale homophily analysis in twitter using a twixonomy. In Proceedings of the 24th International Joint Conference on Artificial Intelligence (IJCAI'15), Buenos Aires, Argentina.Google Scholar
Faruqui, M., Dodge, J., Jauhar, S. K., Dyer, C., Hovy, E. H., and Smith, N. A. 2015. Retrofitting word vectors to semantic lexicons. In Proceedings of the 2015 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (NAACL-HLT'15), Denver, CO, USA.Google Scholar
Faruqui, M. and Kumar, S. 2015. Multilingual open relation extraction using cross-lingual projection. In Proceedings of the 2015 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (NAACL-HLT'15), Denver, CO, USA.Google Scholar
Fellbaum, C. (ed.) 1998. WordNet: An Electronic Database. Cambridge, MA: MIT Press.Google Scholar
Fleiss, J. L. 1971. Measuring nominal scale agreement among many raters. Psychological Bulletin 76 (5): 378.CrossRefGoogle Scholar
Gangemi, A., Guarino, N., Masolo, C., Oltramari, A., and Schneider, L. 2002. Sweetening ontologies with DOLCE. In Knowledge Engineering and Knowledge Management: Ontologies and the Semantic Web: 13th International Conference (EKAW 2002), Sigüenza, Spain, Berlin, Heidelberg.Google Scholar
Ganitkevitch, J., Van Durme, B., and Callison-Burch, C. 2013. PPDB: the paraphrase database. In Proceedings of the 2013 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (NAACL-HLT'13), Atlanta, GA, USA.Google Scholar
Gil-Lafuente, A. M., and Aluja, J. G. 2012. Towards an Advanced Modelling of Complex Economic Phenomena: Pretopological and Topological Uncertainty Research Tools. Berlin, Heidelberg: Springer.Google Scholar
Glavaš, G. and Ponzetto, S.-P. 2017. Dual tensor model for detecting asymmetric lexico-semantic relations. In Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing (EMNLP'17), Copenhagen, Denmark.Google Scholar
Goikoetxea, J., Soroa, A. and Agirre, E. 2015. Random walks and neural network language models on knowledge bases. In Proceedings of the 2015 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (NAACL-HLT'15), Denver, CO, USA.Google Scholar
Gurevych, I., Eckle-Kohler, J., Hartmann, S., Matuschek, M., Meyer, C. M., and Wirth, C. 2012. UBY – a large-scale unified lexical-semantic resource based on LMF. In Proceedings of the 13th Conference of the European Chapter of the Association for Computational Linguistics (EACL'12), Avignon, France.Google Scholar
Hearst, M. A. 1992. Automatic acquisition of hyponyms from large text corpora. In Proceedings of the 14th Conference on Computational Linguistics – Volume 2 (COLING'92), Nantes, France.Google Scholar
Hoffart, J., Suchanek, F. M., Berberich, K. and Weikum, G. 2013. YAGO2: a spatially and temporally enhanced knowledge base from Wikipedia. Artificial Intelligence 194: 2861.Google Scholar
Hovy, E., Navigli, R. and Ponzetto, S.-P. 2013. Collaboratively built semi-structured content and artificial intelligence: the story so far. Artificial Intelligence 194: 227.Google Scholar
Jauhar, S. K., Dyer, C. and Hovy, E. 2015. Ontologically grounded multi-sense representation learning for semantic vector space models. In Proceedings of the 2015 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (NAACL-HLT'15), Denver, CO, USA.Google Scholar
Jenatton, R., Roux, N. L., Bordes, A. and Obozinski, G. 2012. A latent factor model for highly multi-relational data. In Proceedings of Advances in Neural Information Processing Systems 25: 26th Annual Conference on Neural Information Processing Systems (NIPS'12), Lake Tahoe, NV, USA.Google Scholar
Jurgens, D. and Pilehvar, M. T. 2016. SemEval-2016 task 14: semantic taxonomy enrichment. In Proceedings of the 10th International Workshop on Semantic Evaluation (SemEval@NAACL-HLT'16), San Diego, CA, USA.Google Scholar
Kapanipathi, P., Jain, P., Venkataramani, C. and Sheth, A. 2014. User interests identification on Twitter using a hierarchical knowledge base. In Proceedings of the Semantic Web: Trends and Challenges: 11th International Conference (ESWC'14), Cham.Google Scholar
Klein, D. and Manning, C. D. 2003. Accurate unlexicalized parsing. In Proceedings of the 41st Annual Meeting of the Association for Computational Linguistics – Volume 1 (ACL'03), Sapporo, Japan.Google Scholar
Klein, P., Ponzetto, S. P. and Glavaš, G. 2017. Improving neural knowledge base completion with cross-lingual projections. In Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics: Volume 2, Short Papers (EACL'17), Valencia, Spain.Google Scholar
Kozareva, Z. and Hovy, E. 2010. A semi-supervised method to learn and construct taxonomies using the web. In Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing (EMNLP'10), Cambridge, MA, USA.Google Scholar
Krause, S., Hennig, L., Gabryszak, A., Xu, F., and Uszkoreit, H. 2015. Sar-graphs: a linked linguistic knowledge resource connecting facts with language. In Proceedings of the 4th Workshop on Linked Data in Linguistics: Resources and Applications, Beijing, China.CrossRefGoogle Scholar
Lesk, M. 1986. Automatic sense disambiguation using machine readable dictionaries: how to tell a pine cone from an ice cream cone. In Proceedings of the 5th Annual International Conference on Systems Documentation (SIGDOC'86), Toronto, Ontario, Canada.Google Scholar
Levy, O. and Goldberg, Y. 2014. Dependency-based word embeddings. In Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics (ACL'14), Baltimore, MD, USA.CrossRefGoogle Scholar
Li, J. and Jurafsky, D. 2015. Do multi-sense embeddings improve natural language understanding? In Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing (EMNLP'15), Lisbon, Portugal.Google Scholar
Lin, D. 1998. Automatic retrieval and clustering of similar words. In Proceedings of the 17th International Conference on Computational Linguistics – Volume 2 (COLING'98), Montreal, Quebec, Canada.Google Scholar
Lin, T., Pantel, P., Gamon, M., Kannan, A., and Fuxman, A. 2012. Active objects: actions for entity-centric search. In Proceedings of the 21st International Conference on World Wide Web (WWW'12), Lyon, France.Google Scholar
Maitra, P. and Das, D. 2016. JUNLP at SemEval-2016 Task 13: a language independent approach for hypernym identification. In Proceedings of the 10th International Workshop on Semantic Evaluation (SemEval@NAACL-HLT'16), San Diego, CA, USA.Google Scholar
McCrae, J. P., Fellbaum, C., and Cimiano, P. 2014. Publishing and linking WordNet using lemon and RDF. In Proceedings of the 3rd Workshop on Linked Data in Linguistics, Reykjavik, Iceland.Google Scholar
Mihalcea, R., Chklovski, T. and Kilgarriff, A. 2004. The Senseval-3 English lexical sample task. In Proceedings of the 3rd International Workshop on the Evaluation of Systems for the Semantic Analysis of Text (Senseval-3), Barcelona, Spain.Google Scholar
Mihalcea, R. and Csomai, A. 2007. Wikify!: linking documents to encyclopedic knowledge. In Proceedings of the 16th ACM Conference on Information and Knowledge Management (CIKM'07), Lisbon, Portugal.Google Scholar
Mihalcea, R. and Moldovan, D. I. 2001. eXtended WordNet: progress report. In Proceedings of NAACL Workshop on WordNet and Other Lexical Resources, Pittsburgh, PA, USA.Google Scholar
Mikolov, T., Sutskever, I., Chen, K., Corrado, G. S., and Dean, J. 2013. Distributed representations of words and phrases and their compositionality. In Proceedings of the Advances in Neural Information Processing Systems 26: 27th Annual Conference on Neural Information Processing Systems (NIPS'13), Lake Tahoe, NV, USA.Google Scholar
Mikolov, T., Yih, W.-T. and Zweig, G. 2013. Linguistic regularities in continuous space word representations. In Proceedings of the 2013 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (NAACL-HLT'15), Atlanta, GA, USA.Google Scholar
Miller, G. A., Leacock, C., Tengi, R. and Bunker, R. T. 1993. A semantic concordance. In Proceedings of the Workshop on Human Language Technology, Princeton, NJ, USA.Google Scholar
Mintz, M., Bills, S., Snow, R. and Jurafsky, D. 2009. Distant supervision for relation extraction without labeled data. In Proceedings of the Joint Conference of the 47th Annual Meeting of the ACL and the 4th International Joint Conference on Natural Language Processing of the AFNLP: Vol. 2 (ACL'09), Suntec, Singapore.Google Scholar
Nastase, V. and Strube, M. 2013. Transforming Wikipedia into a large scale multilingual concept network. Artificial Intelligence 194: 6285.Google Scholar
Navigli, R. 2009. Word sense disambiguation: a survey. ACM CSUR 41 (2): 169.Google Scholar
Navigli, R. and Ponzetto, S. P. 2012a. BabelNet: the automatic construction, evaluation and application of a wide-coverage multilingual semantic network. Artificial Intelligence 193: 217–50.Google Scholar
Navigli, R. and Ponzetto, S. P. 2012b. Joining forces pays off: multilingual joint Word Sense Disambiguation. In Proceedings of the 2012 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning (EMNLP-CoNLL'12), Jeju Island, South Korea.Google Scholar
Navigli, R. and Velardi, P. 2010. Learning Word-class lattices for definition and hypernym extraction. In Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics (ACL'10), Uppsala, Sweden.Google Scholar
Neelakantan, A., Shankar, J., Passos, A. and McCallum, A. 2014. Efficient non-parametric estimation of multiple embeddings per word in vector space. In Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing (EMNLP'14), Doha, Qatar.Google Scholar
Nickel, M., Murphy, K., Tresp, V. and Gabrilovich, E. 2016. A review of relational machine learning for knowledge graphs. Proceedings of the IEEE 104 (1): 1133.Google Scholar
Niemann, E. and Gurevych, I. 2011. The people's web meets linguistic knowledge: automatic sense alignment of Wikipedia and wordnet. In Proceedings of the Ninth International Conference on Computational Semantics (IWCS'11), Oxford, UK.Google Scholar
Nieto Piña, L., and Johansson, R. 2016. Embedding senses for efficient graph-based word sense disambiguation. In Proceedings of the 2016 Workshop on Graph-based Methods for Natural Language Processing (Textgraphs'16), San Diego, CA, USA.Google Scholar
Norvig, P. 2016. The semantic web and the semantics of the web: where does meaning come from? In Proceedings of the 25th International Conference on World Wide Web (WWW'16), Montreal, Quebec, Canada.Google Scholar
Padó, S. and Lapata, M. 2007. Dependency-based construction of semantic space models. Computational Linguistics 33 (2): 161–99.Google Scholar
Panchenko, A. 2016. Best of both worlds: making word sense embeddings interpretable. In Proceedings of the 10th International Conference on Language Resources and Evaluation (LREC'16), Portoro , Slovenia.Google Scholar
Panchenko, A., Faralli, S., Ponzetto, S. P. and Biemann, C. 2017a. Using linked disambiguated distributional networks for word sense disambiguation. In Proceedings of the 1st EACL Workshop on Sense, Concept and Entity Representations and their Applications, Valencia, Spain.Google Scholar
Panchenko, A., Faralli, S., Ruppert, E., Remus, S., Naets, H., Fairon, C., Ponzetto, S. P., and Biemann, C. 2016. TAXI at SemEval-2016 task 13: a taxonomy induction method based on lexico-syntactic patterns, substrings and focused crawling. In Proceedings of the 10th International Workshop on Semantic Evaluation (SemEval@NAACL-HLT'16), San Diego, CA, USA.Google Scholar
Panchenko, A., Marten, F., Ruppert, E., Faralli, S., Ustalov, D., Ponzetto, S. P., and Biemann, C. 2017b. Unsupervised, knowledge-free, and interpretable word sense disambiguation. In Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing: System Demonstrations (EMNLP'17), Copenhagen, Denmark.Google Scholar
Panchenko, A. and Morozova, O. 2012. A study of hybrid similarity measures for semantic relation extraction. In Proceedings of the Workshop on Innovative Hybrid Approaches to the Processing of Textual Data, Avignon, France.Google Scholar
Panchenko, A., Romanov, P., Morozova, O., Naets, H., Philippovich, A., Romanov, A., and Fairon, C. 2013. Serelex: search and visualization of semantically related words. In Proceedings of the European Conference on Information Retrieval (ECIR'13), Moscow, Russia.Google Scholar
Panchenko, A., Ruppert, E., Faralli, S., Ponzetto, S. P., and Biemann, C. 2017c. Unsupervised does not mean uninterpretable: the case for word sense induction and disambiguation. In Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics (EACL'17), Valencia, Spain.Google Scholar
Panchenko, A., Simon, J., Riedl, M. and Biemann, C. 2016. Noun sense induction and disambiguation using graph-based distributional semantics. In Proceedings of the 13th Conference on Natural Language Processing (KONVENS'16), Bochum, Germany.Google Scholar
Parker, R., Graff, D., Kong, J., Chen, K., and Maeda, K. 2011. English Gigaword, 5th ed. Philadelphia: Linguistic Data Consortium.Google Scholar
Pavel, S. and Euzenat, J. 2013. Ontology matching: state of the art and future challenges. IEEE Transaction on Knowledge and Data Engineering 25 (1): 158–76.Google Scholar
Pedersen, T., Patwardhan, S. and Michelizzi, J. 2004. WordNet::similarity – measuring the relatedness of concepts. In Proceedings of the HLT-NAACL 2004: Demonstration Papers (HLT-NAACL'04 Demos), Boston, MA, USA.Google Scholar
Pelevina, M., Arefiev, N., Biemann, C. and Panchenko, A. 2016. Making sense of word embeddings. In Proceedings of the 1st Workshop on Representation Learning for NLP, Berlin, Germany.Google Scholar
Pennington, J., Socher, R. and Manning, C. 2014. GloVe: global vectors for word representation. In Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing (EMNLP'14), Doha, Qatar.Google Scholar
Pham, N. T., Lazaridou, A. and Baroni, M. 2015. A multitask objective to inject lexical contrast into distributional semantics. In Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing of the Asian Federation of Natural Language Processing, Volume 2: Short Papers (ACL'15), Beijing, China.Google Scholar
Pocostales, J. 2016. NUIG-UNLP at SemEval-2016 task 13: a simple word embedding-based approach for taxonomy extraction. In Proceedings of the 10th International Workshop on Semantic Evaluation (SemEval@NAACL-HLT'16), San Diego, CA, USA.Google Scholar
Ponzetto, S. P. and Navigli, R. 2009. Large-scale taxonomy mapping for restructuring and integrating Wikipedia. In Proceedings of the 21st International Joint Conference on Artificial Intelligence (IJCAI'09), Pasadena, CA, USA.Google Scholar
Ponzetto, S. P. and Navigli, R. 2010. Knowledge-rich word sense disambiguation rivaling supervised systems. In Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics (ACL'10), Uppsala, Sweden.Google Scholar
Ponzetto, S. P. and Strube, M. 2011. Taxonomy induction based on a collaboratively built knowledge repository. Artificial Intelligence 175: 1737–56.Google Scholar
Pradhan, S. S., Loper, E., Dligach, D. and Palmer, M. 2007. SemEval-2007 task 17: english lexical sample, SRL and all words. In Proceedings of the 4th International Workshop on Semantic Evaluations (SemEval'07), Prague, Czech Republic.Google Scholar
Reisinger, J. and Mooney, R. J. 2010. Multi-prototype vector-space models of word meaning. In Proceedings of the Human Language Technologies: The 2010 Annual Conference of the North American Chapter of the Association for Computational Linguistics (HLT'10), Los Angeles, CA, USA.Google Scholar
Richter, M., Quasthoff, U., Hallsteinsdóttir, E., and Biemann, C. 2006. Exploiting the Leipzig corpora collection. In Proceedings of IS-LTC'06, Ljubljana, Slovenia.Google Scholar
Riedl, M. 2016. Unsupervised Methods for Learning and Using Semantics of Natural Language. Ph.D. thesis. Germany: TU Darmstadt. http://tuprints.ulb.tu-darmstadt.de/5435/.Google Scholar
Riedl, M. and Biemann, C. 2013. Scaling to large3 data: an efficient and effective method to compute distributional thesauri. In Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing, a Meeting of SIGDAT, a Special Interest Group of the ACL (EMNLP'13), Seattle, WA, USA.Google Scholar
Riedl, M. and Biemann, C. 2015. A single word is not enough: ranking multiword expressions using distributional semantics. In Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing (EMNLP'15), Lisbon, Portugal.Google Scholar
Roller, S., Erk, K. and Boleda, G. 2014. Inclusive yet selective: supervised distributional hypernymy detection. In Proceedings of the 25th International Conference on Computational Linguistics: Technical Papers (COLING'14), Dublin, Ireland.Google Scholar
Rospocher, M., van Erp, M., Vossen, P., Fokkens, A., Aldabe, I., Rigau, G., Soroa, A., Ploeger, T., and Bogaard, T. 2016. Building event-centric knowledge graphs from news. Journal of Web Semantics 37–38: 132–51.CrossRefGoogle Scholar
Rothe, S. and Schütze, H. 2015. AutoExtend: extending word embeddings to embeddings for synsets and lexemes. In Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing of the Asian Federation of Natural Language Processing, Volume 1: Long Papers (ACL'15), Beijing, China.Google Scholar
Ruppert, E., Kaufmann, M., Riedl, M. and Biemann, C. 2015. JoBimViz: a web-based visualization for graph-based distributional semantic models. In Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing of the Asian Federation of Natural Language Processing, System Demonstrations (ACL'15), Beijing, China.Google Scholar
Schuhmacher, M., Dietz, L. and Ponzetto, S. P. 2015. Ranking entities for web queries through text and knowledge. In Proceedings of the 24th ACM International on Conference on Information and Knowledge Management (CIKM'15), Melbourne, Australia.Google Scholar
Shwartz, V., Goldberg, Y. and Dagan, I. 2016. Improving hypernymy detection with an integrated path-based and distributional method. In Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics, Volume 1: Long Papers (ACL'16), Berlin, Germany.Google Scholar
Snow, R., Jurafsky, D. and Ng, A. 2006. Semantic taxonomy induction from heterogenous evidence. In Proceedings of the 21st International Conference on Computational Linguistics and the 44th Annual Meeting of the Association for Computational Linguistics (ACL'06), Sydney, Australia.Google Scholar
Snow, R., Jurafsky, D. and Ng, A. Y. 2004. Learning syntactic patterns for automatic hypernym discovery. In Proceedings of the Advances in Neural Information Processing Systems 17 (NIPS'04), Vancouver, BC, Canada.Google Scholar
Socher, R., Chen, D., Manning, C. D. and Ng, A. 2013. Reasoning with neural tensor networks for knowledge base completion. In Proceedings of the Advances in Neural Information Processing Systems 26: 27th Annual Conference on Neural Information Processing Systems (NIPS'13), Lake Tahoe, NV, USA.Google Scholar
Suchanek, F. M., Kasneci, G. and Weikum, G. 2008. YAGO: a large ontology from Wikipedia and WordNet. Journal of Web Semantics 6 (3): 203–17.Google Scholar
Tan, L., Bond, F. and van Genabith, J. 2016. USAAR at SemEval-2016 task 13: hyponym endocentricity. In Proceedings of the 10th International Workshop on Semantic Evaluation (SemEval@NAACL-HLT'16), San Diego, CA, USA.Google Scholar
Tarjan, R. 1972. Depth-first search and linear graph algorithms. SIAM Journal on Computing 1 (2): 146–60.CrossRefGoogle Scholar
Turney, P. D. and Pantel, P. 2010. From frequency to meaning: vector space models of semantics. Journal of Artificial Intelligence Research 37: 141–88.Google Scholar
Van de Cruys, T. 2010. Mining for meaning: the extraction of lexico-semantic knowledge from text. Groningen Dissertations in Linguistics 82.Google Scholar
Van Dongen, S. 2008. Graph clustering via a discrete uncoupling process. SIAM Journal on Matrix Analysis and Applications 30 (1): 121–41.Google Scholar
Velardi, P., Faralli, S. and Navigli, R. 2013. OntoLearn reloaded: a graph-based algorithm for taxonomy induction. Computational Linguistics 39 (3): 665707.Google Scholar
Velardi, P., Navigli, R., Faralli, S. and Ruiz-Martínez, J. M. 2012. A new method for evaluating automatically learned terminological taxonomies. In Proceedings of the Eighth International Conference on Language Resources and Evaluation (LREC'12), Istanbul, Turkey.Google Scholar
Véronis, J. 2004. HyperLex: lexical cartography for information retrieval. Computer Speech and Language 18: 223–52.Google Scholar
Wang, Z., Li, J., Wang, Z. and Tang, J. 2012. Cross-lingual knowledge linking across Wiki knowledge bases. In Proceedings of the 21st International Conference on World Wide Web (WWW'12), Lyon, France.CrossRefGoogle Scholar
Weeds, J., Weir, D. and McCarthy, D. 2004. Characterising measures of lexical distributional similarity. In Proceedings of the 20th International Conference on Computational Linguistics (COLING'04), Geneva, Switzerland.Google Scholar
West, R., Gabrilovich, E., Murphy, K., Sun, S., Gupta, R., and Lin, D. 2014. Knowledge base completion via search-based question answering. In Proceedings of the 23rd International Conference on World Wide Web (WWW'14), Seoul, South Korea.Google Scholar
Wu, W., Li, H., Wang, H. and Zhu, K. Q. 2012. Probase: a probabilistic taxonomy for text understanding. In Proceedings of the 2012 ACM SIGMOD International Conference on Management of Data (SIGMOD'12), Scottsdale, AZ, USA.CrossRefGoogle Scholar
Yu, M. and Dredze, M. 2014. Improving lexical embeddings with semantic knowledge. In Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics, Volume 2: Short Papers (ACL'14), Baltimore, MD, USA.Google Scholar