Hostname: page-component-586b7cd67f-g8jcs Total loading time: 0 Render date: 2024-11-24T23:49:50.208Z Has data issue: false hasContentIssue false

Patent-KG: Patent Knowledge Graph Extraction for Engineering Design

Published online by Cambridge University Press:  26 May 2022

H. Zuo*
Affiliation:
Imperial College London, United Kingdom
Y. Yin
Affiliation:
Imperial College London, United Kingdom
P. Childs
Affiliation:
Imperial College London, United Kingdom

Abstract

Core share and HTML view are not available for this content. However, as you have access to this content, a full PDF is available via the ‘Save PDF’ action button.

This paper builds a patent-based knowledge graph, patent-KG, to represent the knowledge facts in patents for engineering design. The arising patent-KG approach proposes a new unsupervised mechanism to extract knowledge facts in a patent, by searching the attention graph in language models. The extracted entities are compared with other benchmarks in the criteria of recall rate. The result reaches the highest 0.8 recall rate in the standard list of mechanical engineering related technical terms, which means the highest coverage of engineering words.

Type
Article
Creative Commons
Creative Common License - CCCreative Common License - BYCreative Common License - NCCreative Common License - ND
This is an Open Access article, distributed under the terms of the Creative Commons Attribution-NonCommercial-NoDerivatives licence (http://creativecommons.org/licenses/by-nc-nd/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is unaltered and is properly cited. The written permission of Cambridge University Press must be obtained for commercial re-use or in order to create a derivative work.
Copyright
The Author(s), 2022.

References

Andersson, L., Lupu, M., Palotti, J., Hanbury, A. & Rauber, A. 2016. When is the Time Ripe for Natural Language Processing for Patent Passage Retrieval? Proceedings of the 25th ACM International on Conference on Information and Knowledge Management. Indianapolis, Indiana, USA: Association for Computing Machinery.Google Scholar
Aristodemou, L. & Tietze, F. 2018. The state-of-the-art on Intellectual Property Analytics (IPA): A literature review on artificial intelligence, machine learning and deep learning methods for analysing intellectual property (IP) data. World Patent Information, 55, 3751.CrossRefGoogle Scholar
Cascini, G., Fantechi, A. & Spinicci, E. Natural language processing of patents and technical documentation. International Workshop on Document Analysis Systems, 2004. Springer, 508520.CrossRefGoogle Scholar
Hao, Chen, Li Yuan, D. Z., Zhengping, Wang, Yang, Xuan, Jidong, Yang. 2019. Automatic high-precision adjustment control system for common electric valve.Google Scholar
Chen, P.-L., Li, S.-C. & Hung, M.-T. Co-occurrence analysis in innovation management: Data processing of an online brainstorming platform. 2013 Proceedings of PICMET'13: Technology Management in the IT-Driven Services (PICMET), 2013. IEEE, 688694.Google Scholar
Clark, K., Khandelwal, U., Levy, O. & Manning, C. D. J. a. E.-P. 2019. What Does BERT Look At? An Analysis of BERT's Attention. Available: https://ui.adsabs.harvard.edu/abs/2019arXiv190604341C [Accessed June 01, 2019].Google Scholar
Devlin, J., Chang, M.-W., Lee, K. & Toutanova, K. J. a. P. A. 2018. Bert: Pre-training of deep bidirectional transformers for language understanding.Google Scholar
Etzioni, O., Banko, M., Soderland, S. & Weld, D. S. J. C. O. T. A. 2008. Open information extraction from the web. 51, 6874.Google Scholar
Office, European Patent. 2021. CPC classification [Online]. 2021. Available: https://www.cooperativepatentclassification.org/cpcConcordances [Accessed].Google Scholar
Fader, A., Soderland, S. & Etzioni, O. Identifying relations for open information extraction. Proceedings of the 2011 conference on empirical methods in natural language processing, 2011. 15351545.Google Scholar
Fu, K., Cagan, J., Kotovsky, K. & Wood, K. 2013. Discovering Structure in Design Databases Through Functional and Surface Based Mapping. Journal of Mechanical Design, 135.Google Scholar
Matthew, Honnibal, I, M.. 2017. spaCy 2: Natural language understanding with Bloom embeddings, convolutional neural networks and incremental parsing.Google Scholar
Hu, M. 2018. Computer aided patent processing: Natural language processing, machine learning, and information retrieval. Ph.D., Drexel University.Google Scholar
Huang, Z., Xu, W. & Yu, K. J. a. P. A. 2015. Bidirectional LSTM-CRF models for sequence tagging.Google Scholar
Li, J., Sun, A., Han, J., Li, C. J. I. T. O. K. & Engineering, D. 2020. A survey on deep learning for named entity recognition.Google Scholar
Luo, L., Yang, Z., Yang, P., Zhang, Y., Wang, L., Lin, H. & Wang, J. J. B. 2018. An attention-based BiLSTM-CRF approach to document-level chemical named entity recognition. 34, 13811388.Google ScholarPubMed
De Marneffe, Marie-Catherine, M, C. D.. 2016. Stanford typed dependencies manual [Online]. Available: https://downloads.cs.stanford.edu/nlp/software/dependencies_manual.pdf [Accessed August 23,2021].Google Scholar
Mccaffrey, T. & Spector, L. J. a. E. 2018. An approach to human–machine collaboration in innovation. 32, 115.Google Scholar
Miller, G. A. 1998. WordNet: An electronic lexical database, MIT press.Google Scholar
Mintz, M., Bills, S., Snow, R. & Jurafsky, D. Distant supervision for relation extraction without labeled data. Proceedings of the Joint Conference of the 47th Annual Meeting of the ACL and the 4th International Joint Conference on Natural Language Processing of the AFNLP: Volume 2-Volume 2, 2009. Association for Computational Linguistics, 10031011.Google Scholar
Radford, A., Wu, J., Child, R., Luan, D., Amodei, D. & Sutskever, I. J. O. B. 2019. Language models are unsupervised multitask learners. 1, 9.Google Scholar
Sarica, S., Luo, J. & Wood, K. L. 2020. TechNet: Technology semantic network based on patent data. Expert Systems with Applications, 142, 112995.CrossRefGoogle Scholar
Schmitz, M., Soderland, S., Bart, R. & Etzioni, O. Open language learning for information extraction. Proceedings of the 2012 joint conference on empirical methods in natural language processing and computational natural language learning, 2012. 523534.Google Scholar
Shi, F. 2018. A Data-Driven Text Mining and Semantic Network Analysis for Design Information Retrieval.Google Scholar
Siddharth, L., Blessing, L. T. M., Wood, K. L. & Luo, J. J. a. E.-P. 2021. Engineering Knowledge Graph from Patent Database. Available: https://ui.adsabs.harvard.edu/abs/2021arXiv210606739S [Accessed June 01, 2021].Google Scholar
Speer, R., Chin, J. & Havasi, C. Conceptnet 5.5: An open multilingual graph of general knowledge. Thirty-first AAAI conference on artificial intelligence, 2017.Google Scholar
Swanson, D. R. J. T. L. Q. 1986. Undiscovered public knowledge. 56, 103118.Google Scholar
Van Noorden, R. 2014. Scientists may be reaching a peak in reading habits. Nature.CrossRefGoogle Scholar
Verberne, S., D'hondt, E., Oostdijk, N. & Koster, C. H. 2010. Quantifying the challenges in parsing patent claims.Google Scholar
Wang, A., Singh, A., Michael, J., Hill, F., Levy, O. & Bowman, S. R. J. a. P. A. 2018. GLUE: A multi-task benchmark and analysis platform for natural language understanding.Google Scholar
Wang, C., Liu, X. & Song, D. 2020. Language Models are Open Knowledge Graphs. Available: https://ui.adsabs.harvard.edu/abs/2020arXiv201011967W [Accessed October 01, 2020].Google Scholar
Wang, Q., Mao, Z., Wang, B., Guo, L. J. I. T. O. K. & Engineering, D. 2017. Knowledge graph embedding: A survey of approaches and applications. 29, 27242743.Google Scholar
World Intellectual Property Organization. 2019. World Intellectual Property Indicators 2019 [Online]. Available: https://www.wipo.int/publications/en/details.jsp?id=4464 [Accessed August 23, 2021].Google Scholar