A Survey on Machine Reading Comprehension Systems

Razieh Baradaran; Razieh Ghiasi; Hossein Amirkhani

doi:10.1017/S1351324921000395

A Survey on Machine Reading Comprehension Systems

Published online by Cambridge University Press: 19 January 2022

Razieh Baradaran

Razieh Ghiasi and

Hossein Amirkhani

Show author details

Razieh Baradaran: Affiliation:
Computer and Information Technology Department, University of Qom, Qom, Iran
Razieh Ghiasi: Affiliation:
Computer and Information Technology Department, University of Qom, Qom, Iran
Hossein Amirkhani*: Affiliation:
Computer and Information Technology Department, University of Qom, Qom, Iran
*: *Corresponding author. E-mail: [email protected]

Article contents

Abstract
Footnotes
References

Get access

Rights & Permissions

Abstract

Machine Reading Comprehension (MRC) is a challenging task and hot topic in Natural Language Processing. The goal of this field is to develop systems for answering the questions regarding a given context. In this paper, we present a comprehensive survey on diverse aspects of MRC systems, including their approaches, structures, input/outputs, and research novelties. We illustrate the recent trends in this field based on a review of 241 papers published during 2016–2020. Our investigation demonstrated that the focus of research has changed in recent years from answer extraction to answer generation, from single- to multi-document reading comprehension, and from learning from scratch to using pre-trained word vectors. Moreover, we discuss the popular datasets and the evaluation metrics in this field. The paper ends with an investigation of the most-cited papers and their contributions.

Keywords

Natural Language Processing question answering Machine Reading Comprehension deep learning literature review

Type: Survey Paper
Information: Natural Language Engineering , Volume 28 , Issue 6 , November 2022 , pp. 683 - 732

DOI: https://doi.org/10.1017/S1351324921000395 [Opens in a new window]
Copyright: © The Author(s), 2022. Published by Cambridge University Press

Access options

Get access to the full version of this content by using one of the access options below. (Log in options will check for institutional or personal access. Content may require purchase if you do not have access.)

Footnotes

Equal contribution

References

Aghaebrahimian, A. (2018). Linguistically-Based Deep Unstructured Question Answering. Proceedings of the 22nd Conference on Computational Natural Language Learning, pp. 433-43.CrossRef Google Scholar

Akour, M., Abufardeh, S., Magel, K. and Al-Radaideh, Q. (2011). QArabPro: A Rule Based Question Answering System for Reading Comprehension Tests in Arabic. American Journal of Applied Sciences 8, 652–661.CrossRef Google Scholar

Amirkhani, H., Jafari, M.A., Amirak, A., Pourjafari, Z., Jahromi, S.F. and Kouhkan, Z. (2020). FarsTail: A Persian Natural Language Inference Dataset. arXiv preprint arXiv:2009.08820.Google Scholar

Andor, D., He, L., Lee, K. and Pitler, E. (2019). Giving BERT a Calculator: Finding Operations and Arguments with Reading Comprehension. Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP), pp. 5947-52. Hong Kong, China.CrossRef Google Scholar

Angelidis, S., Frermann, L., Marcheggiani, D., Blanco, R. and MÀrquez, L. (2019). Book QA: Stories of Challenges and Opportunities. Proceedings of the 2nd Workshop on Machine Reading for Question Answering, pp. 78-85. Hong Kong, China.CrossRef Google Scholar

Anuranjana, K., Rao, V. and Mamidi, R. (2019). HindiRC: A Dataset for Reading Comprehension in Hindi. 0th International Conference on Computational Linguistics and Intelligent Text, La Rochelle, France.Google Scholar

Arivuchelvan, K.M. and Lakahmi, K. (2017). Reading Comprehension System-A Review. Indian Journal of Scientific Research (IJSR) 14, 83–90.Google Scholar

Asai, A. and Hajishirzi, H. (2020). Logic-Guided Data Augmentation and Regularization for Consistent Question Answering. Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, pp. 5642–50.CrossRef Google Scholar

Back, S., Chinthakindi, S.C., Kedia, A., Lee, H. and Choo, J. (2020). NeurQuRI: Neural Question Requirement Inspector for Answerability Prediction in Machine Reading Comprehension. International Conference on Learning Representations.Google Scholar

Back, S., Yu, S., Indurthi, S.R., Kim, J. and Choo, J. (2018). MemoReader: Large-Scale Reading Comprehension through Neural Memory Controller. Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, pp. 2131-40.CrossRef Google Scholar

Bahdanau, D., Cho, K. and Bengio, Y. (2015). Neural machine translation by jointly learning to align and translate. Proceedings of the International Conference on Learning Representations (ICLR).Google Scholar

Bajgar, O., Kadlec, R. and Kleindienst, J. (2017). Embracing data abundance: BookTest Dataset for Reading Comprehension. International Conference on Learning Representations (ICLR) Workshop.Google Scholar

Banerjee, S. and Lavie, A. (2005). METEOR: An automatic metric for MT evaluation with improved correlation with human judgments. Proceedings of the acl workshop on intrinsic and extrinsic evaluation measures for machine translation and/or summarization, pp. 65-72.Google Scholar

Bao, H., Dong, L., Wei, F., Wang, W., Yang, N., Cui, L., Piao, S. and Zhou, M. (2019). Inspecting Unification of Encoding and Matching with Transformer: A Case Study of Machine Reading Comprehension. Proceedings of the 2nd Workshop on Machine Reading for Question Answering, pp. 14-18.CrossRef Google Scholar

Bauer, L., Wang, Y. and Bansal, M. (2018). Commonsense for Generative Multi-Hop Question Answering Tasks. Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, pp. 4220–30CrossRef Google Scholar

Berant, J., Srikumar, V., Chen, P.-C., Vander Linden, A., Harding, B., Huang, B., Clark, P. and Manning, C.D. (2014). Modeling Biological Processes for Reading Comprehension. Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing (EMNLP), pp. 1499-510.CrossRef Google Scholar

Berzak, Y., Malmaud, J. and Levy, R. (2020). STARC: Structured Annotations for Reading Comprehension. Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, pp. 5726–35.CrossRef Google Scholar

Bi, B., Wu, C., Yan, M., Wang, W., Xia, J. and Li, C. (2019). Incorporating External Knowledge into Machine Reading for Generative Question Answering. Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP), pp. 2521-30. Hong Kong, China.CrossRef Google Scholar

Bouziane, A., Bouchiha, D., Doumi, N. and Malki, M. (2015). Question answering systems: survey and trends. Procedia Computer Science 73, 366–375.CrossRef Google Scholar

Cao, Y., Fang, M., Yu, B. and Zhou, J.T. (2020). Unsupervised Domain Adaptation on Reading Comprehension. AAAI, pp. 7480-87.CrossRef Google Scholar

Charlet, D., Damnati, G., BÉchet, F. and Heinecke, J. (2020). Cross-lingual and cross-domain evaluation of Machine Reading Comprehension with Squad and CALOR-Quest corpora. Proceedings of The 12th Language Resources and Evaluation Conference, pp. 5491-97.Google Scholar

Chaturvedi, A., Pandit, O. and Garain, U. (2018). CNN for Text-Based Multiple Choice Question Answering. Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers), pp. 272-77.CrossRef Google Scholar

Chen, A., Stanovsky, G., Singh, S. and Gardner, M. (2019a). Evaluating question answering evaluation. Proceedings of the 2nd Workshop on Machine Reading for Question Answering, pp. 119-24.CrossRef Google Scholar

Chen, D. (2018). Neural Reading Comprehension and Beyond. Stanford University.Google Scholar

Chen, D., Bolton, J. and Manning, C.D. (2016). A Thorough Examination of the Cnn/Daily Mail Reading Comprehension Task. Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics, pp. 2358–67.CrossRef Google Scholar

Chen, D., Fisch, A., Weston, J. and Bordes, A. (2017). Reading wikipedia to answer open-domain questions. Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics, pp. 1870-79.CrossRef Google Scholar

Chen, X., Liang, C., Yu, A.W., Zhou, D., Song, D. and Le, Q.V. (2020). Neural symbolic reader: Scalable integration of distributed and symbolic representations for reading comprehension. International Conference on Learning Representations (ICLR2020).Google Scholar

Chen, Z., Cui, Y., Ma, W., Wang, S. and Hu, G. (2019b). Convolutional spatial attention model for reading comprehension with multiple-choice questions. Proceedings of the AAAI Conference on Artificial Intelligence, pp. 6276-83.CrossRef Google Scholar

Cho, K., Van MerriËnboer, B., Gulcehre, C., Bahdanau, D., Bougares, F., Schwenk, H. and Bengio, Y. (2014). Learning phrase representations using RNN encoder-decoder for statistical machine translation. Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing (EMNLP), pp. 1724–34. Doha, Qatar.CrossRef Google Scholar

Choi, E., Hewlett, D., Lacoste, A., Polosukhin, I., Uszkoreit, J. and Berant, J. (2017a). Hierarchical Question Answering for Long Documents. Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics (ACL), pp. 209-20. Vancouver, Canada.CrossRef Google Scholar

Choi, E., Hewlett, D., Uszkoreit, J., Polosukhin, I., Lacoste, A. and Berant, J. (2017b). Coarse-to-fine question answering for long documents. Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pp. 209-20.CrossRef Google Scholar

Chollet, F. (2017). Deep learning with python. Manning Publications Co.Google Scholar

Clark, C. and Gardner, M. (2018). Simple and effective multi-paragraph reading comprehension. Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics (Long Papers, pp. 845–55CrossRef Google Scholar

Collobert, R. and Weston, J. (2008). A unified architecture for natural language processing: Deep neural networks with multitask learning. Proceedings of the 25th international conference on Machine learning, pp. 160-67. ACM.CrossRef Google Scholar

Cui, L., Huang, S., Wei, F., Tan, C., Duan, C. and Zhou, M. (2017a). Superagent: a customer service chatbot for e-commerce websites. Proceedings of ACL, System Demonstrations, pp. 97-102. Association for Computational Linguistics.CrossRef Google Scholar

Cui, Y., Che, W., Liu, T., Qin, B., Wang, S. and Hu, G. (2019a). Cross-Lingual Machine Reading Comprehension. Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP), pp. 1586-95. Hong Kong, China.CrossRef Google Scholar

Cui, Y., Chen, Z., Wei, S., Wang, S., Liu, T. and Hu, G. (2017b). Attention-over-attention neural networks for reading comprehension. Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics pp. 593–602.CrossRef Google Scholar

Cui, Y., Liu, T., Che, W., Xiao, L., Chen, Z., Ma, W., Wang, S. and Hu, G. (2019b). A Span-Extraction Dataset for Chinese Machine Reading Comprehension. Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP), pp. 5883-89. Hong Kong, China.CrossRef Google Scholar

Cui, Y., Liu, T., Chen, Z., Wang, S. and Hu, G. (2016). Consensus Attention-Based Neural Networks for Chinese Reading Comprehension. Proceedings of the COLING 2016, the 26th International Conference on Computational Linguistics: Technical Papers, pp. 1777–86.Google Scholar

Das, R., Dhuliawala, S., Zaheer, M. and McCallum, A. (2019). Multi-step Retriever-Reader Interaction for Scalable Open-domain Question Answering. International Conference on Learning Representations.Google Scholar

Dasigi, P., Liu, N.F., MarasoviĆ, A., Smith, N.A. and Gardner, M. (2019). Quoref: A Reading Comprehension Dataset with Questions Requiring Coreferential Reasoning. Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP), pp. 5925-32. Hong Kong, China.CrossRef Google Scholar

Dehghani, M., Azarbonyad, H., Kamps, J. and de Rijke, M. (2019). Learning to transform, combine, and reason in open-domain question answering. Proceedings of the Twelfth ACM International Conference on Web Search and Data Mining, pp. 681-89.CrossRef Google Scholar

Deng, L and Liu, Y. (2018). Deep Learning in Natural Language Processing. Springer Singapore.CrossRef Google Scholar

Devlin, J., Chang, M.-W., Lee, K. and Toutanova, K. (2018). Bert: Pre-training of deep bidirectional transformers for language understanding. Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, pp. 4171-86.Google Scholar

Dhingra, B., Liu, H., Yang, Z., Cohen, W.W. and Salakhutdinov, R. (2017). Gated-attention readers for text comprehension. Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics pp. 1832–46.CrossRef Google Scholar

Dhingra, B., Pruthi, D. and Rajagopal, D. (2018). Simple and Effective Semi-Supervised Question Answering. Proceedings of Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (NAACL-HLT 2018), pp. 582–87.CrossRef Google Scholar

Dhingra, B., Zhou, Z., Fitzpatrick, D., Muehl, M. and Cohen, W.W. (2016). Tweet2vec: Character-based distributed representations for social media. Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics, pp. 269–74.CrossRef Google Scholar

Ding, M., Zhou, C., Chen, Q., Yang, H. and Tang, J. (2019). Cognitive Graph for Multi-Hop Reading Comprehension at Scale. Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics, pp. 2694-703. Florence, Italy.CrossRef Google Scholar

Du, X. and Cardie, C. (2018). Harvesting Paragraph-Level Question-Answer Pairs from Wikipedia. Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics (Long Papers), pp. 1907–17.Google Scholar

Dua, D., Singh, S. and Gardner, M. (2020). Benefits of Intermediate Annotations in Reading Comprehension. Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, pp. 5627-34.CrossRef Google Scholar

Dua, D., Wang, Y., Dasigi, P., Stanovsky, G., Singh, S. and Gardner, M. (2019). DROP: A Reading Comprehension Benchmark Requiring Discrete Reasoning Over Paragraphs. Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long and Short Papers), pp. 2368-78. Minneapolis, Minnesota.Google Scholar

Duan, N., Tang, D., Chen, P. and Zhou, M. (2017). Question generation for question answering. Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing, pp. 866-74.CrossRef Google Scholar

Dunietz, J., Burnham, G., Bharadwaj, A., Chu-Carroll, J., Rambow, O. and Ferrucci, D. (2020). To Test Machine Comprehension, Start by Defining Comprehension. Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, pp. 7839–59.CrossRef Google Scholar

Elgohary, A., Zhao, C. and Boyd-Graber, J. (2018). A dataset and baselines for sequential open-domain question answering. Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, pp. 1077-83.CrossRef Google Scholar

Ethayarajh, K. (2019). How contextual are contextualized word representations? Comparing the geometry of BERT, ELMo, and GPT-2 embeddings. Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, pp. 55–65. Hong Kong, China.CrossRef Google Scholar

Fisch, A., Talmor, A., Jia, R., Seo, M., Choi, E. and Chen, D. (2019). MRQA 2019 Shared Task: Evaluating Generalization in Reading Comprehension. Proceedings of the 2nd Workshop on Machine Reading for Question Answering, pp. 1-13. Hong Kong, China.CrossRef Google Scholar

Frermann, L. (2019). Extractive NarrativeQA with Heuristic Pre-Training. Proceedings of the 2nd Workshop on Machine Reading for Question Answering, pp. 172-82.CrossRef Google Scholar

Gardner, M., Berant, J., Hajishirzi, H., Talmor, A. and Min, S. (2019). On Making Reading Comprehension More Comprehensive. Proceedings of the 2nd Workshop on Machine Reading for Question Answering, pp. 105-12.CrossRef Google Scholar

Ghaeini, R., Fern, X.Z., Shahbazi, H. and Tadepalli, P. (2018). Dependent Gated Reading for Cloze-Style Question Answering. Proceedings of the 27th International Conference on Computational Linguistics pp. 3330–45.Google Scholar

Giuseppe, A. (2017). Question Dependent Recurrent Entity Network for Question Answering. NL4AI: 1st Workshop on Natural Language for Artificial Intelligence, pp. 69-80. CEUR.Google Scholar

Golub, D., Huang, P.-S., He, X. and Deng, L. (2017). Two-stage synthesis networks for transfer learning in machine comprehension. Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing pp. 835–44.CrossRef Google Scholar

Gong, Y. and Bowman, S.R. (2018). Ruminating reader: Reasoning with gated multi-hop attention. Proceedings of the Workshop on Machine Reading for Question Answering, pp. 1–11 Association for Computational Linguistics.CrossRef Google Scholar

Greco, C., Suglia, A., Basile, P., Rossiello, G. and Semeraro, G. (2016). Iterative multi-document neural attention for multiple answer prediction. 2016 AI* IA Workshop on Deep Understanding and Reasoning: A Challenge for Next-Generation Intelligent Agents (URANIA).Google Scholar

Guo, S., Li, R., Tan, H., Li, X., Guan, Y., Zhao, H. and Zhang, Y. (2020). A Frame-based Sentence Representation for Machine Reading Comprehension. Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, pp. 891-96.CrossRef Google Scholar

Gupta, D., Lenka, P., Ekbal, A. and Bhattacharyya, P. (2018). Uncovering Code-Mixed Challenges: A Framework for Linguistically Driven Question Generation and Neural based Question Answering. Proceedings of the 22nd Conference on Computational Natural Language Learning, pp. 119-30.CrossRef Google Scholar

Gupta, M., Kulkarni, N., Chanda, R., Rayasam, A. and Lipton, Z.C. (2019). AmazonQA: A Review-Based Question Answering Task. Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence (IJCAI-19), pp. 4996-5002.CrossRef Google Scholar

Gupta, S. and Khade, N. (2020). BERT Based Multilingual Machine Comprehension in English and Hindi. ACM Trans. Asian Low-Resour. Lang. Inf. Process.s 19.Google Scholar

Gupta, S., Rawat, B.P.S. and Yu, H. (2020). Conversational Machine Comprehension: a Literature Review. Proceedings of the 28th International Conference on Computational Linguistics, pp. 2739-53.CrossRef Google Scholar

Hardalov, M., Koychev, I. and Nakov, P. (2019). Beyond English-Only Reading Comprehension: Experiments in Zero-Shot Multilingual Transfer for Bulgarian. Proceedings of Recent Advances in Natural Language Processing, pp. 447-59. Varna, Bulgaria.CrossRef Google Scholar

Hashemi, H., Aliannejadi, M., Zamani, H. and Croft, W.B. (2020). ANTIQUE: A non-factoid question answering benchmark. European Conference on Information Retrieval, pp. 166-73. Springer.CrossRef Google Scholar

He, W., Liu, K., Liu, J., Lyu, Y., Zhao, S., Xiao, X., Liu, Y., Wang, Y., Wu, H. and She, Q. (2018). Dureader: a Chinese Machine Reading Comprehension Dataset from Real-World Applications. Proceedings of the Workshop on Machine Reading for Question Answering, pp. 37–46. Association for Computational Linguistics.CrossRef Google Scholar

Hermann, K.M., Kocisky, T., Grefenstette, E., Espeholt, L., Kay, W., Suleyman, M. and Blunsom, P. (2015). Teaching machines to read and comprehend. Advances in neural information processing systems, pp. 1693-701.Google Scholar

Hewlett, D., Jones, L. and Lacoste, A. (2017). Accurate supervised and semi-supervised machine reading for long documents. Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing, pp. 2011-20.Google Scholar

Hill, F., Bordes, A., Chopra, S. and Weston, J. (2016). The goldilocks principle: Reading children’s books with explicit memory representations. Proceedings of the International Conference on Learning Representations (ICLR).Google Scholar

Hirschman, L., Light, M., Breck, E. and Burger, J.D. (1999). Deep Read: A Reading Comprehension System. Proceedings of the 37th annual meeting of the Association for Computational Linguistics, pp. 325-32.Google Scholar

Hoang, L., Wiseman, S. and Rush, A.M. (2018). Entity Tracking Improves Cloze-style Reading Comprehension. Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, pp. 1049–55CrossRef Google Scholar

Hochreiter, S. and Schmidhuber, J. (1997). Long short-term memory. Neural Computation 9, 1735–1780.CrossRef Google Scholar PubMed

Horbach, A., Aldabe, I., Bexte, M., de Lacalle, O.L. and Maritxalar, M. (2020). Linguistic Appropriateness and Pedagogic Usefulness of Reading Comprehension Questions. Proceedings of The 12th Language Resources and Evaluation Conference, pp. 1753-62.Google Scholar

Htut, P.M., Bowman, S.R. and Cho, K. (2018). Training a Ranking Function for Open-Domain Question Answering. Proceedings of NAACL-HLT 2018: Student Research Workshop, pp. 120–27.Google Scholar

Hu, M., Peng, Y., Huang, Z. and Li, D. (2019a). A Multi-Type Multi-Span Network for Reading Comprehension that Requires Discrete Reasoning. Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP), pp. 1596-606. Hong Kong, China.CrossRef Google Scholar

Hu, M., Peng, Y., Huang, Z. and Li, D. (2019b). Retrieve, Read, Rerank: Towards End-to-End Multi-Document Reading Comprehension. Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics, Florence, Italy.CrossRef Google Scholar

Hu, M., Peng, Y., Huang, Z., Qiu, X., Wei, F. and Zhou, M. (2018a). Reinforced Mnemonic Reader for Machine Reading Comprehension. Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence (IJCAI-18), pp. 4099-106.Google Scholar

Hu, M., Peng, Y., Wei, F., Huang, Z., Li, D., Yang, N. and Zhou, M. (2018b). Attention-Guided Answer Distillation for Machine Reading Comprehension. Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, pp. 4243–52.CrossRef Google Scholar

Hu, M., Wei, F., Peng, Y., Huang, Z., Yang, N. and Li, D. (2019c). Read+ Verify: Machine Reading Comprehension with Unanswerable Questions. Proceedings of the AAAI Conference on Artificial Intelligence, pp. 6529-37.CrossRef Google Scholar

Huang, H.-Y., Zhu, C., Shen, Y. and Chen, W. (2018). Fusionnet: Fusing via fully-aware attention with application to machine comprehension. Proceedings of the International Conference on Learning Representations (ICLR).Google Scholar

Huang, K., Tang, Y., Huang, J., He, X. and Zhou, B. (2019a). Relation Module for Non-Answerable Predictions on Reading Comprehension. Proceedings of the 23rd Conference on Computational Natural Language Learning (CoNLL), pp. 747-56.CrossRef Google Scholar

Huang, L., Le Bras, R., Bhagavatula, C. and Choi, Y. (2019b). Cosmos QA: Machine Reading Comprehension with Contextual Commonsense Reasoning. Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP), pp. 2391-401. Hong Kong, China.CrossRef Google Scholar

Indurthi, S., Yu, S., Back, S. and CuayÁhuitl, H. (2018). Cut to the Chase: A Context Zoom-in Network for Reading Comprehension. Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, pp. 570–75. Association for Computational Linguistics.CrossRef Google Scholar

Ingale, V. and Singh, P. (2019). Datasets for Machine Reading Comprehension: A Literature Review. Available at SSRN 3454037.CrossRef Google Scholar

Inoue, N., Stenetorp, P. and Inui, K. (2020). R4C: A Benchmark for Evaluating RC Systems to Get the Right Answer for the Right Reason. Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, pp. 6740-50.CrossRef Google Scholar

Jia, R. and Liang, P. (2017). Adversarial examples for evaluating reading comprehension systems. Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing pp. 2021–31.CrossRef Google Scholar

Jiang, Y., Joshi, N., Chen, Y.-C. and Bansal, M. (2019). Explore, Propose, and Assemble: An Interpretable Model for Multi-Hop Reading Comprehension. Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics, pp. 2714–25. Florence, Italy.CrossRef Google Scholar

Jin, D., Gao, S., Kao, J.-Y., Chung, T. and Hakkani-tur, D. (2020). Mmm: Multi-stage multi-task learning for multi-choice reading comprehension. The Thirty-Fourth AAAI Conference on Artificial Intelligence (AAAI-20).CrossRef Google Scholar

Jin, W., Yang, G. and Zhu, H. (2019). An Efficient Machine Reading Comprehension Method Based on Attention Mechanism. 2019 IEEE Intl Conf on Parallel & Distributed Processing with Applications, Big Data & Cloud Computing, Sustainable Computing & Communications, Social Computing & Networking (ISPA/BDCloud/SocialCom/SustainCom), pp. 1297-302. IEEE.CrossRef Google Scholar

Jing, Y., Xiong, D. and Zhen, Y. (2019). BiPaR: A Bilingual Parallel Dataset for Multilingual and Cross-lingual Reading Comprehension on Novels. Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, pp. 2452–62. Hong Kong, China.CrossRef Google Scholar

Joshi, M., Choi, E., Weld, D.S. and Zettlemoyer, L. (2017). Triviaqa: A large scale distantly supervised challenge dataset for reading comprehension. Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pp. 1601-11.CrossRef Google Scholar

Jurafsky, D. and Martin, J.H. (2019). Speech and language processing. Pearson London.Google Scholar

Kadlec, R., Bajgar, O. and Kleindienst, J. (2016a). From Particular to General: A Preliminary Case Study of Transfer Learning in Reading Comprehension. NIPS Machine Intelligence Workshop.Google Scholar

Kadlec, R., Schmid, M., Bajgar, O. and Kleindienst, J. (2016b). Text understanding with the attention sum reader network. Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics pp. 908–18.CrossRef Google Scholar

Ke, N.R., Zolna, K., Sordoni, A., Lin, Z., Trischler, A., Bengio, Y., Pineau, J., Charlin, L. and Pal, C. (2018). Focused Hierarchical RNNs for Conditional Sequence Processing. Proceedings of the 35 th International Conference on Machine Learning. Google Scholar

Khashabi, D., Chaturvedi, S., Roth, M., Upadhyay, S. and Roth, D. (2018). Looking Beyond the Surface: A Challenge Set for Reading Comprehension over Multiple Sentences. Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long Papers), pp. 252-62.Google Scholar

Kim, Y. (2014). Convolutional neural networks for sentence classification. Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing (EMNLP), pp. 1746-51. Doha, Qatar.CrossRef Google Scholar

Kim, Y., Jernite, Y., Sontag, D. and Rush, A.M. (2016). Character-aware neural language models. Thirtieth AAAI Conference on Artificial Intelligence.CrossRef Google Scholar

Kobayashi, S., Tian, R., Okazaki, N. and Inui, K. (2016). Dynamic entity representation with max-pooling improves machine reading. Proceedings of the 2016 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, pp. 850-55.CrossRef Google Scholar

KoČiskÝ, T., Schwarz, J., Blunsom, P., Dyer, C., Hermann, K.M., Melis, G. and Grefenstette, E. (2018). The narrativeqa reading comprehension challenge. Transactions of the Association for Computational Linguistics 6, 317–328.CrossRef Google Scholar

Kodra, L. and Meçe, E.K. (2017). Question Answering Systems: A Review on Present Developments, Challenges and Trends. INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE 8, 217–224.Google Scholar

Kundu, S. and Ng, H.T. (2018a). A Nil-Aware Answer Extraction Framework for Question Answering. Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, pp. 4243-52.CrossRef Google Scholar

Kundu, S. and Ng, H.T. (2018b). A Question-Focused Multi-Factor Attention Network for Question Answering. Association for the Advancement of Artificial Intelligence (AAAI2018).Google Scholar

Kwiatkowski, T., Palomaki, J., Redfield, O., Collins, M., Parikh, A., Alberti, C., Epstein, D., Polosukhin, I., Devlin, J. and Lee, K. (2019). Natural questions: a benchmark for question answering research. Transactions of the Association for Computational Linguistics 7, 453–466.Google Scholar

Lai, G., Xie, Q., Liu, H., Yang, Y. and Hovy, E. (2017). Race: Large-scale reading comprehension dataset from examinations. Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing, pp. 785–94CrossRef Google Scholar

LeCun, Y., Bottou, L., Bengio, Y. and Haffner, P. (1998). Gradient-based learning applied to document recognition. Proceedings of the IEEE 86, 2278–2324.CrossRef Google Scholar

Lee, G., Hwang, S.-w. and Cho, H. (2020). SQuAD2-CR: Semi-supervised Annotation for Cause and Rationales for Unanswerability in SQuAD 2.0. Proceedings of The 12th Language Resources and Evaluation Conference, pp. 5425-32.Google Scholar

Lee, H.-g. and Kim, H. (2020). GF-Net: Improving Machine Reading Comprehension with Feature Gates. Pattern Recognition Letters 129, 8–15.CrossRef Google Scholar

Lee, K., Park, S., Han, H., Yeo, J., Hwang, S.-w. and Lee, J. (2019a). Learning with limited data for multilingual reading comprehension. Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP), pp. 2833-43.Google Scholar

Lee, S., Kim, D. and Park, J. (2019b). Domain-agnostic Question-Answering with Adversarial Training. Proceedings of the 2nd Workshop on Machine Reading for Question Answering, pp. 196-202. Hong Kong, China.Google Scholar

Lehnert, W.G. (1977). The process of question answering. Yale Univ New Haven Conn Dept Of Computer Science.Google Scholar

Li, H., Zhang, X., Liu, Y., Zhang, Y., Wang, Q., Zhou, X., Liu, J., Wu, H. and Wang, H. (2019a). D-NET: A Simple Framework for Improving the Generalization of Machine Reading Comprehension. Proceedings of 2nd Machine Reading for Reading Comprehension Workshop at EMNLP.Google Scholar

Li, J., Li, B. and Lv, X. (2018). Machine Reading Comprehension Based on the Combination of BIDAF Model and Word Vectors. Proceedings of the 2nd International Conference on Computer Science and Application Engineering, pp. 89. ACM.CrossRef Google Scholar

Li, X., Zhang, Z., Zhu, W., Li, Z., Ni, Y., Gao, P., Yan, J. and Xie, G. (2019b). Pingan Smart Health and SJTU at COIN-Shared Task: utilizing Pre-trained Language Models and Common-sense Knowledge in Machine Reading Tasks. Proceedings of the First Workshop on Commonsense Inference in Natural Language Processing, pp. 93-98.Google Scholar

Li, Y., Li, H. and Liu, J. (2019c). Towards Robust Neural Machine Reading Comprehension via Question Paraphrases. 2019 International Conference on Asian Language Processing (IALP), pp. 290-95. IEEE.Google Scholar

Liang, Y., Li, J. and Yin, J. (2019). A New Multi-Choice Reading Comprehension Dataset for Curriculum Learning. Asian Conference on Machine Learning, pp. 742-57.Google Scholar

Lin, C.-Y. (2004). Rouge: A package for automatic evaluation of summaries. Proceedings of the Workshop on Text Summarization Branches Out (WAS 2004): Association for Computational Linguistics.Google Scholar

Lin, K., Tafjord, O., Clark, P. and Gardner, M. (2019). Reasoning Over Paragraph Effects in Situations. Proceedings of the 2nd Workshop on Machine Reading for Question Answering, pp. 58-62. Hong Kong, China.CrossRef Google Scholar

Lin, X., Liu, R. and Li, Y. (2018). An Option Gate Module for Sentence Inference on Machine Reading Comprehension. Proceedings of the 27th ACM International Conference on Information and Knowledge Management, pp. 1743-46. ACM.CrossRef Google Scholar

Liu, C., Zhao, Y., Si, Q., Zhang, H., Li, B. and Yu, D. (2018a). Multi-Perspective Fusion Network for Commonsense Reading Comprehension. Chinese Computational Linguistics and Natural Language Processing Based on Naturally Annotated Big Data, pp. 262-74. Springer.CrossRef Google Scholar

Liu, D., Gong, Y., Fu, J., Yan, Y., Chen, J., Jiang, D., Lv, J. and Duan, N. (2020a). RikiNet: Reading Wikipedia Pages for Natural Question Answering. Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, pp. 6762–71.CrossRef Google Scholar

Liu, F. and Perez, J. (2017). Gated end-to-end memory networks. Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics: Volume 1, Long Papers, pp. 1-10.CrossRef Google Scholar

Liu, J., Lin, Y., Liu, Z. and Sun, M. (2019a). XQA: A cross-lingual open-domain question answering dataset. Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics, pp. 2358-68.CrossRef Google Scholar

Liu, J., Wei, W., Sun, M., Chen, H., Du, Y. and Lin, D. (2018b). A Multi-answer Multi-task Framework for Real-world Machine Reading Comprehension. Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, pp. 2109-18.CrossRef Google Scholar

Liu, K., Liu, X., Yang, A., Liu, J., Su, J., Li, S. and She, Q. (2020b). A Robust Adversarial Training Approach to Machine Reading Comprehension. AAAI, pp. 8392-400.CrossRef Google Scholar

Liu, R., Hu, J., Wei, W., Yang, Z. and Nyberg, E. (2017). Structural embedding of syntactic trees for machine comprehension. Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing pp. 815–24CrossRef Google Scholar

Liu, S., Zhang, S., Zhang, X. and Wang, H. (2019b). R-trans: RNN Transformer Network for Chinese Machine Reading Comprehension. IEEE Access 7, 27736–27745.Google Scholar

Liu, S., Zhang, X., Zhang, S., Wang, H. and Zhang, W. (2019c). Review Neural Machine Reading Comprehension: Methods and Trends. Applied Sciences 9, 3698.Google Scholar

Liu, X., Shen, Y., Duh, K. and Gao, J. (2018c). Stochastic Answer Networks for Machine Reading Comprehension. Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics (Long Papers): 1694–704CrossRef Google Scholar

Liu, Y., Huang, Z., Hu, M., Du, S., Peng, Y., Li, D. and Wang, X. (2018d). MFM: A Multi-level Fused Sequence Matching Model for Candidates Filtering in Multi-paragraphs Question-Answering. Pacific Rim Conference on Multimedia, pp. 449-58. Springer.CrossRef Google Scholar

Longpre, S., Lu, Y., Tu, Z. and DuBois, C. (2019). An Exploration of Data Augmentation and Sampling Techniques for Domain-Agnostic Question Answering. Proceedings of the 2nd Workshop on Machine Reading for Question Answering, pp. 220-27.CrossRef Google Scholar

Ma, K., Jurczyk, T. and Choi, J.D. (2018). Challenging Reading Comprehension on Daily Conversation: Passage Completion on Multiparty Dialog. Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long Papers), pp. 2039-48.CrossRef Google Scholar

Manning, C.D., Raghavan, P. and Schütze, H. (2008). Chapter 8: Evaluation in information retrieval. Introduction to information retrieval: Cambridge University Press.Google Scholar

Miao, H., Liu, R. and Gao, S. (2019a). A Multiple Granularity Co-Reasoning Model for Multi-choice Reading Comprehension. 2019 International Joint Conference on Neural Networks (IJCNN), pp. 1-7. IEEE.Google Scholar

Miao, H., Liu, R. and Gao, S. (2019b). Option Attentive Capsule Network for Multi-choice Reading Comprehension. International Conference on Neural Information Processing, pp. 306-18. Springer.CrossRef Google Scholar

Mihaylov, T. and Frank, A. (2018). Knowledgeable Reader: Enhancing Cloze-Style Reading Comprehension with External Commonsense Knowledge. Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics (Long Papers), pp. 821–32.CrossRef Google Scholar

Mihaylov, T. and Frank, A. (2019). Discourse-Aware Semantic Self-Attention for Narrative Reading Comprehension. Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP), pp. 2541-52. Hong Kong, China.CrossRef Google Scholar

Mihaylov, T., Kozareva, Z. and Frank, A. (2017). Neural Skill Transfer from Supervised Language Tasks to Reading Comprehension. Workshop on Learning with Limited Labeled Data: Weak Supervision and Beyond at NIPS.Google Scholar

Mikolov, T., Sutskever, I., Chen, K., Corrado, G.S. and Dean, J. (2013). Distributed representations of words and phrases and their compositionality. Advances in neural information processing systems, pp. 3111-19.Google Scholar

Min, S., Seo, M. and Hajishirzi, H. (2017). Question answering through transfer learning from large fine-grained supervision data. Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics (Short Papers) pp. 510–17.CrossRef Google Scholar

Min, S., Zhong, V., Socher, R. and Xiong, C. (2018). Efficient and Robust Question Answering from Minimal Context over Documents. Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics (Long Papers), pp. 1725–35.CrossRef Google Scholar

Min, S., Zhong, V., Zettlemoyer, L. and Hajishirzi, H. (2019). Multi-hop Reading Comprehension through Question Decomposition and Rescoring. Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics, pp. 6097-109. Florence, Italy.CrossRef Google Scholar

Mozannar, H., Maamary, E., El Hajal, K. and Hajj, H. (2019). Neural Arabic Question Answering. Proceedings of the Fourth Arabic Natural Language Processing Workshop, pp. 108-18. Florence, Italy.CrossRef Google Scholar

Munkhdalai, T. and Yu, H. (2017). Reasoning with memory augmented neural networks for language comprehension. Proceedings of the International Conference on Learning Representations (ICLR).Google Scholar

Nakatsuji, M. and Okui, S. (2020). Answer Generation through Unified Memories over Multiple Passages. Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence (IJCAI-20).CrossRef Google Scholar

Ng, H.T., Teo, L.H. and Kwan, J.L.P. (2000). A Machine Learning Approach to Answering Questions for Reading Comprehension Tests. Proceedings of the 2000 Joint SIGDAT conference on Empirical methods in natural language processing and very large corpora: held in conjunction with the 38th Annual Meeting of the Association for Computational Linguistics, pp. 124-32.CrossRef Google Scholar

Nguyen, T., Rosenberg, M., Song, X., Gao, J., Tiwary, S., Majumder, R. and Deng, L. (2016). MS MARCO: A Human Generated Machine Reading Comprehension Dataset. Proceedings of the Workshop on Cognitive Computation: Integrating neural and symbolic approaches 2016 co-located with the 30th Annual Conference on Neural Information Processing Systems (CoCo@NIPS), Barcelona, Spain.Google Scholar

Nie, Y., Wang, S. and Bansal, M. (2019). Revealing the Importance of Semantic Retrieval for Machine Reading at Scale. Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP), pp. 2553-66. Hong Kong, China.CrossRef Google Scholar

Nishida, K., Nishida, K., Nagata, M., Otsuka, A., Saito, I., Asano, H. and Tomita, J. (2019a). Answering while Summarizing: Multi-task Learning for Multi-hop QA with Evidence Extraction. Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics, pp. 2335-45. Florence, Italy.CrossRef Google Scholar

Nishida, K., Nishida, K., Saito, I., Asano, H. and Tomita, J. (2020). Unsupervised Domain Adaptation of Language Models for Reading Comprehension. Proceedings of the 12th Conference on Language Resources and Evaluation (LREC 2020), pp. 5392–99.Google Scholar

Nishida, K., Saito, I., Nishida, K., Shinoda, K., Otsuka, A., Asano, H. and Tomita, J. (2019b). Multi-style Generative Reading Comprehension. Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics, pp. 2273-84. Florence, Italy.CrossRef Google Scholar

Nishida, K., Saito, I., Otsuka, A., Asano, H. and Tomita, J. (2018). Retrieve-and-read: Multi-task learning of information retrieval and reading comprehension. Proceedings of the 27th ACM International Conference on Information and Knowledge Management, pp. 647-56. ACM.CrossRef Google Scholar

Niu, Y., Jiao, F., Zhou, M., Yao, T., Xu, J. and Huang, M. (2020). A Self-Training Method for Machine Reading Comprehension with Soft Evidence Extraction. Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, pp. 3916–27.Google Scholar

Onishi, T., Wang, H., Bansal, M., Gimpel, K. and McAllester, D. (2016). Who did what: A large-scale person-centered cloze dataset. Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing, pp. 2230–35.CrossRef Google Scholar

Osama, R., El-Makky, N.M. and Torki, M. (2019). Question Answering Using Hierarchical Attention on Top of BERT Features. Proceedings of the 2nd Workshop on Machine Reading for Question Answering, pp. 191-95.CrossRef Google Scholar

Ostermann, S., Modi, A., Roth, M., Thater, S. and Pinkal, M. (2018). MCScript: A Novel Dataset for Assessing Machine Comprehension Using Script Knowledge. Proceedings of the Eleventh International Conference on Language Resources and Evaluation (LREC), Miyazaki, Japan.Google Scholar

Ostermann, S., Roth, M. and Pinkal, M. (2019). MCScript2.0: A Machine Comprehension Corpus Focused on Script Events and Participants. Proceedings of the Eighth Joint Conference on Lexical and Computational Semantics (*SEM 2019), pp. 103-17. Minneapolis, Minnesota.Google Scholar

Pampari, A., Raghavan, P., Liang, J. and Peng, J. (2018). emrQA: A Large Corpus for Question Answering on Electronic Medical Records. Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, pp. 2357–68CrossRef Google Scholar

Pang, L., Lan, Y., Guo, J., Xu, J., Su, L. and Cheng, X. (2019). Has-qa: Hierarchical answer spans model for open-domain question answering. Proceedings of the AAAI Conference on Artificial Intelligence, pp. 6875-82.CrossRef Google Scholar

Papineni, K., Roukos, S., Ward, T. and Zhu, W.-J. (2002). BLEU: a method for automatic evaluation of machine translation. Proceedings of the 40th annual meeting on association for computational linguistics, pp. 311-18. Association for Computational Linguistics.Google Scholar

Pappas, D., Stavropoulos, P., Androutsopoulos, I. and McDonald, R. (2020). BioMRC: A Dataset for Biomedical Machine Reading Comprehension. Proceedings of the 19th SIGBioMed Workshop on Biomedical Language Processing, pp. 140-49.CrossRef Google Scholar

Park, C., Lee, C. and Song, H. (2019). VS3-NET: Neural Variational Inference Model for Machine-Reading Comprehension. ETRI Journal 41, 771–781.CrossRef Google Scholar

Pennington, J., Socher, R. and Manning, C. (2014). Glove: Global vectors for word representation. Proceedings of the 2014 conference on empirical methods in natural language processing (EMNLP), pp. 1532-43.CrossRef Google Scholar

Peters, M.E., Neumann, M., Iyyer, M., Gardner, M., Clark, C., Lee, K. and Zettlemoyer, L. (2018). Deep contextualized word representations. Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (NAACL-HLT), pp. 2227-37. New Orleans, Louisiana.CrossRef Google Scholar

Prakash, T., Tripathy, B.K. and Banu, K.S. (2018). ALICE: A Natural Language Question Answering System Using Dynamic Attention and Memory. International Conference on Soft Computing Systems, pp. 274-82. Springer.CrossRef Google Scholar

Pugaliya, H., Route, J., Ma, K., Geng, Y. and Nyberg, E. (2019). Bend but Don’t Break? Multi-Challenge Stress Test for QA Models. Proceedings of the 2nd Workshop on Machine Reading for Question Answering, pp. 125-36.CrossRef Google Scholar

Qiu, D., Zhang, Y., Feng, X., Liao, X., Jiang, W., Lyu, Y., Liu, K. and Zhao, J. (2019). Machine Reading Comprehension Using Structural Knowledge Graph-aware Network. Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP), pp. 5898-903.CrossRef Google Scholar

Radford, A., Narasimhan, K., Salimans, T. and Sutskever, I. (2018). Improving language understanding by generative pre-training.Google Scholar

Rajpurkar, P., Jia, R. and Liang, P. (2018). Know What You Don’t Know: Unanswerable Questions for SQuAD. Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics (Short Papers), pp. 784–89.CrossRef Google Scholar

Rajpurkar, P., Zhang, J., Lopyrev, K. and Liang, P. (2016). Squad: 100,000+ questions for machine comprehension of text. Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing, pp. 2383–92.Google Scholar

Ran, Q., Lin, Y., Li, P., Zhou, J. and Liu, Z. (2019). NumNet: Machine Reading Comprehension with Numerical Reasoning. Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP), pp. 2474-84. Hong Kong, China.CrossRef Google Scholar

Reddy, S., Chen, D. and Manning, C.D. (2019). Coqa: A conversational question answering challenge. Transactions of the Association for Computational Linguistics 7, 249–266.Google Scholar

Ren, M., Huang, H., Wei, R., Liu, H., Bai, Y., Wang, Y. and Gao, Y. (2019). Multiple Perspective Answer Reranking for Multi-passage Reading Comprehension. CCF International Conference on Natural Language Processing and Chinese Computing, pp. 736-47. Springer.CrossRef Google Scholar

Ren, Q., Cheng, X. and Su, S. (2020). Multi-Task Learning with Generative Adversarial Training for Multi-Passage Machine Reading Comprehension. AAAI, pp. 8705-12.CrossRef Google Scholar

Richardson, M., Burges, C.J.C and Renshaw, E. (2013). Mctest: A challenge dataset for the open-domain machine comprehension of text. Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing, pp. 193-203.Google Scholar

Riloff, E. and Thelen, M. (2000). A Rule-Based Question Answering System for Reading Comprehension Tests. Proceedings of the 2000 ANLP/NAACL Workshop on Reading comprehension tests as evaluation for computer-based language understanding sytems, pp. 13-19. Association for Computational Linguistics.Google Scholar

Ruder, S. (2019). Neural Transfer Learning for Natural Language Processing. NATIONAL UNIVERSITY OF IRELAND, GALWAY.Google Scholar

Sachan, M. and Xing, E. (2018). Self-Training for Jointly Learning to Ask and Answer Questions. Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long Papers), pp. 629-40.CrossRef Google Scholar

Saha, A., Aralikatte, R., Khapra, M.M. and Sankaranarayanan, K. (2018). DuoRC: Towards Complex Language Understanding with Paraphrased Reading Comprehension. Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics (Long Papers), pp. 1683–93.CrossRef Google Scholar

Salant, S. and Berant, J. (2018). Contextualized Word Representations for Reading Comprehension. Proceedings of NAACL-HLT 2018, pp. 554–59.Google Scholar

Sayama, H.F., Araujo, A.V. and Fernandes, E.R. (2019). FaQuAD: Reading Comprehension Dataset in the Domain of Brazilian Higher Education. 2019 8th Brazilian Conference on Intelligent Systems (BRACIS), pp. 443-48. IEEE.Google Scholar

Schlegel, V., Valentino, M., Freitas, A., Nenadic, G. and Batista-Navarro, R. (2020). A framework for evaluation of Machine Reading Comprehension Gold Standards. Proceedings of the 12th Conference on Language Resources and Evaluation, pp. 5359–69.Google Scholar

Seo, M., Kembhavi, A., Farhadi, A. and Hajishirzi, H. (2017). Bidirectional attention flow for machine comprehension. Proceedings of the 5th International Conference on Learning Representations (ICLR).Google Scholar

Seo, M., Kwiatkowski, T., Parikh, A.P., Farhadi, A. and Hajishirzi, H. (2018). Phrase-Indexed Question Answering: A New Challenge for Scalable Document Comprehension. Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing pp. 559–64.CrossRef Google Scholar

Shao, C.C., Liu, T., Lai, Y., Tseng, Y. and Tsai, S. (2018). DRCD: a Chinese Machine Reading Comprehension Dataset. Proceedings of the Workshop on Machine Reading for Question Answering, pp. pages 37–46. Association for Computational Linguistics.Google Scholar

Sharma, P. and Roychowdhury, S. (2019). IIT-KGP at COIN 2019: Using pre-trained Language Models for modeling Machine Comprehension. Proceedings of the First Workshop on Commonsense Inference in Natural Language Processing, pp. 80-84.Google Scholar

Shen, Y., Huang, P.-S., Gao, J. and Chen, W. (2017). Reasonet: Learning to Stop Reading in Machine Comprehension. Proceedings of the 23rd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD 2017), pp. 1047-55. ACM.CrossRef Google Scholar

Sheng, Y., Lan, M. and Wu, Y. (2018). ECNU at SemEval-2018 Task 11: Using Deep Learning Method to Address Machine Comprehension Task. Proceedings of The 12th International Workshop on Semantic Evaluation, pp. 1048-52. Association for Computational Linguistics.CrossRef Google Scholar

Song, J., Tang, S., Qian, T., Zhu, W. and Wu, F. (2018). Reading Document and Answering Question via Global Attentional Inference. Pacific Rim Conference on Multimedia (PCM 2018), pp. 335-45. Springer.CrossRef Google Scholar

Song, L., Wang, Z., Yu, M., Zhang, Y., Florian, R. and Gildea, D. (2020). Evidence Integration for Multi-hop Reading Comprehension with Graph Neural Networks. IEEE Transactions on knowledge data engineering.Google Scholar

Soni, S. and Roberts, K. (2020). Evaluation of Dataset Selection for Pre-Training and Fine-Tuning Transformer Language Models for Clinical Question Answering. Proceedings of The 12th Language Resources and Evaluation Conference, pp. 5532-38.Google Scholar

Srivastava, R.K., Greff, K. and Schmidhuber, J. (2015). Training very deep networks. Advances in neural information processing systems, pp. 2377-85.Google Scholar

Su, D., Xu, Y., Winata, G.I., Xu, P., Kim, H., Liu, Z. and Fung, P. (2019). Generalizing Question Answering System with Pre-trained Language Model Fine-tuning. Proceedings of the 2nd Workshop on Machine Reading for Question Answering, pp. 203-11.CrossRef Google Scholar

Sugawara, S., Inui, K., Sekine, S. and Aizawa, A. (2018). What Makes Reading Comprehension Questions Easier? Proceedings of 2018 conference on empirical methods in natural language processing, pp. 4208-40219.Google Scholar

Sugawara, S., Kido, Y., Yokono, H. and Aizawa, A. (2017). Evaluation Metrics for Machine Reading Comprehension: Prerequisite Skills and Readability. Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pp. 806-17.CrossRef Google Scholar

Sugawara, S., Stenetorp, P., Inui, K. and Aizawa, A. (2020). Assessing the Benchmarking Capacity of Machine Reading Comprehension Datasets. AAAI, pp. 8918-27.CrossRef Google Scholar

Sun, K., Yu, D., Yu, D. and Cardie, C. (2019). Improving Machine Reading Comprehension with General Reading Strategies. Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, pp. 2633-43.CrossRef Google Scholar

Sun, K., Yu, D., Yu, D. and Cardie, C. (2020). Investigating Prior Knowledge for Challenging Chinese Machine Reading Comprehension. Transactions of the Association for Computational Linguistics 8, 141–155.CrossRef Google Scholar

Šuster, S. and Daelemans, W. (2018). CliCR: A Dataset of Clinical Case Reports for Machine Reading Comprehension. Proceedings of NAACL-HLT 2018 pp. 1551–63.Google Scholar

Swayamdipta, S., Parikh, A.P. and Kwiatkowski, T. (2018). Multi-mention learning for reading comprehension with neural cascades. Proceeding of the International Conference on Learning Representations (ICLR).Google Scholar

Takahashi, T., Taniguchi, M., Taniguchi, T. and Ohkuma, T. (2019). CLER: Cross-task learning with expert representation to generalize reading and understanding. Proceedings of the 2nd Workshop on Machine Reading for Question Answering, pp. 183-90.Google Scholar

Talmor, A. and Berant, J. (2019). MultiQA: An Empirical Investigation of Generalization and Transfer in Reading Comprehension. Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics, pp. 4911-21. Florence, Italy.Google Scholar

Tan, C., Wei, F., Yang, N., Du, B., Lv, W. and Zhou, M. (2018a). S-Net: From Answer Extraction to Answer Synthesis for Machine Reading Comprehension. Association for the Advancement of Artificial Intelligence (AAAI).Google Scholar

Tan, C., Wei, F., Zhou, Q., Yang, N., Lv, W. and Zhou, M. (2018b). I Know There Is No Answer: Modeling Answer Validation for Machine Reading Comprehension. CCF International Conference on Natural Language Processing and Chinese Computing, pp. 85-97. Springer.CrossRef Google Scholar

Tang, H., Hong, Y., Chen, X., Wu, K. and Zhang, M. (2019a). How to Answer Comparison Questions. 2019 International Conference on Asian Language Processing (IALP), pp. 331-36. IEEE.Google Scholar

Tang, M., Cai, J. and Zhuo, H.H. (2019b). Multi-matching network for multiple choice reading comprehension. Proceedings of the AAAI Conference on Artificial Intelligence, pp. 7088-95.Google Scholar

Tay, Y., Luu, A.T. and Hui, S.C. (2018). Multi-Granular Sequence Encoding via Dilated Compositional Units for Reading Comprehension. Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, pp. 2141-51.CrossRef Google Scholar

Tay, Y., Wang, S., Luu, A.T., Fu, J., Phan, M.C., Yuan, X., Rao, J., Hui, S.C. and Zhang, A. (2019). Simple and Effective Curriculum Pointer-Generator Networks for Reading Comprehension over Long Narratives. Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics, pp. 4922-31. Florence, Italy.CrossRef Google Scholar

Trischler, A., Wang, T., Yuan, X., Harris, J., Sordoni, A., Bachman, P. and Suleman, K. (2017). Newsqa: A machine comprehension dataset. Proceedings of the 2nd Workshop on Representation Learning for NLP.CrossRef Google Scholar

Tu, M., Huang, K., Wang, G., Huang, J., He, X. and Zhou, B. (2020). Select, Answer and Explain: Interpretable Multi-Hop Reading Comprehension over Multiple Documents. AAAI, pp. 9073-80.CrossRef Google Scholar

Tu, M., Wang, G., Huang, J., Tang, Y., He, X. and Zhou, B. (2019). Multi-hop Reading Comprehension across Multiple Documents by Reasoning over Heterogeneous Graphs. Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics, pp. 2704-13. Florence, Italy.CrossRef Google Scholar

Turpin, A. and Scholer, F. (2006). User performance versus precision measures for simple search tasks. Proceedings of the 29th annual international ACM SIGIR conference on Research and development in information retrieval, pp. 11-18. ACM.CrossRef Google Scholar

Vaswani, A., Shazeer, N., Parmar, N., Uszkoreit, J., Jones, L., Gomez, A.N., Kaiser, Ł. and Polosukhin, I. (2017). Attention is all you need. Advances in neural information processing systems, pp. 5998-6008.Google Scholar

Vedantam, R., Lawrence Zitnick, C. and Parikh, D. (2015). Cider: Consensus-based image description evaluation. Proceedings of the IEEE conference on computer vision and pattern recognition, pp. 4566-75.CrossRef Google Scholar

Wang, B., Guo, S., Liu, K., He, S. and Zhao, J. (2016). Employing External Rich Knowledge for Machine Comprehension. International Joint Conferences on Artificial Intelligence (IJCAI), pp. 2929-25.Google Scholar

Wang, B., Yao, T., Zhang, Q., Xu, J. and Wang, X. (2020b). ReCO: A Large Scale Chinese Reading Comprehension Dataset on Opinion. AAAI, pp. 9146-53.Google Scholar

Wang, B., Yao, T., Zhang, Q., Xu, J., Liu, K., Tian, Z. and Zhao, J. (2019a). Unsupervised Story Comprehension with Hierarchical Encoder-Decoder. Proceedings of the 2019 ACM SIGIR International Conference on Theory of Information Retrieval, pp. 93-100.CrossRef Google Scholar

Wang, B., Zhang, X., Zhou, X. and Li, J. (2020a). A Gated Dilated Convolution with Attention Model for Clinical Cloze-Style Reading Comprehension. International Journal of Environmental Research Public Health 17, 1323.CrossRef Google Scholar PubMed

Wang, C. and Jiang, H. (2019). Explicit Utilization of General Knowledge in Machine Reading Comprehension. Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics, pp. 2263–72. Florence, Italy.Google Scholar

Wang, H., Gan, Z., Liu, X., Liu, J., Gao, J. and Wang, H. (2019e). Adversarial Domain Adaptation for Machine Reading Comprehension. Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP), pp. 2510-20. Hong Kong, China.CrossRef Google Scholar

Wang, H., Lu, W. and Tang, Z. (2019d). Incorporating External Knowledge to Boost Machine Comprehension Based Question Answering. European Conference on Information Retrieval, pp. 819-27. Springer.Google Scholar

Wang, H., Yu, D., Sun, K., Chen, J., Yu, D., McAllester, D. and Roth, D. (2019b). Evidence Sentence Extraction for Machine Reading Comprehension. Proceedings of the 23rd Conference on Computational Natural Language Learning, pp. 696–707. Hong Kong, China.Google Scholar

Wang, H., Yu, M., Guo, X., Das, R., Xiong, W. and Gao, T. (2019c). Do Multi-hop Readers Dream of Reasoning Chains? Proceedings of the 2nd Workshop on Machine Reading for Question Answering, pp. 91-97. Hong Kong, China.CrossRef Google Scholar

Wang, S. and Jiang, J. (2017). Machine comprehension using match-lstm and answer pointer. Proceedings of the International Conference on Learning Representations (ICLR), pp. 1-15. Toulon, France.Google Scholar

Wang, S., Yu, M., Chang, S. and Jiang, J. (2018a). A Co-Matching Model for Multi-choice Reading Comprehension. Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics (Short Papers): 746–51.CrossRef Google Scholar

Wang, S., Yu, M., Guo, X., Wang, Z., Klinger, T., Zhang, W., Chang, S., Tesauro, G., Zhou, B. and Jiang, J. (2018b). R3: Reinforced ranker-reader for open-domain question answering. Association for the Advancement of Artificial Intelligence (AAAI 2018).Google Scholar

Wang, S., Yu, M., Jiang, J., Zhang, W., Guo, X., Chang, S., Wang, Z., Klinger, T., Tesauro, G. and Campbell, M. (2018c). Evidence Aggregation for Answer Re-Ranking in Open-Domain Question Answering. Proceedings of the International Conference on Learning Representations (ICLR).Google Scholar

Wang, T., Yuan, X. and Trischler, A. (2017a). A joint model for question answering and question generation. Learning to generate natural language workshop, ICML 2017.Google Scholar

Wang, W., Yan, M. and Wu, C. (2018d). Multi-granularity hierarchical attention fusion networks for reading comprehension and question answering. Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pp. 1705-14.CrossRef Google Scholar

Wang, W., Yang, N., Wei, F., Chang, B. and Zhou, M. (2017b). Gated self-matching networks for reading comprehension and question answering. Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pp. 189-98.CrossRef Google Scholar

Wang, Y. and Bansal, M. (2018). Robust Machine Comprehension Models via Adversarial Training. Proceedings of Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (NAACL-HLT 2018), pp. 575–81. Association for Computational Linguistics.Google Scholar

Wang, Y., Liu, K., Liu, J., He, W., Lyu, Y., Wu, H., Li, S. and Wang, H. (2018e). Multi-Passage Machine Reading Comprehension with Cross-Passage Answer Verification. Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pp. 1918-27.CrossRef Google Scholar

Wang, Z., Liu, J., Xiao, X., Lyu, Y. and Wu, T. (2018f). Joint Training of Candidate Extraction and Answer Selection for Reading Comprehension. Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics (Long Papers), pp. 1715–24.Google Scholar

Watarai, T. and Tsuchiya, M. (2020). Developing Dataset of Japanese Slot Filling Quizzes Designed for Evaluation of Machine Reading Comprehension. Proceedings of The 12th Language Resources and Evaluation Conference, pp. 6895-901.Google Scholar

Weissenborn, D., Wiese, G. and Seiffe, L. (2017). Making Neural QA as Simple as Possible but not Simpler. Proceedings of the 21st Conference on Computational Natural Language Learning (CoNLL), pp. 271–80. Vancouver, Canada.CrossRef Google Scholar

Welbl, J., Liu, N.F. and Gardner, M. (2017). Crowdsourcing multiple choice science questions. Proceedings of the 3rd Workshop on Noisy User-generated Text, pp. 94–106. Association for Computational Linguistics.Google Scholar

Welbl, J., Minervini, P., Bartolo, M., Stenetorp, P. and Riedel, S. (2020). Undersensitivity in neural reading comprehension. International Conference on Learning Representations (ICLR2020).CrossRef Google Scholar

Welbl, J., Stenetorp, P. and Riedel, S. (2018). Constructing datasets for multi-hop reading comprehension across documents. Transactions of the Association for Computational Linguistics 6, 287–302.CrossRef Google Scholar

Wu, B., Huang, H., Wang, Z., Feng, Q., Yu, J. and Wang, B. (2019). Improving the Robustness of Deep Reading Comprehension Models by Leveraging Syntax Prior. Proceedings of the 2nd Workshop on Machine Reading for Question Answering, pp. 53-57.CrossRef Google Scholar

Wu, Z. and Xu, H. (2020). Improving the Robustness of Machine Reading Comprehension Model with Hierarchical Knowledge and Auxiliary Unanswerability Prediction. Knowledge-Based Systems: 106075.Google Scholar

Xia, J., Wu, C. and Yan, M. (2019). Incorporating Relation Knowledge into Commonsense Reading Comprehension with Multi-task Learning. Proceedings of the 28th ACM International Conference on Information and Knowledge Management, pp. 2393–96. Beijing, China.CrossRef Google Scholar

Xie, P. and Xing, E. (2017). A constituent-centric neural architecture for reading comprehension. Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pp. 1405-14.CrossRef Google Scholar

Xie, Q., Lai, G., Dai, Z. and Hovy, E. (2018). Large-scale Cloze Test Dataset Created by Teachers. Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, pp. 2344-56.CrossRef Google Scholar

Xiong, C., Zhong, V. and Socher, R. (2017). Dynamic coattention networks for question answering. Proceedings of the 5th International Conference on Learning Representations (ICLR).Google Scholar

Xiong, C., Zhong, V. and Socher, R. (2018). Dcn+: Mixed objective and deep residual coattention for question answering. Proceedings of the International Conference on Learning Representations (ICLR).Google Scholar

Xiong, W., Yu, M., Guo, X., Wang, H., Chang, S., Campbell, M. and Wang, W.Y. (2019). Simple yet Effective Bridge Reasoning for Open-Domain Multi-Hop Question Answering. Proceedings of the 2nd Workshop on Machine Reading for Question Answering, pp. 48-52. Hong Kong, China.CrossRef Google Scholar

Xu, Y., Liu, W., Chen, G., Ren, B., Zhang, S., Gao, S. and Guo, J. (2019a). Enhancing Machine Reading Comprehension With Position Information. IEEE Access 7, 141602–141611.CrossRef Google Scholar

Xu, Y., Liu, X., Shen, Y., Liu, J. and Gao, J. (2019b). Multi-task Learning with Sample Re-weighting for Machine Reading Comprehension. Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long and Short Papers), pp. 2644-55. Minneapolis, Minnesota.CrossRef Google Scholar

Yadav, M., Vig, L. and Shroff, G. (2017). Learning and Knowledge Transfer with Memory Networks for Machine Comprehension. Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics: Volume 1, Long Papers, pp. 850-59.CrossRef Google Scholar

Yan, M., Xia, J., Wu, C., Bi, B., Zhao, Z., Zhang, J., Si, L., Wang, R., Wang, W. and Chen, H. (2019). A deep cascade model for multi-document reading comprehension. Proceedings of the AAAI Conference on Artificial Intelligence, pp. 7354-61.CrossRef Google Scholar

Yan, M., Zhang, H., Jin, D. and Zhou, J.T. (2020). Multi-source Meta Transfer for Low Resource Multiple-Choice Question Answering. Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, pp. 7331-41.CrossRef Google Scholar

Yang, A., Wang, Q., Liu, J., Liu, K., Lyu, Y., Wu, H., She, Q. and Li, S. (2019a). Enhancing Pre-Trained Language Representations with Rich Knowledge for Machine Reading Comprehension. Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics, pp. 2346-57.CrossRef Google Scholar

Yang, Y., Kang, S. and Seo, J. (2020). Improved Machine Reading Comprehension Using Data Validation for Weakly Labeled Data. IEEE Access 8, 5667–5677.CrossRef Google Scholar

Yang, Z., Dai, Z., Yang, Y., Carbonell, J., Salakhutdinov, R. and Le, Q.V. (2019b). XLNet: Generalized Autoregressive Pretraining for Language Understanding. Advances in neural information processing systems, pp. 5753-63.Google Scholar

Yang, Z., Dhingra, B., Yuan, Y., Hu, J., Cohen, W.W. and Salakhutdinov, R. (2017a). Words or characters? fine-grained gating for reading comprehension. Proceedings of the 5th International Conference on Learning Representations, (ICLR), Toulon, France.Google Scholar

Yang, Z., Hu, J., Salakhutdinov, R. and Cohen, W.W. (2017b). Semi-supervised QA with generative domain-adaptive nets. Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pp. 1040-50.CrossRef Google Scholar

Yang, Z., Qi, P., Zhang, S., Bengio, Y., Cohen, W., Salakhutdinov, R. and Manning, C.D. (2018). HotpotQA: A Dataset for Diverse, Explainable Multi-hop Question Answering. Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, pp. 2369-80.CrossRef Google Scholar

Yao, J., Feng, M., Feng, H., Wang, Z., Zhang, Y. and Xue, N. (2019). Smart: A stratified machine reading test. CCF International Conference on Natural Language Processing and Chinese Computing, pp. 67-79. Springer.CrossRef Google Scholar

Yin, W., Ebert, S. and Schütze, H. (2016). Attention-based convolutional neural network for machine comprehension. Proceedings of 2016 NAACL Human-Computer Question Answering Workshop, pp. 15–21.CrossRef Google Scholar

Yu, A.W., Dohan, D., Luong, M.-T., Zhao, R., Chen, K., Norouzi, M. and Le, Q.V. (2018). QANet: Combining Local Convolution with Global Self-Attention for Reading Comprehension. Proceedings of the International Conference on Learning Representations (ICLR).Google Scholar

Yu, J., Zha, Z. and Yin, J. (2019). Inferential machine comprehension: Answering questions by recursively deducing the evidence chain from text. Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics, pp. 2241-51.CrossRef Google Scholar

Yu, W., Jiang, Z., Dong, Y. and Feng, J. (2020). ReClor: A Reading Comprehension Dataset Requiring Logical Reasoning. International Conference on Learning Representations (ICLR2020).Google Scholar

Yuan, F., Shou, L., Bai, X., Gong, M., Liang, Y., Duan, N., Fu, Y. and Jiang, D. (2020a). Enhancing Answer Boundary Detection for Multilingual Machine Reading Comprehension. Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics: 925–34.CrossRef Google Scholar

Yuan, F., Xu, Y., Lin, Z., Wang, W. and Shi, G. (2019). Multi-perspective Denoising Reader for Multi-paragraph Reading Comprehension. International Conference on Neural Information Processing, pp. 222-34. Springer.CrossRef Google Scholar

Yuan, X., Fu, J., Cote, M.-A., Tay, Y., Pal, C. and Trischler, A. (2020b). Interactive machine comprehension with information seeking agents. Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics: 2325–38.CrossRef Google Scholar

Yue, X., Gutierrez, B.J. and Sun, H. (2020). Clinical Reading Comprehension: A Thorough Analysis of the emrQA Dataset. Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics.CrossRef Google Scholar

Zhang, C., Luo, C., Lu, J., Liu, A., Bai, B., Bai, K. and Xu, Z. (2020a). Read, Attend, and Exclude: Multi-Choice Reading Comprehension by Mimicking Human Reasoning Process. Proceedings of the 43rd International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 1945-48.Google Scholar

Zhang, J., Zhu, X., Chen, Q., Ling, Z., Dai, L., Wei, S. and Jiang, H. (2017). Exploring question representation and adaptation with neural networks. Computer and Communications (ICCC), 2017 3rd IEEE International Conference on, pp. 1975-84. IEEE.CrossRef Google Scholar

Zhang, S., Zhao, H., Wu, Y., Zhang, Z., Zhou, X. and Zhou, X. (2020b). DCMN+: Dual co-matching network for multi-choice reading comprehension. AAAI CrossRef Google Scholar

Zhang, X. and Wang, Z. (2020). Reception: Wide and Deep Interaction Networks for Machine Reading Comprehension (Student Abstract). AAAI, pp. 13987-88.CrossRef Google Scholar

Zhang, X., Wu, J., He, Z., Liu, X. and Su, Y. (2018). Medical Exam Question Answering with Large-scale Reading Comprehension. The Thirty-Second AAAI Conference on Artificial Intelligence (AAAI-18).CrossRef Google Scholar

Zhang, X., Yang, A., Li, S. and Wang, Y. (2019a). Machine Reading Comprehension: a Literature Review. arXiv preprint arXiv:1907.01686.Google Scholar

Zhang, Z., Wu, Y., Zhou, J., Duan, S., Zhao, H. and Wang, R. (2020c). SG-Net: Syntax-Guided Machine Reading Comprehension. AAAI, pp. 9636-43.Google Scholar

Zhang, Z., Zhao, H., Ling, K., Li, J., Li, Z., He, S. and Fu, G. (2019b). Effective subword segmentation for text comprehension. IEEE/ACM Transactions on Audio, Speech, and Language Processing 27, 1664–1674.CrossRef Google Scholar

Zheng, B., Wen, H., Liang, Y., Duan, N., Che, W., Jiang, D., Zhou, M. and Liu, T. (2020). Document Modeling with Graph Attention Networks for Multi-grained Machine Reading Comprehension. Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, pp. 6708–18.CrossRef Google Scholar

Zhou, M., Huang, M. and Zhu, X. (2020a). Robust reading comprehension with linguistic constraints via posterior regularization. IEEE/ACM Transactions on Audio, Speech, and Language Processing 28.CrossRef Google Scholar

Zhou, X., Luo, S. and Wu, Y. (2020b). Co-Attention Hierarchical Network: Generating Coherent Long Distractors for Reading Comprehension. Proceedings of the AAAI Conference on Artificial Intelligence, pp. 9725-32.CrossRef Google Scholar

Zhu, H., Dong, L., Wei, F., Wang, W., Qin, B. and Liu, T. (2019). Learning to Ask Unanswerable Questions for Machine Reading Comprehension. Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics, pp. 4238-48. Florence, Italy.CrossRef Google Scholar

Zhuang, Y. and Wang, H. (2019). Token-level dynamic self-attention network for multi-passage reading comprehension. Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics, pp. 2252-62.CrossRef Google Scholar

Article contents

A Survey on Machine Reading Comprehension Systems

Abstract

Keywords

Access options

Footnotes

References

Save article to Kindle

Save article to Dropbox

Save article to Google Drive

Reply to: Submit a response

Your details

You have entered the maximum number of contributors

Conflicting interests