Hostname: page-component-78c5997874-g7gxr Total loading time: 0 Render date: 2024-11-20T04:57:39.118Z Has data issue: false hasContentIssue false

A knowledge-based approach for selecting information sources*

Published online by Cambridge University Press:  01 May 2007

THOMAS EITER
Affiliation:
Institut für Informationssysteme, Technische Universität Wien, Favoritenstraβe 9-11, A-1040 Vienna, Austria e-mail: [email protected], [email protected], [email protected]
MICHAEL FINK
Affiliation:
Institut für Informationssysteme, Technische Universität Wien, Favoritenstraβe 9-11, A-1040 Vienna, Austria e-mail: [email protected], [email protected], [email protected]
HANS TOMPITS
Affiliation:
Institut für Informationssysteme, Technische Universität Wien, Favoritenstraβe 9-11, A-1040 Vienna, Austria e-mail: [email protected], [email protected], [email protected]

Abstract

Through the Internet and the World-Wide Web, a vast number of information sources has become available, which offer information on various subjects by different providers, often in heterogeneous formats. This calls for tools and methods for building an advanced information-processing infrastructure. One issue in this area is the selection of suitable information sources in query answering. In this paper, we present a knowledge-based approach to this problem, in the setting where one among a set of information sources (prototypically, data repositories) should be selected for evaluating a user query. We use extended logic programs (ELPs) to represent rich descriptions of the information sources, an underlying domain theory, and user queries in a formal query language (here, XML-QL, but other languages can be handled as well). Moreover, we use ELPs for declarative query analysis and generation of a query description. Central to our approach are declarative source-selection programs, for which we define syntax and semantics. Due to the structured nature of the considered data items, the semantics of such programs must carefully respect implicit context information in source-selection rules, and furthermore combine it with possible user preferences. A prototype implementation of our approach has been realized exploiting the DLV KR system and its PLP front-end for prioritized ELPs. We describe a representative example involving specific movie databases, and report about experimental results.

Type
Regular Papers
Copyright
Copyright © Cambridge University Press 2007

Access options

Get access to the full version of this content by using one of the access options below. (Log in options will check for institutional or personal access. Content may require purchase if you do not have access.)

References

Abiteboul, S., Buneman, P. and Suciu, D. 2000. Data on the Web: From Relations to Semistructured Data and XML. Morgan Kaufmann, Los Altos.Google Scholar
Alferes, J., Pereira, L., Przymusinska, H. and Przymusinski, T. 2002. LUPS – A Language for Updating Logic Programs. Artificial Intelligence 138, 1–2, 87116.CrossRefGoogle Scholar
Apt, K., Blair, H. and Walker, A. 1988. Towards a Theory of Declarative Knowledge. See Minker (1988), 89–148.Google Scholar
Arens, Y., Chee, C., Hsu, C. and Knoblock, C. 1993. Retrieving and Integrating Data from Multiple Information Sources. International Journal of Cooperative Information Systems 2, 2, 127158.CrossRefGoogle Scholar
Arens, Y. and Knoblock, C. 1992. Planning and Reformulating Queries for Semantically-Modeled Multidatabase Systems. Proceedings of the First International Conference on Information and Knowledge Managements. 92–101.Google Scholar
Arens, Y., Knoblock, C. and Shen, W. 1996. Query Reformulation for Dynamic Information Integration. Journal of Intelligent Information Systems 6, 2–3, 99130.CrossRefGoogle Scholar
Baral, C. 2003. Knowledge Representation, Reasoning and Declarative Problem Solving with Answer Sets. Cambridge University Press.CrossRefGoogle Scholar
Bayardo, R., Bohrer, B., Brice, R., Cichocki, A., Fowler, J., Helal, A., Kashyap, V., Ksiezyk, T., Martin, G., Nodine, M., Rashid, M., Rusinkiewicz, M., Shea, R., Unnikrishnan, C., Unruh, A. and Woelk, D. 1997. InfoSleuth: Semantic Integration of Information in Open and Dynamic Environments (Experience Paper). Proceedings of the ACM SIGMOD International Conference on Management of Data (SIGMOD '97). 195–206.CrossRefGoogle Scholar
Borgida, A., Brachman, R. J., McGuinness, D. L. and Resnick, L. A. 1989. CLASSIC: A Structural Data Model for Objects. Proceedings of the ACM SIGMOD International Conference on Management of Data (SIGMOD '89), J. Clifford, B. G. Lindsay, and D. Maier, Eds. ACM Press, 58–67.Google Scholar
Brewka, G. and Eiter, T. 1999. Preferred Answer Sets for Extended Logic Programs. Artificial Intelligence 109, 1–2, 297356.CrossRefGoogle Scholar
Buccafurri, F., Leone, N. and Rullo, P. 1996. Stable Models and their Computation for Logic Programming with Inheritance and True Negation. Journal of Logic Programming 27, 1, 543.CrossRefGoogle Scholar
Buccafurri, F., Leone, N. and Rullo, P. 2000. Enhancing Disjunctive Datalog by Constraints. IEEE Transactions on Knowledge and Data Engineering 12, 5, 845860.CrossRefGoogle Scholar
Burke, R., Hammond, K. and Kozlovsky, J. 1995. Knowledge-Based Information Retrieval from Semi-Structured Text. Working Notes of the AAAI '95 Fall Symposium, Series on AI Applications in Knowledge Navigation and Retrieval, Cambridge, MA. 19–24.Google Scholar
Chen, Y.-J. and Soo, V.-W. 2001. Ontology-Based Information Gathering Agents. Proceedings of the First Asia-Pacific Conference on Web Intelligence (WI 2001), N. Zhong et al., Ed. LNCS, subseries LNAI, vol. 2198. Springer, 423–427.Google Scholar
Collet, C., Huhns, M. and Shen, W.-M. 1991. Resource Integration using a Large Knowledge Base in Carnot. IEEE Computer 24, 12, 5562.CrossRefGoogle Scholar
Decker, K., Sycara, K. and Williamson, M. 1997. Middle-Agents for the Internet. Proceedings of the Fifteenth International Joint Conference on Artificial Intelligence (IJCAI '97). Vol. 1. Morgan Kaufmann, 578–583.Google Scholar
Delgrande, J. and Schaub, T. 1994. A General Approach to Specificity in Default Reasoning. Proceedings of the Fourth International Conference on Principles of Knowledge Representation and Reasoning (KR '94). 146–157.CrossRefGoogle Scholar
Delgrande, J., Schaub, T. and Tompits, H. 2001. plp: A Generic Compiler for Ordered Logic Programs. Proceedings of the Sixth International Conference on Logic Programming and Nonmonotonic Reasoning (LPNMR 2001), T. Eiter, W. Faber, and M. Truszczyński, Eds. LNCS, subseries LNAI, vol. 2173. Springer, 411–415.Google Scholar
Delgrande, J. P., Schaub, T. and Tompits, H. 2003. A Framework for Compiling Preferences in Logic Programs. Theory and Practice of Logic Programming 3, 2, 129187.CrossRefGoogle Scholar
Deutsch, A., Fernandez, M., Florescu, D., Levy, A. and Suciu, D. 1999. A Query Language for XML. Computer Networks 31, 11–16, 11551169.CrossRefGoogle Scholar
Dimopoulos, Y. and Kakas, A. 2001. Information Integration and Computational Logic. Computational Logic, Special Issue on the Future Technological Roadmap of Compulog-Net, 105–135.Google Scholar
Eiter, T., Fink, M., Sabbatini, G. and Tompits, H. 2002a. On Properties of Update Sequences Based on Causal Rejection. Theory and Practice of Logic Programming 2, 6, 721777.CrossRefGoogle Scholar
Eiter, T., Fink, M., Sabbatini, G. and Tompits, H. 2002b. Using Methods of Declarative Logic Programming for Intelligent Information Agents. Theory and Practice of Logic Programming 2, 6, 645719.CrossRefGoogle Scholar
Eiter, T., Fink, M. and Tompits, H. 2003. A Knowledge-Based Approach for Selecting Information Sources. Tech. Rep. INFSYS RR-1843-03-14, 2003, Institut für Informations-systeme, Technische Universität Wien.Google Scholar
Eiter, T., Gottlob, G. and Mannila, H. 1997. Disjunctive Datalog. ACM Transactions on Database Systems 22, 3, 364418.CrossRefGoogle Scholar
Eiter, T., Ianni, G., Schindlauer, R. and Tompits, H. 2005a. A Uniform Integration of Higher-Order Reasoning and External Evaluations in Answer-Set Programming. Proceedings of the Nineteenth International Joint Conference on Artificial Intelligence (IJCAI 2005). Morgan Kaufmann.Google Scholar
Eiter, T., Ianni, G., Schindlauer, R. and Tompits, H. 2005b. Nonmonotonic Description Logic Programs: Implementation and Experiments. Proceedings of the Twelfth International Conference on Logic for Programming, Artificial Intelligence and Reasoning (LPAR 2004), F. Baader and A. Voronkov, Eds. LNCS, vol. 3452. Springer, 511–517.Google Scholar
Eiter, T., Lukasiewicz, T., Schindlauer, R. and Tompits, H. 2004. Combining Answer-Set Programming with Description Logics for the Semantic Web. Proceedings of the Ninth International Conference on Principles of Knowledge Representation and Reasoning (KR 2004), D. Dubois, C. Welty, and M.-A. Williams, Eds. Morgan Kaufmann, 141–151.Google Scholar
Faber, W., Leone, N. and Pfeifer, G. 2004. Recursive Aggregates in Disjunctive Logic Programs: Semantics and Complexity. Proceedings of the Ninth European Conference on Logics in Artificial Intelligence (JELIA 2004), J. J. Alferes and J. A. Leite, Eds. LNCS, subseries LNAI, vol. 3229. Springer, 200–212.Google Scholar
Fellbaum, C. 1998. WordNet: An Electronic Lexical Database. MIT Press.CrossRefGoogle Scholar
Fink, M. 2002. Declarative Logic-Programming Components for Information Agents. Ph.D. thesis, Institut für Informationssysteme, Technische Universität Wien, Austria.Google Scholar
Fowler, J., Perry, B., Nodine, M. H. and Bargmeyer, B. 1999. Agent-Based Semantic Interoperability in InfoSleuth. SIGMOD Record 28, 1, 6067.CrossRefGoogle Scholar
Fuhr, N. 1999. A Decision-Theoretic Approach to Database Selection in Networked IR. ACM Transactions on Information Systems 17, 3, 229249.CrossRefGoogle Scholar
Garcia-Molina, H., Papakonstantinou, Y., Quass, D., Rajaraman, A., Sagiv, Y., Ullman, J., Vassalos, V. and Widom, J. 1997. The TSIMMIS Approach to Mediation: Data Models and Languages. Journal of Intelligent Information Systems 8, 2, 117132.CrossRefGoogle Scholar
Geerts, P. and Vermeir, D. 1993. A Nonmonotonic Reasoning Formalism using Implicit Specificity Information. Proceedings of the Second International Workshop on Logic Programming and Nonmonotonic Reasoning (LPNMR '93), L.-M. Pereira and A. Nerode, Eds. LNCS, subseries LNAI. Springer, 380–396.Google Scholar
Geerts, P. and Vermeir, D. 1995. Specificity by Default. Proceedings of the European Conference on Symbolic and Quantitative Approaches to Reasoning and Uncertainty (ECSQARU '95). LNCS, subseries LNAI, vol. 946. Springer, 207–216.Google Scholar
Gelfond, M. and Lifschitz, V. 1991. Classical Negation in Logic Programs and Disjunctive Databases. New Generation Computing 9, 3–4, 365386.CrossRefGoogle Scholar
Genesereth, M., Keller, A. and Duschka, O. 1997. Infomaster: An Information Integration System. Proceedings of the ACM SIGMOD International Conference on Management of Data (SIGMOD '97), J. Peckham, Ed. ACM Press, 539–542.Google Scholar
Goto, S., Ozono, T. and Shintani, T. 2001. A Method for Information Source Selection using Thesaurus for Distributed Information Retrieval. Proceedings of the Pacific Asian Conference on Intelligent Systems 2001 (PAIS 2001). 272–277.Google Scholar
Grosof, B. N., Horrocks, I., Volz, R. and Decker, S. 2003. Description Logic Programs: Combining Logic Programs with Description Logics. Proceedings of the Twelfth International World Wide Web Conference (WWW 2003). ACM Press, 48–57.Google Scholar
Huffman, S. B. and Steier, D. 1995. A Navigation Assistant for Data Source Selection and Integration. Working Notes of the AAAI '95 Fall Symposium Series on AI Applications in Knowledge Navigation and Retrieval, Cambridge, MA. AAAI Press, 72–77.Google Scholar
Huhns, M. and Singh, M. 1992. The Semantic Integration of Information Models. Proceedings of the AAAI Workshop on Cooperation among Heterogeneous Intelligent Agents.Google Scholar
Inoue, K. and Sakama, C. 2000. Prioritized Logic Programming and Its Applications to Commonsense Reasoning. Artificial Intelligence 123, 1–2, 185222.Google Scholar
Kirk, T., Levy, A., Sagiv, Y., and Srivastava, D. 1995. The Information Manifold. Proceedings of the AAAI 2001 Spring Symposium on Information Gathering in Distributed Heterogeneous Environments. AAAI Press, 85–91.Google Scholar
Kowalski, R. A. and Sadri, F. 1990. Logic Programs with Exceptions. Proceedings of the Seventh International Conference on Logic Programming (ICLP '90). MIT Press, 598–616.Google Scholar
Krentel, M. 1988. The Complexity of Optimization Problems. Journal of Computer and System Sciences 36, 490509.CrossRefGoogle Scholar
Laenens, E. and Vermeir, D. 1990. A Logical Basis for Object-Oriented Programming. Proceedings of the Second European Workshop on Logics in Artificial Intelligence (JELIA '90). LNCS, subseries LNAI. Springer, 317–332.Google Scholar
Lenat, D. B. and Guha, R. V. 1990. Building Large Knowledge-Based Systems: Representation and Inference in the Cyc Project. Addison-Wesley.Google Scholar
Leone, N., Pfeifer, G., Faber, W., Eiter, T., Gottlob, G., Perri, S. and Scarcello, F. 2006. The DLV System for Knowledge Representation and Reasoning. ACM Transactions on Computational Logic. To appear.CrossRefGoogle Scholar
Levy, A., Rajaraman, A. and Ordille, J. 1996. Querying Heterogeneous Information Sources using Source Descriptions. Proceedings of the Twentysecond International Conference on Very Large Data Bases (VLDB '96), T. Vijayaraman, A. Buchmann, C. Mohan, and N. Sarda, Eds. Morgan Kaufmann, 251–262.Google Scholar
Levy, A., Srivastava, D. and Kirk, T. 1995. Data Model and Query Evaluation in Global Information Systems. Journal of Intelligent Information Systems 5, 2, 121143.CrossRefGoogle Scholar
Levy, A. and Weld, D. 2000. Intelligent Internet Systems. Artificial Intelligence 118, 1–2, 114.CrossRefGoogle Scholar
Lifschitz, V. and Turner, H. 1994. Splitting a Logic Program. Proceedings of the Eleventh International Conference on Logic Programming (ICLP '94). MIT Press, 23–38.Google Scholar
Luke, S., Spector, L., Rager, D. and Hendler, J. 1997. Ontology-Based Web Agents. Proceedings of the First International Conference on Autonomous Agents (Agents '97), W. L. Johnson, Ed. 59–66.Google Scholar
MacGregor, R. and Bates, R. 1987. The LOOM Knowledge Representation Language. Tech. Rep. RS-87-188, Information Sciences Institute, University of Southern California. Project Web page http://www.isi.edu/isd/LOOM/.Google Scholar
Minker, J., Ed. 1988. Foundations of Deductive Databases and Logic Programming. Morgan Kaufman, Washington DC.Google Scholar
Motik, B., Volz, R. and Maedche, A. 2003. Optimizing Query Answering in Description Logics using Disjunctive Deductive Databases. Proceedings of the Tenth International Workshop on Knowledge Representation meets Databases (KRDB 2003), F. Bry, C. Lutz, U. Sattler, and M. Schoop, Eds. CEUR Workshop Proceedings, vol. 79. RWTH Aachen University, 39–50. http://sunsite.informatik.rwth-aachen.de/Publications/CEUR-WS/Vol-79/.Google Scholar
Nodine, M., Ngu, A., Cassandra, A. and Bohrer, W. 2003. Scalable Semantic Brokering over Dynamic Heterogeneous Data Sources in InfoSleuth. IEEE Transactions on Knowledge and Data Engineering 15, 5, 10821098.CrossRefGoogle Scholar
Przymusinski, T. C. 1988. On the Declarative Semantics of Deductive Databases and Logic Programs. See Minker (1998), 193–216.Google Scholar
Sadri, F. and Toni, F. 2000. Computational Logic and Multi-Agent Systems: A Roadmap. Computational Logic, Special Issue on the Future Technological Roadmap of Compulog-Net, 1–31.Google Scholar
Schindlauer, R. 2002. Representation of SQL Queries for Declarative Query Analysis. M.S. thesis, Institut für Informationssysteme, Technische Universität Wien, Austria.Google Scholar
Sim, K. M. and Wong, P. T. 2001. Web-Based Information Retrieval using Agent and Ontology. In Proceedings of the First Asia-Pacific Conference on Web Intelligence (WI 2001), N. Zhong et al., Ed. LNCS, subseries LNAI, vol. 2198. Springer, 384–388.Google Scholar
Singh, M., Cannata, P., Huhns, M., Jacobs, N., Ksiezyk, T., Ong, K., Sheth, A., Tomlinson, C. and Woelk, D. 1997. The Carnot Heterogeneous Database Project: Implemented Applications. Distributed and Parallel Databases 5, 2, 207225.CrossRefGoogle Scholar
Subrahmanian, V., Bonatti, P., Dix, J., Eiter, T., Kraus, S., Ozcan, F. and Ross, R. 2000. Heterogeneous Agent Systems: Theory and Implementation. MIT Press.CrossRefGoogle Scholar
Swift, T. 2004. Deduction in Ontologies via ASP. Proceedings of the Seventh International Conference on Logic Programming and Nonmonotonic Reasoning (LPNMR 2004), I. Niemelä and V. Lifschitz, Eds. LNCS, subseries LNAI, vol. 2923. Springer, 275–288.Google Scholar
Van Nieuwenborgh, D., and Vermeir, D. 2002. Preferred Answer Sets of Ordered Logic Programs. Proceedings of the Eighth European Conference on Logics in Artificial Intelligence (JELIA 2002), S. Flesca, S. Greco, G. Ianni, and N. Leone, Eds. LNCS, subseries LNAI, vol. 2424. 432–443.Google Scholar
Wendlandt, E. B. and Driscoll, J. R. 1991. Incorporating a Semantic Analysis into a Document Retrieval Strategy. Proceedings of the Fourteenth Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, A. Bookstein, Y. Chiaramella, G. Salton, and V. V. Raghavan, Eds. ACM Press, 270–279.Google Scholar
Wiederhold, G. 1993. Intelligent Intration of Information. Proceedings of the ACM SIGMOD Conference on Management of Data (SIGMOD '93). 434–437.CrossRefGoogle Scholar