A survey of interestingness measures for knowledge discovery

KEN MCGARRY

doi:10.1017/S0269888905000408

Abstract

It is a well-known fact that the data mining process can generate many hundreds and often thousands of patterns from data. The task for the data miner then becomes one of determining the most useful patterns from those that are trivial or are already well known to the organization. It is therefore necessary to filter out those patterns through the use of some measure of the patterns actual worth. This article presents a review of the available literature on the various measures devised for evaluating and ranking the discovered patterns produced by the data mining process. These so-called interestingness measures are generally divided into two categories: objective measures based on the statistical strengths or properties of the discovered patterns and subjective measures that are derived from the user's beliefs or expectations of their particular problem domain. We evaluate the strengths and weaknesses of the various interestingness measures with respect to the level of user integration within the discovery process.

Crossref Citations

This article has been cited by the following publications. This list is generated based on data provided by Crossref.

Geng, Liqiang and Hamilton, Howard J. 2006. Interestingness measures for data mining. ACM Computing Surveys, Vol. 38, Issue. 3, p. 9.

Ayadi, Wassim and Arour, Khedija 2007. A Binary Decision Diagram to discover low threshold support frequent itemsets. p. 509.

Ohsaki, Miho Abe, Hidenao Tsumoto, Shusaku Yokoi, Hideto and Yamaguchi, Takahira 2007. Evaluation of rule interestingness measures in medical knowledge discovery in databases. Artificial Intelligence in Medicine, Vol. 41, Issue. 3, p. 177.

Holeňa, Martin 2007. Symbolic and Quantitative Approaches to Reasoning with Uncertainty. Vol. 4724, Issue. , p. 430.

Dong, Jie and Han, Min 2007. BitTableFI: An efficient mining frequent itemsets algorithm. Knowledge-Based Systems, Vol. 20, Issue. 4, p. 329.

Lallich, Stéphane Vaillant, Benoît and Lenca, Philippe 2007. A Probabilistic Framework Towards the Parameterization of Association Rule Interestingness Measures. Methodology and Computing in Applied Probability, Vol. 9, Issue. 3, p. 447.

Hofmann, Alexander Dedinski, Ivan Sick, Bernhard and de Meer, Hermann 2007. A Novelty-Driven Approach to Intrusion Alert Correlation Based on Distributed Hash Tables. p. 71.

Brisson, Laurent 2007. Ontologies-Based Databases and Information Systems. Vol. 4623, Issue. , p. 119.

Lenca, Philippe Vaillant, Benoît Meyer, Patrick and Lallich, Stephane 2007. Quality Measures in Data Mining. Vol. 43, Issue. , p. 51.

Yang, Jian Zhong, Ning Yao, Yiyu and Wang, Jue 2008. Local peculiarity factor and its application in outlier detection. p. 776.

Kasperkiewicz, Janusz and Marks, Maria 2008. Computational Science – ICCS 2008. Vol. 5103, Issue. , p. 702.

Li, Dong (Haoyuan) Laurent, Anne and Poncelet, Pascal 2008. Advances in Data Mining. Medical Applications, E-Commerce, Marketing, and Theoretical Aspects. Vol. 5077, Issue. , p. 283.

Lenca, Philippe Meyer, Patrick Vaillant, Benoît and Lallich, Stéphane 2008. On selecting interestingness measures for association rules: User oriented description and multiple criteria decision aid. European Journal of Operational Research, Vol. 184, Issue. 2, p. 610.

Kumar, Navin Gangopadhyay, Aryya Bapna, Sanjay Karabatis, George and Chen, Zhiyuan 2008. Measuring interestingness of discovered skewed patterns in data cubes. Decision Support Systems, Vol. 46, Issue. 1, p. 429.

Shaw, Gavin Xu, Yue and Geva, Shlomo 2008. Utilizing Non-redundant Association Rules from Multi-level Datasets. p. 681.

Glass, David H. 2008. Fuzzy confirmation measures. Fuzzy Sets and Systems, Vol. 159, Issue. 4, p. 475.

Sebastian, Yakub Loh, Brian Chung Shiong and Then, Patrick Hang Hui 2009. A Paradigm Shift: Combined Literature and Ontology-Driven Data Mining for Discovering Novel Relations in Biomedical Domain. p. 51.

Choudhary, A. K. Harding, J. A. and Tiwari, M. K. 2009. Data mining in manufacturing: a review based on the kind of knowledge. Journal of Intelligent Manufacturing, Vol. 20, Issue. 5, p. 501.

Vo, Bay and Le, Bac 2009. Mining traditional association rules using frequent itemsets lattice. p. 1401.

LI, DONG (HAOYUAN) LAURENT, ANNE and PONCELET, PASCAL 2009. DISCOVERING FUZZY UNEXPECTED SEQUENCES WITH CONCEPT HIERARCHIES. International Journal of Uncertainty, Fuzziness and Knowledge-Based Systems, Vol. 17, Issue. supp01, p. 113.

Download full list

Article contents

A survey of interestingness measures for knowledge discovery

Abstract

Access options

Article purchase

Temporarily unavailable

This article has been cited by the following publications. This list is generated based on data provided by Crossref.

Article contents

A survey of interestingness measures for knowledge discovery

Abstract

Access options

Article purchase

Temporarily unavailable

Save article to Kindle

Save article to Dropbox

Save article to Google Drive

Reply to: Submit a response

Your details

You have entered the maximum number of contributors

Conflicting interests