Abstract
The role of data mining is to search "the space of candidate hypotheses" to offer solutions, whereas the role of statistics is to validate the hypotheses offered by the data-mining process. In this paper we propose Association Rules Networks (ARNs) as a structure for synthesizing, pruning, and analyzing a collection of association rules to construct candidate hypotheses. From a knowledge discovery perspective, ARNs allow for a goal-centric, context-driven analysis of the output of association rules algorithms. From a mathematical perspective, ARNs are instances of backward-directed hypergraphs. Using two extensive case studies, we show how ARNs and statistical theory can be combined to generate and test hypotheses.
Original language | English |
---|---|
Pages (from-to) | 260-279 |
Number of pages | 20 |
Journal | Statistical Analysis and Data Mining |
Volume | 1 |
Issue number | 4 |
DOIs | |
Publication status | Published - Mar 2009 |
Externally published | Yes |
Keywords
- Association rules
- Association rules network
- Data mining