Istituto di Scienza e Tecnologie dell'Informazione     
Orlando S., Lucchese C., Perego R. Distributed mining of frequent closed itemsets: some preliminary results. In: International Workshop on High Performance and Distributed Mining (HPDM '05) (Newport Beach, California, USA., April 23, 2005).
In this paper we address the problem of mining frequent closed itemsets in a distributed setting. We gure out an environment where a transactional dataset is horizontally partitioned and stored in di erent sites. We assume that due to the huge size of datasets and privacy concerns dataset partitions cannot be moved to a centralized site where to materialize the whole dataset and perform the mining task. Thus it becomes mandatory to perform separate mining on each site, and then merge the local results do derive a global knowledge. This paper shows how frequent closed itemsets, mined independently in each site, can be merged in order to derive globally frequent closed itemsets. Unfortunately, such merging might produce a superset of all the frequent closed itemsets, while the associated supports could be smaller than the exact ones because some globally frequent closed itemsets might be not locally frequent in some partition. A post-processing
Subject Frequent Closed Itemsets Mining
H.2.8 Database Applications

Icona documento 1) Download Document PDF

Icona documento Open access Icona documento Restricted Icona documento Private


Per ulteriori informazioni, contattare: Librarian http://puma.isti.cnr.it

Valid HTML 4.0 Transitional