PUMA
Istituto di Scienza e Tecnologie dell'Informazione     
Lucchese C., Orlando S., Perego R. DCI Closed: a fast and memory efficient algorithm to mine frequent closed itemsets. In: IEEE ICDM Workshop on Frequent Itemset Mining Implementations (Brighton, UK, 1 November 2004). Proceedings, vol. 126 Bayardo, Roberto J. and Goethals, Bart and Zaki, Mohammed Javeed. CEUR-WS.org, 2004.
 
 
Abstract
(English)
One of the main problems raising up in the frequent closed itemsets mining problem is the duplicate detection. In this paper we propose a general technique for promptly detecting and discarding duplicate closed itemsets, without the need of keeping in the main memory the whole set of closed patterns. Our approach can be exploited with substantial performance benefits by any algorithm that adopts a vertical representation of the dataset. We implemented our technique within a new depth-first closed itemsets mining algorithm. The experimental evaluation demonstrates that our algorithm outperforms other state of the art algorithms like CLOSET+ and FPCLOSE.
Subject Frequet Closed Itemsets Mining
H.2.8 Database Applications


Icona documento 1) Download Document PDF


Icona documento Open access Icona documento Restricted Icona documento Private

 


Per ulteriori informazioni, contattare: Librarian http://puma.isti.cnr.it

Valid HTML 4.0 Transitional