PUMA
Istituto di Scienza e Tecnologie dell'Informazione     
Silvestri F., Orlando S., Perego R. WINGS: A Parallel Indexer for Web Contents. In: Computational Science - ICCS 2004: 4th International Conference (Kraków, Poland, 6-9 June 2004). Proceedings, pp. 263-270. Marian Bubak, Geert Dick van Albada, Peter M. A. Sloot, et al. (eds.). (Lecture Notes in Computer Science, vol. 3036). Springer, 2004.
 
 
Abstract
(English)
In this paper we discuss the design of a parallel indexer for Web documents. By exploiting both data and pipeline parallelism, our prototype indexer efficiently builds a partitioned, compressed inverted index, a data structure commonly used by modern Web search engines. We discuss implementation issues and report the results of preliminary tests conducted on an SMP PC.
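As a rough illustration of the data structure named in the abstract, the following minimal sketch builds a document-partitioned inverted index whose posting lists are stored as variable-byte encoded gaps. It is not the WINGS implementation: the record does not state which compression scheme or partitioning strategy the paper uses, so variable-byte coding, partitioning by `doc_id % num_partitions`, and all function names here are assumptions chosen for illustration. Only the data-parallel part is sketched (each partition is indexed independently); the pipeline stages are not modelled.

```python
from collections import defaultdict
from concurrent.futures import ThreadPoolExecutor


def vbyte_encode(numbers):
    """Variable-byte encode non-negative integers (assumed scheme, low 7 bits first)."""
    out = bytearray()
    for n in numbers:
        while n >= 0x80:
            out.append((n & 0x7F) | 0x80)  # high bit set: more bytes follow
            n >>= 7
        out.append(n)  # high bit clear: last byte of this number
    return bytes(out)


def vbyte_decode(data):
    """Inverse of vbyte_encode."""
    numbers, n, shift = [], 0, 0
    for byte in data:
        n |= (byte & 0x7F) << shift
        if byte & 0x80:
            shift += 7
        else:
            numbers.append(n)
            n, shift = 0, 0
    return numbers


def build_partition(docs):
    """Build one compressed inverted index from a {doc_id: text} mapping."""
    postings = defaultdict(list)
    for doc_id in sorted(docs):
        for term in set(docs[doc_id].lower().split()):
            postings[term].append(doc_id)  # doc ids arrive in ascending order
    index = {}
    for term, ids in postings.items():
        # Store the first id, then the differences between consecutive ids.
        gaps = [ids[0]] + [b - a for a, b in zip(ids, ids[1:])]
        index[term] = vbyte_encode(gaps)
    return index


def lookup(index, term):
    """Decode the posting list of a term in one partition."""
    ids, total = [], 0
    for gap in vbyte_decode(index.get(term, b"")):
        total += gap
        ids.append(total)
    return ids


def build_partitioned_index(docs, num_partitions):
    """Split documents across partitions and index each one independently."""
    parts = [dict() for _ in range(num_partitions)]
    for doc_id, text in docs.items():
        parts[doc_id % num_partitions][doc_id] = text  # hypothetical assignment rule
    with ThreadPoolExecutor() as pool:
        return list(pool.map(build_partition, parts))


def search(partitions, term):
    """Query every partition and merge the results."""
    return sorted(d for index in partitions for d in lookup(index, term))
```

For example, with `docs = {1: "web search engine", 2: "parallel web indexer", 4: "web index"}`, calling `search(build_partitioned_index(docs, 2), "web")` merges the posting lists of both partitions. The thread pool only shows that partitions can be built independently; in CPython it would not by itself give a CPU-bound speedup.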
Subject: Indexing web documents
H.3.3 Information Search and Retrieval
H.3.4 Systems and Software





 


For further information, contact: Librarian http://puma.isti.cnr.it
