Istituto di Scienza e Tecnologie dell'Informazione     
Fagni T., Perego R., Silvestri F. A Highly Scalable Parallel Caching System for Web Search Engine Results. In: Euro-Par 2004 Parallel Processing, 10th International Euro-Par (Pisa, Italy, August 31-September 3 2004). Proceedings, pp. 347 - 354. Marco Danelutto, Marco Vanneschi, Domenico Laforenza (eds.). (Lecture Notes in Computer Science, vol. 3149). Springer, 2004.
This paper discusses the design and implementation of SDC, a new caching strategy aimed to e ciently exploit the locality present in the stream of queries submitted to a Web Search Engine. SDC stores the results of the most frequently submitted queries in a fixed-size read-only portion of the cache, while the queries that cannot be satis ed by the static portion compete for the remaining entries of the cache according to a given cache replacement policy. We experimentally demonstrated the superiority of SDC over purely static and dynamic policies by measuring the hit-ratio achieved on two large query logs by varying cache parameters and the replacement policy used. Finally, we propose an implementation optimized for concurrent accesses, and we accurately evaluate its scalability.
Subject Caching
Search engines
H.3.3 Information Search and Retrieval

Icona documento 1) Download Document PDF

Icona documento Open access Icona documento Restricted Icona documento Private


Per ulteriori informazioni, contattare: Librarian http://puma.isti.cnr.it

Valid HTML 4.0 Transitional