Istituto di Scienza e Tecnologie dell'Informazione     
Tonellotto N., Ounis I., Macdonald C. Query efficiency prediction for dynamic pruning. In: LSDS-IR '11 - 9th Workshop on Large-scale and Distributed Information Retrieval (Glasgow, UK, 24-28 October 2011). Proceedings, pp. 3-8. ACM Press, 2011.
Dynamic pruning strategies are effective yet permit efficient retrieval by pruning, i.e., not fully scoring all postings of all documents matching a given query. However, the amount of pruning possible for a query can vary, so queries with similar properties (query length, total number of postings) can take different amounts of time to retrieve search results. In this work, we investigate the causes of inefficient queries, identifying reasons such as the balance between the informativeness of query terms and the distribution of retrieval scores within the posting lists. Moreover, we note the advantages of being able to predict the efficiency of a query, and propose various query efficiency predictors. Using 10,000 queries and the TREC ClueWeb09 category B corpus for evaluation, we find that combining predictors using regression can accurately predict query response time.
URL: http://dl.acm.org/citation.cfm?id=2064734&CFID=74367916&CFTOKEN=80133412
DOI: 10.1145/2064730.2064734
Subject: Information Retrieval
H.3.3 Information Search and Retrieval



For further information, contact: Librarian http://puma.isti.cnr.it
