Tonellotto N., Ounis I., Macdonald C. Effect of different docid orderings on dynamic pruning retrieval strategies. In: SIGIR '11 - 34th International ACM SIGIR Conference on Research and Development in Information Retrieval (Beijing, China, 24-28 July 2011). Proceedings, pp. 1179-1180. ACM, 2011.
Document-at-a-time (DAAT) dynamic pruning strategies for information retrieval systems, such as MaxScore and Wand, can increase querying efficiency without decreasing effectiveness. Both work on posting lists sorted by ascending document identifier (docid). The order in which docids are assigned, and hence the order of postings in the posting lists, is known to have a noticeable impact on posting list compression. However, the resulting impact on dynamic pruning strategies is not well understood. In this poster, we examine the efficiency of these strategies across different docid orderings through experiments on the TREC ClueWeb09 corpus. We find that while the number of postings scored by dynamic pruning strategies does not vary markedly across docid orderings, the ordering still has a marked impact on mean query response time. Moreover, when docids are assigned by lexicographical URL ordering, the benefit to response time is more pronounced for Wand than for MaxScore.
URL: http://dl.acm.org/citation.cfm?id=2010108
DOI: 10.1145/2009916.2010108
Subject: Information Retrieval
H.3.3 Information Search and Retrieval

