PUMA
Istituto di Scienza e Tecnologie dell'Informazione     
Meghini C., Spyratos N. Computing intensions of digital library collections. In: 5th International Conference on Formal Concept Analysis. ICFCA (Clermont-Ferrand, France, February 12-16 2007). Proceedings, pp. 66 - 81. S. O. Kuznetsov, Stefan Schmidt (eds.). (Lecture Notes in Computer Science, vol. 4390). Springer-Verlag, 2007.
 
 
Abstract
(English)
We model a Digital Library as a formal context in which objects are documents and attributes are terms describing documents contents. A formal concept is very close to the notion of a collection: the concept extent is the extension of the collection; the concept intent consists of a set of terms, the collection intension. The collection intension can be viewed as a simple conjunctive query which evaluates precisely to the extension. However, for certain collections no concept may exist, in which case the concept that best approximates the extension must be used. In so doing, we may end up with a too imprecise concept, in case too many documents denoted by the intension are outside the extension. We then look for a more precise intension by exploring 3 different query languages: conjunctive queries with negation; disjunctions of negationfree conjunctive queries; and disjunctions of conjunctive queries with negation. We show that a precise description can always be found in one of these languages for any set of documents. However, when disjunction is introduced, uniqueness of the solution is lost. In order to deal with this problem, we define a preferential criterion on queries, based on the conciseness of their expression. We then show that minimal queries are hard to find in the last 2 of the three languages above.
DOI: 10.1007/978-3-540-70901-5
Subject Digital Library Collections
Formal Concept Analysis
H.3.7 Digital Libraries


Icona documento 1) Download Document PDF


Icona documento Open access Icona documento Restricted Icona documento Private

 


Per ulteriori informazioni, contattare: Librarian http://puma.isti.cnr.it

Valid HTML 4.0 Transitional