PUMA
Istituto di Scienza e Tecnologie dell'Informazione     
Peters C., Picchi E. Capturing the comparable: a system for querying comparable text corpora. In: JADT 1995 - 3. Giornate Internazionali di Analisi Statistica dei Dati Testuali = 3rd international conference of Statistical analysis on Textual data = 3. Journ (Roma, 11-13 dicembre 1995). Proceedings, vol. 1 pp. 247-254. S. Bolasco, L. Lebart, A. Salem (eds.). CISU, 1995.
 
 
Abstract
(English)
We discuss the importance of bilingual and multilingual text corpora in many types of cross-language investigations and illustrate the differences between parallel and comparable text archives. The advantages of comparable over parallel data for certain kinds of contrastive linguistic studies are outlined. A prototype version of a system for querying comparable text archives is then described and examples of the first results are given. The system will form part of an integrated workstation for mono- and bilingual lexical and text database management and interrogation under development at the Istituto di Linguistica Computazionale, Pisa.
Subject
Textual Databases
Bilingual Reference Corpora
Contrastive Textology
H.2 Database management
H.3.3 Information Search and Retrieval



 


For further information, contact: Librarian http://puma.isti.cnr.it
